Using Apache Spark to Solve Sessionization Problem in Batch and Streaming - Bartosz Konieczny Canal+

#SparkAISummit


Video description

Analyzing sessions yields useful feedback about what works and what does not. Implementing them is not easy, though, because of the data issues and operational costs you will meet sooner or later. In this talk I will present two approaches to computing sessions with Apache Spark and AWS services. The first is a batch approach built on Spark SQL; the second is a streaming approach built on the Structured Streaming module. During the talk I will cover the problems you may encounter when creating sessions: late data, incomplete datasets, duplicated data, reprocessing, and fault tolerance. I will try to solve them and show how Apache Spark features and AWS services (EMR, S3) can help. After the talk you should be aware of the problems you may encounter with session pipelines and understand how to address them with Apache Spark features such as watermarks, the state store, and checkpoints, and how to integrate your code with a cloud provider.
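To make the idea of computing sessions concrete, here is a minimal sketch in plain Python rather than Spark. It is not the speaker's implementation. It assumes a hypothetical event schema of `(user_id, event_time_seconds, page)` and a hypothetical 30-minute inactivity gap, and it shows only two of the concerns listed above: dropping duplicated events and ordering by event time so that out-of-order input still lands in the right session.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical event schema: (user_id, event_time_seconds, page)
Event = Tuple[str, int, str]


def sessionize(events: Iterable[Event], gap_seconds: int = 1800) -> Dict[str, List[List[Event]]]:
    """Group each user's events into sessions separated by inactivity gaps."""
    per_user: Dict[str, List[Event]] = defaultdict(list)
    # set() drops exact duplicates, e.g. events delivered twice by the source
    for event in set(events):
        per_user[event[0]].append(event)

    sessions: Dict[str, List[List[Event]]] = {}
    for user_id, user_events in per_user.items():
        # Order by event time (then page) so arrival order does not matter
        user_events.sort(key=lambda e: (e[1], e[2]))
        user_sessions: List[List[Event]] = []
        for event in user_events:
            last_time = user_sessions[-1][-1][1] if user_sessions else None
            if last_time is not None and event[1] - last_time <= gap_seconds:
                # Still within the inactivity gap: extend the current session
                user_sessions[-1].append(event)
            else:
                # First event, or gap exceeded: start a new session
                user_sessions.append([event])
        sessions[user_id] = user_sessions
    return sessions


events = [
    ("u1", 600, "cart"),
    ("u1", 0, "home"),       # arrives out of order
    ("u1", 600, "cart"),     # exact duplicate
    ("u1", 5000, "home"),    # more than 30 minutes after the previous event
    ("u2", 100, "home"),
]
result = sessionize(events)
```

In a streaming setting the same grouping has to be done incrementally, which is where the watermarks, state store, and checkpoints mentioned in the description come in; this sketch does not model those.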

About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unifie...

Connect with us:
Website: https://databricks.com
Facebook: databricksinc
Twitter: databricks
LinkedIn: databricks
Instagram: databricksinc

Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here: https://databricks.com/databricks-nam...
