RDDs, DataFrames and Datasets in Apache Spark - NE Scala 2016

Traditionally, Apache Spark jobs have been written using Resilient Distributed Datasets (RDDs), a Scala Collections-like API. RDDs are type-safe, but they can be problematic: it's easy to write a suboptimal job, and RDDs are significantly slower in Python than in Scala. DataFrames address some of these problems, and they're much faster, even in Scala; but DataFrames aren't type-safe, and they're arguably less flexible.
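The trade-off above can be sketched in a few lines of Scala. This is a minimal illustration (not from the talk), assuming a Spark shell where a `SparkSession` named `spark` is already in scope; the names `Person`, `people`, and the filter predicate are made up for the example:

```scala
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.DataFrame

case class Person(name: String, age: Int)
val people = Seq(Person("Ann", 34), Person("Bob", 17))

// RDD: typed, so `_.age` is checked at compile time, but the lambda is
// opaque to Spark, which cannot optimize or push down the filter.
val rdd: RDD[Person] = spark.sparkContext.parallelize(people)
val adultsRdd = rdd.filter(_.age > 21)

// DataFrame: the query goes through the Catalyst optimizer, but rows are
// untyped, so a typo like "agge > 21" would only fail at runtime.
val df: DataFrame = spark.createDataFrame(people)
val adultsDf = df.filter("age > 21")
```

Note that at the time of the talk (Spark 1.6) the entry point was `SQLContext` rather than `SparkSession`; the idea is the same.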

Enter Datasets, a type-safe, object-oriented programming interface that works with the DataFrames API, provides some of the benefits of RDDs, and can be optimized via the Catalyst optimizer.
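As a rough sketch of how the two benefits combine (again assuming a Spark shell with `spark` in scope, and a hypothetical `Person` case class), the same query as a Dataset looks like this:

```scala
import spark.implicits._  // brings in the encoders for case classes

case class Person(name: String, age: Int)

// toDS() builds a Dataset[Person]: the schema is derived from the case class.
val ds = Seq(Person("Ann", 34), Person("Bob", 17)).toDS()

// The lambda is compile-time checked against Person, like an RDD, yet the
// plan still runs through Catalyst, like a DataFrame.
val adults = ds.filter(_.age > 21)
adults.show()
```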

This talk will briefly recap RDDs and DataFrames, introduce the Datasets API, and then, through a live demonstration, compare the performance of all three against the same non-trivial data source.

Talk by Brian Clapper
March 4th, 2016

http://www.nescala.org/

Produced by NewCircle - Spark Training & Resources:
https://newcircle.com
