19 Understand and Optimize Shuffle in Spark

Описание к видео 19 Understand and Optimize Shuffle in Spark

Video explains - How Shuffle works in Spark ? How to optimize Shuffle in Spark ?

Chapters
00:00 - Introduction
00:20 - Understand Pipelining in Spark
02:18 - Demonstration
11:40 - Performance with Partitioned Data
14:19 - Few More Tips

Local PySpark Jupyter Lab setup -    • 03 Data Lakehouse | Data Warehousing ...  
Python Basics - https://www.learnpython.org/
GitHub URL for code - https://github.com/subhamkharwal/pysp...

The series provides a step-by-step guide to learning PySpark, a popular open-source distributed computing framework that is used for big data processing.

New video in every 3 days ❤️

#spark #pyspark #python #dataengineering

Комментарии

Информация по комментариям в разработке