Dask in 15 Minutes | Machine Learning & Data Science Open-source Spotlight #5

Описание к видео Dask in 15 Minutes | Machine Learning & Data Science Open-source Spotlight #5

Should you use Dask or PySpark for Big Data? 🤔

Dask is a flexible library for parallel computing in Python.
In this video I give a tutorial on how to use Dask for parallel computing, handling Big Data and integration with Deep Learning frameworks.
I compare Dask to PySpark and list the relative advantages I see of choosing Dask as your primary choice for Big Data handling.

Link to Notebook:
https://nbviewer.jupyter.org/github/d...

With these "Machine Learning & Data Science Open Source Spotlight" weekly videos, my objective is to introduce many game-changing libraries, which I believe many people can benefit from.

I would love to hear your feedback!
Did this video teach you something new?
Are there any open-source libraries you think deserve a spotlight?
Let me know in the comments! 👇🏻

Комментарии

Информация по комментариям в разработке