Dask DataFrames Tutorial: Best practices for larger-than-memory dataframes


Learn best practices for larger-than-memory dataframes. Investigate Uber/Lyft data and learn to do the following:
- Manipulate Parquet files and optimize queries
- Navigate inconvenient file sizes and data types
- Tune Parquet storage, build features, and explore a challenging dataset with Pandas and Dask

Notebook here: https://github.com/coiled/dask-tutori...

Tutorial repo: https://github.com/coiled/dask-tutorial/

---
Scale Your Python Workloads with Dask and Coiled.
Coiled is a Dask company. With Coiled's rock-solid infrastructure, you can quickly and securely create Dask clusters in your cloud account.

Learn more about Coiled and get started for free
https://coiled.io/start

More content on our blog:
https://coiled.io/blog
