Matthew Topol- State of the Apache Arrow Ecosystem:How your project can leverage Arrow! | Øredev2023

Описание к видео Matthew Topol- State of the Apache Arrow Ecosystem:How your project can leverage Arrow! | Øredev2023

Session description:
If you work with data in any development capacity you've probably heard of Apache Arrow at this point, even if you don't exactly know what it is. Apache Arrow is an in-memory data format designed to accelerate analytics and allow the exchange of data across big data systems easily. In other words, it makes your workflows faster! The data community is increasingly seeing more tools adopt Arrow (or a very Arrow-like format) as their internal memory representation (AlloyDB, DuckDB, BigQuery, Velox, etc..) because of the benefits to utilizing columnar representations of data for analytics and data transport. This talk will explain what Arrow is for the uninitiated and then go on to examine the current ecosystem of tools, libraries and utilities that surround it. Regardless of whether you're working in Python, C++, Go, Rust, Java, R, or whatever, there's likely tools and libraries that can help you leverage Arrow for whatever your data project is. Hopefully coming out of here you'll have lots of great ideas to make your data workflows faster, more efficient, and easier to develop!

Connect with us!
Website: https://oredev.org
LinkedIn:   / oredev  
Twitter:   / oredev  
Facebook:   / oredev  
Instagram:   / oredev  

Комментарии

Информация по комментариям в разработке