Faster DataFusion with StringView - Xiangpeng Hao (Aug 15, 2024)

Описание к видео Faster DataFusion with StringView - Xiangpeng Hao (Aug 15, 2024)

Xiangpeng Hao summarizes what Apache Arrow StringView is, why it can improve performance, and the practical challenges overcome when realizing the potential.

Xiangpeng Hao presents his 2024 Summer Intern project at @influxdata8893: improving performance in Apache DataFusion, the query engine used in InfluxDB 3.0.

Talk Abstract: We implemented a new string representation—StringView—in the Rust implementation of Apache Arrow, arrow-rs and integrated it into Apache DataFusion, significantly accelerating string-intensive queries in the ClickBench benchmark by 20%- 200%.



‪@influxdata8893‬

Комментарии

Информация по комментариям в разработке