ML Infra + Apps

Polars: Blazingly fast single-node data processing

Tuesday, August 23
1:30 PM - 2:00 PM

This talk introduces Polars, a blazingly fast DataFrame library written in Rust on top of Apache Arrow. It is designed from the ground up for OLAP processing and draws on accumulated research in data processing. Today's CPUs come with many cores, and their superscalar designs and SIMD registers allow for even more parallelism. Polars is written to fully utilize CPUs of this generation. Besides blazingly fast algorithms, a cache-efficient memory layout, and multi-threading, it includes a lazy query engine, which allows Polars to apply several optimizations that may improve query time and memory usage. Polars focuses entirely on single-node performance. Combined with a distribution framework like Ray, it could significantly improve the performance of scalable data querying.

About Ritchie

Ritchie Vink is the author of the Polars DataFrame library. He has a background in machine learning, software engineering, and structural engineering. In recent years his focus has been entirely on data processing.

Ritchie Vink

Machine Learning Engineer, Xomnia

Ready to Register?

Come connect with the global community of thinkers and disruptors who are building and deploying the next generation of AI and ML applications.


Join the Conversation

Ready to get involved in the Ray community before the conference? Ask a question in the forums. Open a pull request. Or share why you’re excited with the hashtag #RaySummit on Twitter.