About Me
I’m a software engineer with more than 30 years of professional experience across a wide range of industries, including banking, media, insurance, hardware, and software. I have co-founded two startups (one was acquired, one failed).
I have been specializing in query engines and distributed systems for the past 10 years.
I started the DataFusion and Ballista query engine projects and donated both to the Apache Software Foundation as part of the Apache Arrow project, where I am now a PMC member. I also donated the initial Rust implementation of Apache Arrow.
I am the original author of the sqlparser-rs project, which is the leading open-source SQL parser for the Rust ecosystem.
I currently work for NVIDIA, where I contribute to the RAPIDS Accelerator for Apache Spark, which GPU-accelerates Spark SQL and ETL jobs without code changes. I also contribute to Dask SQL, which now uses DataFusion for query planning and optimization.
Technical Writing
- How Query Engines Work: an introductory guide to query engines
Talks
- 2021-07-08 Apache Arrow, Ballista and Big Data in Rust @ Data Science at Home
- 2021-05-14 Ballista: Distributed Compute with Apache Arrow and Rust @ SF Big Analytics Meetup
- 2021-02-24 Ballista: Distributed Compute with Rust and Apache Arrow @ New York Open Statistical Meetup
- 2020-10-20 Data Science with Rust - Arrow, DataFusion, and Ballista @ Denver Rust Meetup