Accelerating Analytics with Apache Arrow

The Apache Arrow project is a cross-language development platform for in-memory data designed to improve system performance, memory use, and interoperability.

Accelerating Analytics with Apache Arrow

January 30, 2020

The Apache Arrow project is a cross-language development platform for in-memory data designed to improve system performance, memory use, and interoperability. This talk presents recent developments in the 'arrow' package, which provides an R interface to the Arrow C++ library. We'll cover the goals of the broader Arrow project, how to get started with the 'arrow' package in R, some general concepts for working with data efficiently in Arrow, and a brief overview of upcoming features.

 

View Materials: slides


About the speaker

Currently Director of Engineering at Ursa Labs / RStudio. Previously led product and engineering at Crunch.io. Ph.D. in Political Science from the University of California, Berkeley.