Accelerating Analytics with Apache Arrow - Neal Richardson

January 30, 2020 Neal Richardson

The Apache Arrow project is a cross-language development platform for in-memory data designed to improve system performance, memory use, and interoperability. This talk presents recent developments in the 'arrow' package, which provides an R interface to the Arrow C++ library. We'll cover the goals of the broader Arrow project, how to get started with the 'arrow' package in R, some general concepts for working with data efficiently in Arrow, and a brief overview of upcoming features.

 

View Materials: slides

About the Author

Neal Richardson

Currently Director of Engineering at Ursa Labs / RStudio. Previously led product and engineering at Crunch.io. Ph.D. in Political Science from the University of California, Berkeley.

Follow on Twitter Follow on Linkedin More Content by Neal Richardson
Previous Video
renv: Project Environments for R - Kevin Ushey
renv: Project Environments for R - Kevin Ushey

The renv package helps you create reproducible environments for your R projects. With renv, you can make yo...

Next Video
What's new in TensorFlow for R - Daniel Falbel
What's new in TensorFlow for R - Daniel Falbel

TensorFlow is the most popular open-source platform for machine learning and it's ecosystem is evolving inc...