Data wrangling with R and RStudio

October 24, 2016 Garrett Grolemund

Download Materials

A recent article from the New York Times said “Data scientists, according to interviews and expert estimates, spend from 50 percent to 80 percent of their time mired in the mundane labor of collecting and preparing data, before it can be explored for useful information.”

Before an R program can look for answers, your data must be cleaned up and converted to a form that makes information accessible. In this webinar, you will learn how to use the `dplyr` and `tidyr` packages to optimise the data wrangling process. You’ll learn to:

  • Spot the variables and observations within your data
  • Quickly derive new variables and observations to explore
  • Reshape your data into the layout that works best for R
  • Join multiple data sets together
  • Use group-wise summaries to explore hidden levels of information within your data

About the Author

Garrett Grolemund

Garrett is a data scientist and master instructor for RStudio. He excels at teaching, statistics, and teaching statistics. He wrote the popular lubridate package and is the author of Hands On Programming with R and the upcoming book, Data Science with R, from O’Reilly Media. He holds a PhD in Statistics and specializes in Data Visualization.

Follow on Twitter Visit Website More Content by Garrett Grolemund
Previous Video
Sparklyr: Using Spark with RMarkdown
Sparklyr: Using Spark with RMarkdown

RStudio recently announced a new open-source package called sparklyr that facilitates a connection between ...

Next Video
Introducing Shiny Gadgets: Interactive Tools
Introducing Shiny Gadgets: Interactive Tools

A Shiny Gadget is an interactive tool that enhances your R programming experience.

×

Please register to receive regular updates on our webinars.

!
Thank you!
Error - something went wrong!