The tidymodels framework is a collection of R packages for modeling and machine learning using tidyverse principles. Whether you are just starting out today or have years of experience with ML, tidymodels offers a consistent, flexible framework for your work. In this talk, learn how tidymodels has been designed to promote ergonomic, effective, and safe modeling practice. We will discuss how to think about the steps of building a model from beginning to end, how to fluently use different modeling and feature engineering approaches, how to avoid common pitfalls of modeling like overfitting and data leakage, and how to version and deploy reliable models trained in R.
09:00 AM to 10:30 AMLive Stream
Julia Silge is a data scientist and software engineer at RStudio PBC where she works on open source tools for machine learning and MLOps. She holds a PhD in astrophysics and has worked as a data scientist in tech and the nonprofit sector, as well as a technical advisory committee member for the US Bureau of Labor Statistics. She is a coauthor of Tidy Text Mining with R, Supervised Machine Learning for Text Analysis in R, and Tidy Modeling with R. An international keynote speaker and a real-world practitioner focusing on data analysis and machine learning, Julia loves text analysis, making beautiful charts, and communicating about technical topics with diverse audiences.
Max Kuhn is a software engineer at RStudio. He is currently working on improving R’s modeling capabilities. He was a Director of Nonclinical Statistics at Pfizer Global R&D in Connecticut. He was applying models in the pharmaceutical and diagnostic industries for over 18 years. Max has a Ph.D. in Biostatistics. Max is the author of numerous R packages for techniques in machine learning and reproducible research and is an Associate Editor for the Journal of Statistical Software. He, and Kjell Johnson, wrote the book Applied Predictive Modeling, which won the Ziegel award from the American Statistical Association, which recognizes the best book reviewed in Technometrics in 2015. Their latest book, Feature Engineering and Selection, was published in 2019.