R Views
Recent content on R Views, an R community blog edited by RStudio

Calculating AlwaysValid pvalues in R
In this post, we will develop a framework for alwaysvalid inference based on the paper Always Valid Inference: Continuous Monitoring of A/B Tests (2019 Johari, Pekelis, Walsh). Using an...

Tech Dividends, Part 2
In a previous post, we explored the dividend history of stocks included in the SP500, and we followed that with exploring the dividend history of some NASDAQ tickers. Today’s post is a short...

Plumber Logging
The plumber R package is used to expose R functions as API endpoints. Due to plumber’s incredible flexibility, most major API design decisions are left up to the developer. One important...

Tech Dividends, Part 1
In a previous post, we explored the dividend history of stocks included in the SP500. Today, we’ll extend that analysis to cover the Nasdaq because, well, because in the previous post I said I...

Validating Type I and II Errors in A/B Tests in R
In this post, we seek to develop an intuitive sense of what type I (falsepositive) and type II (falsenegative) errors represent when comparing metrics in A/B tests, in order to gain an...

June 2019 "Top 40" R Packages
Approximately 136 new packages stuck to CRAN in June. (This number is difficult to nail down with certainty because packages may be removed from CRAN after sitting there for a few days.) Here are...

An R Users Guide to JSM 2019
If you are like me, and rather last minute about making a plan to get the most out of a large conference, you are just starting to think about JSM 2019 which will begin in just a few days. My...

Three Strategies for Working with Big Data in R
For many R users, it’s obvious why you’d want to use R with big data, but not so obvious how. In fact, many people (wrongly) believe that R just doesn’t work very well for big data. In this...

Dividend Sleuthing with R
Welcome to a midsummer edition of Reproducible Finance with R. Today, we’ll explore the dividend histories of some stocks in the S&P 500. By way of history for all you young tech IPO and crypto...

Imagine your Data Before You Collect It
As data scientists, we are often presented with a dataset and are asked to use it to produce insights. We use R to wrangle, visualize, model, and produce tables and plots for sharing or...

May 2019: "Top 40" New CRAN Packages
Two hundred twentytwo new packages made it to CRAN in May, and it was more of an effort than usual to select the “Top 40”. Nevertheless, here they are in nine categories, Computational Methods,...

A Gentle Introduction to tidymodels
Recently, I had the opportunity to showcase tidymodels in workshops and talks. Because of my vantage point as a user, I figured it would be valuable to share what I have learned so far. Let’s...

Equal Size kmeans
We were recently presented with a problem where the decision maker wanted to understand how their data would naturally group together. The classic technique of kmeans clustering was a natural...

reticulate, virtualenv, and Python in Linux
Roland Stevenson is a data scientist and consultant who may be reached on Linkedin. reticulate is an R package that allows us to use Python modules from within RStudio. I recently found this...

Introducing DeclareDesign, a Platform for Research Design
Graeme Blair is an Assistant Professor of Political Science at UCLA. Jasper Cooper is a Postdoctoral Research Associate at the KahnemanTreisman Center for Behavioral Science and Public Policy at...

April 2019: "Top 40" New CRAN Packages
One hundred eightyseven new packages made it to CRAN in April. Here are my picks for the “Top 40”, organized into ten categories: Biotechnology, Data, Econometrics, Machine Learning, Medicine,...

Momentum Investing with R
After an extended hiatus, Reproducible Finance is back! We’ll celebrate by changing focus a bit and coding up an investment strategy called Momentum. Before we even tiptoe in that direction,...

Analysing the HIV pandemic, Part 4: Classification of lab samples
Andrie de Vries is the author of “R for Dummies” and a Solutions Engineer at RStudio Phillip (Armand) Bester is a medical scientist, researcher, and lecturer at the Division of Virology,...

Analysing the HIV pandemic, Part 3: Genetic diversity
Phillip (Armand) Bester is a medical scientist, researcher, and lecturer at the Division of Virology, University of the Free State, and National Health Laboratory Service (NHLS), Bloemfontein,...

Virtual Morel Foraging with R
Bryan Lewis is a mathematician, R developer and mushroom forager. Morchella Americana by Bryan W. Lewis, see https://ohiomushroomsociety.wordpress.com/ It’s that...
 Loading More...