Working With Missing Values In R

Kevin Feasel



Anisa Dhana has a few examples of ways we can work with data containing missing values in R:

Imputation is a complex process that requires a good knowledge of your data. For example, it is crucial to know whether the missing is at random or not before you impute the data. I have read a nice tutorial which visualize the missing data and help to understand the type of missing, and another post showing how to impute the data with MICE package.

In this short post, I will focus on management of the missing data using the tidyverse package. Specifically, I will show how to manage missings in the long data format (i.e., more than one observation for id).

Anisa shows a few different techniques, depending upon what you need to do with the data.  I’d caution about using mean in the second example and instead typically prefer median, as replacing missing values with the median won’t alter the distribution in the way that it can with mean.

Related Posts

Icon Maps in R

Laura Ellis shows how you can build maps full of little icons: That was ok, but we should try to make the images more aesthetically pleasing using the magick package. We make each image transparent with the image_transparent() function. We can also make the resulting image a specific color with image_colorize(). I then saved the […]

Read More

R User Salaries By Country

Capri Granville shares a chart showing a box plot of salaries for professional R users by country: Interesting analysis done in R, about salaries of R developers broken down by country, featuring salary range and median salary.  The dataset consists of survey answers from nearly 90,000 respondents. About 5,000 of them reported using R for “extensive development […]

Read More


December 2018
« Nov Jan »