The Theory Behind cdata

Kevin Feasel



John Mount has a video explaining the concepts behind cdata:

We also have two really nifty articles on the theory and methods:

Please give it a try!

Click through for the video, which I found very helpful in tying together a number of data transformation operations (pivoting, unpivoting, one-hot encoding, etc.).

Related Posts

Principal Component Analysis With Faces

Mic at The Beginner Programmer shows us how to creepy PCA diagrams with human faces: PCA looks for a new the reference system to describe your data. This new reference system is designed in such a way to maximize the variance of the data across the new axis. The first principal component accounts for as […]

Read More

Sorting With data.table Versus dplyr

John Mount shows us that data.table is way faster for sorting than dplyr‘s arrange function: Notice on the above semi-log plot the run time ratio is growing roughly linearly. This makes sense: data.table uses a radix sort which has the potential to perform in near linear time (faster than the n log(n) lower bound known comparison sorting) for a range of […]

Read More


January 2018
« Dec Feb »