Matt Eland is speaking my language (F#):
One of the most common tasks with data roles is the need to perform exploratory data analysis (EDA).
With EDA a data scientist, data analyst, or other data-oriented programmer can:
- Understand the value distributions of their data
- Identify outliers and data anomalies
- Visualize correlations, trends, and relationships between multiple variables
Exploratory data analysis usually involves:
- Loading the data into a DataFrame
- Performing descriptive statistics to identify the raw shape of the data
- Visualizing variables of interest on their own or with other variables.
In this article I’ll walk you through the process of loading data from a sample dataset into a
Microsoft.Data.Analysis
DataFrame
(the kind featured in ML.NET). Next, we’ll look at the descriptive statistics theDataFrame
class provides and then explore the process of creating some simple visualizations with Plotly.NET.
Read on for the scenario and analysis.