Shubham Agarwal explains the difference between three Spark data structures:
DataFrame(DF) –
A DataFrame is an abstraction that gives a schema view of data: it presents the data as columns, each with a name and type information. We can think of the data in a DataFrame like a table in a database.
As with RDDs, execution in a DataFrame is lazily triggered: transformations are only computed once an action asks for a result.
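To make the two ideas above concrete (a schema view and lazy execution), here is a minimal pure-Python sketch. It is not Spark code and does not use Spark's API: `ToyFrame`, its `plan` field, and the sample rows are hypothetical names invented for illustration. It only mimics the pattern that Spark follows, where transformations such as `filter` and `select` are recorded and an action such as `collect` triggers the work.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


@dataclass
class ToyFrame:
    """Hypothetical stand-in for a DataFrame: rows plus a named, typed schema."""
    rows: List[Row]
    schema: Dict[str, type]
    # Recorded transformations that have not been executed yet.
    plan: List[Callable[[List[Row]], List[Row]]] = field(default_factory=list)

    def filter(self, predicate: Callable[[Row], bool]) -> "ToyFrame":
        # Transformation: record the step and return a new frame; nothing runs here.
        step = lambda rs: [r for r in rs if predicate(r)]
        return ToyFrame(self.rows, self.schema, self.plan + [step])

    def select(self, *columns: str) -> "ToyFrame":
        # Transformation: the schema narrows immediately, the data is untouched.
        new_schema = {c: self.schema[c] for c in columns}
        step = lambda rs: [{c: r[c] for c in columns} for r in rs]
        return ToyFrame(self.rows, new_schema, self.plan + [step])

    def collect(self) -> List[Row]:
        # Action: run every recorded step, in order, and return the rows.
        result = self.rows
        for step in self.plan:
            result = step(result)
        return result


calls: List[str] = []  # records each time the predicate actually runs


def is_adult(row: Row) -> bool:
    calls.append(row["name"])
    return row["age"] >= 18


people = ToyFrame(
    rows=[{"name": "Ana", "age": 34}, {"name": "Ben", "age": 12}],
    schema={"name": str, "age": int},
)

adults = people.filter(is_adult).select("name")
calls_before_action = len(calls)  # the predicate has not run yet
result = adults.collect()         # the action triggers the recorded plan
```

Before `collect` is called, the schema of `adults` is already known (a single `name` column of type `str`), yet the predicate has not touched any row. Having the schema available ahead of execution is what lets a real engine plan a query before running it.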
Read on to learn more about Resilient Distributed Datasets (RDDs), DataFrames, and Datasets.