Press "Enter" to skip to content

Creating a Spark DataFrame Ex Nihilo

Rahul Agarwal shows how you can gin up your own Spark DataFrame:

In broad terms, a DataFrame(DF) is a distributed, table-like structure with rows and columns and has a well-defined schema. DataFrames can be constructed from a wide variety of sources such as structured data files, tables in Hive, external databases, or existing RDDs.

Click through for an example in Scala.