Recursive Subdirectory Reads with Hive

The Hadoop in Real World team wants to dig deeper:

Let’s say you have a Hive table that points at a location or directory containing several subdirectories, and each subdirectory has files underneath it.

When you query the table, however, Hive reads only the files in the top-level folder and ignores all the files under the subdirectories.

Click through for the solution.
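
For context, here is a minimal sketch of one commonly used way to make Hive descend into subdirectories. It relies on two session properties and may not be exactly what the linked post recommends. It assumes Hive running on MapReduce; property names and defaults can differ across Hive and Hadoop versions, and `my_table` is a hypothetical table name used only for illustration.

```sql
-- Assumption: session-level settings for Hive on MapReduce.
-- Property names and defaults can vary by Hive/Hadoop version.

-- Tell Hive that tables/partitions may contain subdirectories.
SET hive.mapred.supports.subdirectories=true;

-- Tell the underlying input format to list input paths recursively.
SET mapred.input.dir.recursive=true;

-- Hypothetical table name, for illustration only.
-- With both settings on, files under the subdirectories should be counted too.
SELECT COUNT(*) FROM my_table;
```

Because these are `SET` statements, they apply only to the current session; making the behavior permanent would mean setting the same properties in the cluster configuration instead.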