Using PolyBase Without Local Tables

I have a post up on PolyBase without local tables:

With this combination of external data source, file format, and table, we can create our next table. This is a data set which lives in Spark and contains…well, the same data. What we join together isn’t that interesting, after all.

The end result is you get your data back as though these were all local, on-prem tables.

Related Posts

PolyBase and Dockerized Hadoop

I have a solution to a problem which vexed me for quite some time: Quite some time ago, I posted about PolyBase and the Hortonworks Data Platform 2.5 (and later) sandbox. The summary of the problem is that data nodes in HDP 2.5 and later are on a Docker private network. For most cases, this works […]

Read More

PolyBase and External Column Names

I have another post looking at external columns on PolyBase V2 data sources: I’m going to use external two tables in this experiment. In the left corner, we have some ORC files stored in Azure Blob Storage which we’ll represent as FireIncidents2017. In the right corner, we have data stored in a remote SQL Server […]

Read More

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Categories

September 2019
MTWTFSS
« Aug  
 1
2345678
9101112131415
16171819202122
23242526272829
30