Curated SQL – Page 445 – A Fine Slice Of SQL Server

Lessons Learned from Azure Data Factory Integrating with DB/2 on Mainframe

Published 2023-08-14 by Kevin Feasel

I’ve done a few BI integration projects extracting data from ERPs running on IBM Db2. Most of the implementations would use a hybrid architecture where the ERP would be running on an on-prem mainframe while the data was loaded in Microsoft Azure. Here are a few tips if you’re facing this challenge:

Click through for five major points. Surprisingly, one of them isn’t “Avoid DB/2 like the plague.”

Comments closed

Power BI and Eventual Browser Development

Published 2023-08-14 by Kevin Feasel

Chris Webb talks about the present and the future:

Turning the question around, however, leads you to some aspects of the question that haven’t been fully explored. Instead of asking “Can I run Power BI Desktop on my Mac?”, you can instead ask “Can I do all of my Power BI development using only a browser?”. At Microsoft our long-term goal is to make all Power BI development web-based, but how close are we to that goal?

Read on for Chris’s answer.

Comments closed

SQL Server on Linux 2022 Available in Preview

Published 2023-08-14 by Kevin Feasel

Amit Khandelwal has an update on SQL Server on Linux:

We are glad to announce that SQL Server 2022 is now available in preview mode for both Red Hat Enterprise Linux (RHEL) 9 and Ubuntu 22.04. For this preview, only Evaluation edition is available, which is limited to 180 days starting Thursday, July 27th, 2023.

In your Dev/Test environments, you may now take advantage of the most recent SQL Server 2022 improvements on both RHEL 9 and Ubuntu 22.04. Currently, production workloads on RHEL 9 and Ubuntu 22.04 are not supported by the SQL Server 2022 preview packages. You can run the production workloads for SQL Server 2022 on RHEL 8 and Ubuntu 22.04 and they are fully supported.

I’m going to wait until it’s actually available for real, not just in preview.

Comments closed

Shuffling Columns: R and Python Options

Published 2023-08-11 by Kevin Feasel

Tom Shafer does some testing:

Last year I benchmarked a few ways of shuffling columns in a data.table, but what about pandas? I didn’t know, so let’s revisit those tests and add a few more operations! pandas winds up being much more competitive than I expected.

Click through for those findings and the code Tom used for the task. H/T R-Bloggers.

Comments closed

Things that Make SQL Server Queries Run Single-Threaded

Published 2023-08-11 by Kevin Feasel

Erik Darling sets MAXDOP to 1:

It’s August, and that means one thing: Family Vacation. I’m taking this month off from regular blogging, and posting some of my paid beginner training content for you to enjoy, while I enjoy not blogging.

Click through for a good video and no text I can use as a helpful graf.

Comments closed

Controlling Sort Order when using Field Parameters in Power BI

Published 2023-08-11 by Kevin Feasel

Erik Svensen gets things organized:

This might be the default way of doing it but if you always want the chart to be sorted by a particular measure even though you are showing another measure then you can use this workaround. You should of course inform the user about the chosen sort order for instance in the subtitle or similar.

Click through to see what Erik has in mind.

Comments closed

Diving into the Microsoft Fabric Copy Activity

Published 2023-08-11 by Kevin Feasel

Reza Rad does more than copies:

Copy Activity is one of the most commonly used activities in Microsoft Fabric’s Data Factory Pipeline. The Copy Activity copies the data from a source to a destination. However, there is more to that rather than just a simple copy. In this article, you will learn what Copy Activity is, its rationale, how it works, and its configuration options.

Reza has a video, as well as a demo-heavy full-length article on the topic.

Comments closed

Executing SQL Queries in Files against Postgres

Published 2023-08-11 by Kevin Feasel

Salman Ahmed automates query execution:

In PostgreSQL, there are several ways to execute queries, and one of them is by executing queries from SQL files. This approach allows users to manage and store their SQL queries separately and make debugging and development simpler. Using SQL files also helps in replication of database schemas. This blog discusses how to execute queries from SQL files in PostgreSQL.

Read on to see how you can use the psql command line tool to do just that.

Comments closed

Creating a Simple Date Dimension in Databricks

Published 2023-08-10 by Kevin Feasel

Chen Hirsh builds a table:

A date dimension is extremely useful and is required by most BI applications. This kind of dimension has a key of time level (day, month, etc.), and attributes that describe it such as year, month, etc. In your BI model, you join this dimension to facts on their date fields, to aggregate from day level to week, month, and year.

In this post, I will demonstrate how to create a date dimension on Azure Databricks using Python. A link to the complete Databricks notebook is at the end of the post.

Check out the code, as well as explanation, in that post.

Comments closed

Managing Plot Parameters in R

Published 2023-08-10 by Kevin Feasel

Steven Sanderson switches up a visual:

When it comes to data visualization in R, the par() function is an indispensable tool that often goes overlooked. This function allows you to control various graphical parameters, unleashing a world of customization possibilities for your plots. In this blog post, we’ll demystify the par() function, break down its syntax, and provide you with hands-on examples to help you create stunning visualizations.

Click through to check it out. My loyalties definitely lie with ggplot2 for static visual development in R but it’s definitely not the only way to get images to look the way you want them.

Comments closed

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Curated SQL Posts