Curated SQL – Page 671 – A Fine Slice Of SQL Server

Query Plan Warnings and Operator Properties

Published 2022-03-21 by Kevin Feasel

Erik Darling continues a series on understanding execution plans. First up, what some of those warnings mean:

The good news is that SQL Server’s query plans will attempt to warn you about problems. The bad news is that most of the warnings only show up in Actual Execution Plans. The worse news is that a lot of the warnings that try to be helpful in Estimated Execution plans can be pretty misleading.

Next, Erik reviews some interesting properties:

There’s a lot of stuff flying around in a query plan. Data and what not.
Sure, you can hover over operators and arrows and see some stuff, but if you really wanna see stuff — I mean REALLY wanna see stuff — you gotta get into the properties.

Check out both posts.

Comments closed

Power Query BufferMode Types

Published 2022-03-21 by Kevin Feasel

Chris Webb looks at a new addition to Power Query:

Anyone with previous experience of Table.Buffer will see that BufferMode.Eager is the existing behaviour, but what is BufferMode.Delayed for?
It turns out that it’s there to make development faster.

Read on for the answer and what it does.

Comments closed

Visualizing Networks of R Library Usage

Published 2022-03-18 by Kevin Feasel

Bryan Shalloway has fun with network plots:

In previous posts and threads I’ve alluded to the potential utility of visualizing the relationships between parsed functions/packages and files as a network plot.
I added the function network_plot() to funspotr. In this post I’ll simply output the network plots of the parsed-out packages from the code collections discussed in the prior two posts:

Click through for interactive plots of what different people in the R community use.

Comments closed

Algorithmic Trading with ML.NET

Published 2022-03-18 by Kevin Feasel

Zadhid Powell has an example of working with ML.NET:

Machine learning is one of those areas of programming which is very capable of invitations and creativity. But, are you limited to any particular language like Python or R to develop either AI or ML projects? Who says that?
Nowadays, many developers have started learning to program with C#. But, if you’re one of them, you’d probably have heard that C# is not the best choice to start programming or it’s just useful for windows applications which is a wrong belief.

I mean, I’m still going to the mats for F# here but yeah, anybody who says C# is a bad starting language for programmers or that you can only build Windows apps with it lacks sufficient information on the language to make a sound judgment.

Comments closed

Managed Instance Link in Preview

Published 2022-03-18 by Kevin Feasel

Dani Ljepava announces support for Managed Instance link is now in public preview:

As of today, we are pleased to announce that the link feature for Managed Instance is available in the open public preview, in all Azure regions worldwide. It can be used with existing, or new managed instances, and SQL Server 2019 Enterprise, or Developer edition, including SQL Server 2022 CTP (available through EAP). We have also released the tooling support for the link in the form of automated wizards available in SQL Server Management Studio, starting from SSMS v18.11.1.
With the link, replicated databases from SQL Server on Managed Instance are usable as R/O secondary replicas. While the link is in operation, transactions commited on SQL Server (primary) are instantaneously committed to Managed Instance (secondary). This provides an exact replica of your SQL Server database on Managed Instance, synced near real-time. The link was built to be resilient, in case of the network being down, SQL Server being rebooted, or maintained, or in case of some other issue, the link will automatically resume replicating where it has left off when the issue has been resolved.

Support for 2019 is a shrewd idea, given the SQL Server version adoption curve for companies. This isn’t going to replace having a proper availability group for high availability or even (most) disaster recovery options, though, because the link is currently one-way—though Dani does mention eventual support for bi-directional operation with SQL Server 2022.

Comments closed

Database Scripting via Temporary Stored Procedure

Published 2022-03-18 by Kevin Feasel

Kenneth Fisher has a use for temporary stored procedures:

The other day I was asked to create a SQL Audit on several different databases. Completely unexpectedly (sarcasm warning!) the list grew, not once, not twice, but enough times that I’ve lost count, and each time I would copy and paste my code for the new databases and change the database name in each piece. Then on one notable occasion I had to change the code for each of the, at that point 10, copies of the code. Talk about a headache.

Then there was the epiphany.

Comments closed

Power BI Misconceptions

Published 2022-03-18 by Kevin Feasel

Reza Rad has a video (and article):

Misconception 1: Power BI is not an enterprise reporting tool, it is only good for self-service.
This is a misconception. And it is there because many people who have heard of Power BI, are not aware of the data modeling engine, the data transformation, and other main components of it. Maybe they just know Power BI as a visualization tool.
Power BI came to the market with the promise of binging data analysis to everyone using extra-ordinary self-service ability using Power BI Desktop and Power BI Service. However, Power BI itself is built on top of Microsoft enterprise data analysis toolset.

Read on for more information about this, as well as four other misconceptions.

Comments closed

Running MSDTC on Linux Containers in Kubernetes

Published 2022-03-18 by Kevin Feasel

Amit Khandelwal reminds us that MSDTC exists:

It’s been a while since I’ve had the opportunity to write and share a blog post about SQL Server containers and Linux. Today, I’d like to show you how to set up and use MSDTC (Microsoft Distributed Transaction Coordinator) to execute distributed transactions for SQL Server containers running on a Kubernetes platform.
Please see the following documentation for more information on DTC and SQL Server on Linux. How to configure MSDTC on Linux – SQL Server | Microsoft Docs.

I kid (sort of) but it is good to see as much parity between the Windows and Linux versions of SQL Server as possible.

Comments closed

Unpivoting in Spark with Stack()

Published 2022-03-17 by Kevin Feasel

The Hadoop in Real World team does some stacking:

stack function in Spark takes a number of rows as an argument followed by expressions.
stack(n, expr1, expr2.. exprn)
stack function will generate n rows by evaluating the expressions.

Think of it as the Spark alternative to UNPIVOT.

Comments closed

Dynamic DAGs with Apache Airflow

Published 2022-03-17 by Kevin Feasel

Bhavya Garg explains how we can create dynamic directed acyclic graphs in Apache Airflow:

Airflow dynamic DAGs can save you a ton of time. As you know, Apache Airflow is written in Python, and DAGs are created via Python scripts. That makes it very flexible and powerful (even complex sometimes). By leveraging Python, you can create DAGs dynamically based on variables, connections, a typical pattern, etc. This very nice way of generating DAGs comes at the price of higher complexity and subtle tricky things that you must know

Read on for an example.

Comments closed

M	T	W	T	F	S	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Curated SQL Posts