Curated SQL – Page 33 – A Fine Slice Of SQL Server

A Primer on Data Storage in PostgreSQL

Published 2026-02-24 by Kevin Feasel

The whole idea behind a database is the ability to persist the data. You want your inventory of widgets to get stored so you can look at it later. That means writing out to disks. However, what is writing to disk and where is it being written? Unlike SQL Server which has one (or more) big file for all data, PostgreSQL has a collection of a large number of files. There is a methodology and structure to these files that you need to understand in order to later understand how the data gets written to and retrieved from these files.

While we’re going to be very focused on file, page, folder, etc., throughout this article, that’s just part of the physical nature of persisting your data. What is being persisted is still the logical information you’re most interested in – rows and columns. I just wanted to emphasize the distinction between the two here.

Click through to see how PostgreSQL stores information.

Comments closed

An Overview of Fabric Cost Analysis

Published 2026-02-24 by Kevin Feasel

Rob Sewell provides a primer on the Fabric Cost Analysis tool:

Today we look at another monitoring tool in the collection: FCA, the Fabric Cost Analysis solution.

Where FUAM answers “how is my Fabric capacity being used?”, FCA answers “what is it costing us, and who is spending what?”

Click through to see how it works.

Comments closed

A Primer on dbt against DuckDB

Published 2026-02-23 by Kevin Feasel

Robin Moffatt shares a tutorial on dbt:

In 2022 I made a couple of attempts to learn dbt, but it never really ‘clicked’.

I’m rather delighted to say that as of today, dbt has definitely ‘clicked’. How do I know? Because not only can I explain what I’ve built, but I’ve even had the 💡 lightbulb-above-the-head moment seeing it in action and how elegant the code used to build pipelines with dbt can be.

In this blog post I’m going to show off what I built with dbt, contrasting it to my previous hand-built method.

I also had heard of dbt but haven’t really spent the time to learn it because I’m not really a data engineer. But this tutorial has me interested in diving in further.

Comments closed

Spark Schema Inference in Production

Published 2026-02-23 by Kevin Feasel

Miles Cole shares some advice:

To show the impact I want to highlight a benchmark that included Fabric Spark on a single 19GB CSV input file (100M Contoso dataset, sales table) for the benchmark. While there were a number of issue with this benchmark that inadvertently make Spark appear to be slow, this is only focused on the impact of inferring schema and practical recommendations.

Read on to see a performance problem that schema inference brings up. I’d also want to mention the risk of data updates blowing up your well-laid plans as a risk. Schema inference is a double-edged sword: it can be convenient and open up new approaches to development, but can just as easily cause unexpected failures.

Comments closed

Optimizing Planned Availability Group Failover in SQL Server

Published 2026-02-23 by Kevin Feasel

Aaron Bertrand shares some advice:

Shaving even a handful of seconds from the process can improve the application and end user experience; it can also drastically reduce alert noise or, at least, how long alerts have to stay muted. There’s a lot of material out there about performing AG failovers correctly (no data loss), but far less that focuses on shortening the disruption window. The difference is usually some combination of redo volume, checkpoint behavior, open transactions, and secondary readiness.

I wanted to share some techniques I use to make planned failovers faster and more predictable. Some of these techniques are well documented, while others come from real-world patterns I’ve observed across many SQL Server environments. I’ll talk about what I do before, during, and after the failover to minimize disruption and increase the chance that end users are oblivious that anything happened.

Aaron provides several tips to help reduce the pain of failover.

Comments closed

An Overview of the Fabric Unified Admin Monitoring Tool

Published 2026-02-23 by Kevin Feasel

Rob Sewell lays out some information:

When you are responsible for a Microsoft Fabric tenant, it will not be very long before you are facing many questions.

Questions like:

How is my capacity being used?

Which workspaces are consuming the most resources?

What are my users actually doing?

When are my peak usage times?

You can scabble around in the Admin portal and try to piece together the answers, but it is a bit like trying to navigate a city with a paper map — you can get there eventually, but it is slow and painful, and you will probably miss some things along the way.

Read on to see how FUAM can help answer these sorts of questions.

Comments closed

An Introduction to Transactions in SQL Server

Published 2026-02-23 by Kevin Feasel

Paul Randal hits us with another blast from the past:

One of the most fundamental concepts in any relational database management system (RDBMS), such as SQL Server, is the transaction. During my consulting career, I’ve seen many instances of performance problems caused by developers not understanding how transactions work in SQL Server, so in this tutorial, I’ll explain what transactions are and why they’re necessary, plus some details of how they work in SQL Server. There are nuances to some of this when Accelerated Database Recovery (ADR) is in use – topics for future articles.

One thing that Paul does not point out here but has done in other places is just how bad the implementation is for nested transactions in SQL Server. They’re very limited in nature, so the best advice there would be not to use them at all.

Comments closed

Opening the VS Code Terminal in a New Window

Published 2026-02-23 by Kevin Feasel

Andrew Pruski splits things out:

A while back I wrote a quick post on setting up key mappings in Visual Studio Code…they make presenting (and generally working) in Visual Studio Code really smooth.

But one thing that kinda bugs me is the location of the terminal…I’ve always had it at the bottom, which is generally fine, and I know you can move it around (top, right, left)…however I’ve found that when presenting, space is at a premium. I bump up the font size and this can result in a lot of scrolling through results in the terminal, which ain’t great.

Read on to see how you can make the integrated terminal a separate tab.

Comments closed

Building a Graph for Its Takeaway

Published 2026-02-20 by Kevin Feasel

Cole Nussbaumer Knaflic reminds us that visuals should have purpose:

I was facilitating a workshop recently when someone asked one of my favorite questions about a graph on the screen: “So… what are we supposed to take away from this?”

Such a simple—and useful—question.

One challenge was that the graph was attempting to show multiple comparisons at once, so it wasn’t clear what mattered most. To further complicate things, the data in question spanned very different magnitudes, with one category dwarfing the rest.

Click through for a demonstration and how changing the visual layout can affect the message. The challenge I tend to run into is that, when I’m developing a visual for an application or a report, I don’t know what the precise message should be at that moment in time. I have to design with an idea of the data, but what actually emerges will depend upon what data is in there. Tailoring a visual for a specific message at a specific point in time is a lot easier when building a presentation, but it gets tricky when you’re building an application for the long haul.

Comments closed

The Makeup of an 8KB Page in PostgreSQL

Published 2026-02-20 by Kevin Feasel

Radim Marek takes us through the structure of a page:

If you read previous post about buffers, you already know PostgreSQL might not necessarily care about your rows. You might be inserting a user profile, or retrieving payment details, but all that Postgres works with are blocks of data. 8KB blocks, to be precise. You want to retrieve one tiny row? PostgreSQL hauls an entire 8,192-byte page off the disk just to give it to you. You update a single boolean flag? Same thing. The 8KB page is THE atomic unit of I/O.

But knowing those pages exist isn’t enough. To understand why the database behaves the way it does, you need to understand how it works. Every time you execute INSERT, PostgreSQL needs to figure out how to fit it into one of those 8,192-byte pages.

It is a little wild how three of the largest relational database systems use 8KB pages. I know that, on the SQL Server side, they’ve experimented with different page sizes internally and have repeatedly said that, even recently (the last time I heard this was maybe about 3 years ago at a SQL Saturday), there just isn’t a benefit from moving away from 8KB. But what’s in those 8KB differ, and Radim goes into details on what’s in PostgreSQL.

Comments closed

M	T	W	T	F	S	S
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Curated SQL Posts