Curated SQL – Page 131 – A Fine Slice Of SQL Server

The Dual Perils of Overfitting and Data Leakage

Published 2025-05-07 by Kevin Feasel

One of the bigger risks of iterative statistical or machine learning fitting procedures is over-fit or the dreaded data leak.

Over-fit is when: a model performs better on training data than on future data. Some degree of over-fit is expected. A data leak is when: the model learns things about the evaluation set that it would not know about the future data the model will be applied on. This can drive models that look great on training and (supposedly) held-out data, but don’t work in practice.

Click through for the rest of the story, and be sure to check out the comments for a notebook digging further into one of the topics.

Comments closed

The APPLY Operator in T-SQL

Published 2025-05-07 by Kevin Feasel

I have a new video:

In this video, I explain the history behind the APPLY operator in T-SQL. I also demonstrate one use case for it: executing table-valued functions.

As I mention at the end of the video, this is by far the least interesting use case for the APPLY operator. Subsequent videos in the series get a lot more exciting. But we have to start at the beginning, unless we’re going full Pulp Fiction.

Comments closed

Organizing a Microsoft Fabric Data Platform with Domains

Published 2025-05-07 by Kevin Feasel

Jon Vöge does a bit of organization:

A topic which seems more relevant than ever, is the question of how to organize the contents of your Microsoft Fabric Platform.

Through the contents of a few blogs, I will give you an overview of things to consider, as well as suggestions that you can choose from when designing your platform.

This first week, we’ll take a look at Domains in Microsoft Fabric.

Read on to understand why domains can be valuable and a solid way to structure them.

Comments closed

Choosing a Warehousing Data Architecture

Published 2025-05-07 by Kevin Feasel

James Serra compares and contrasts OLAP architectures:

As discussed in my blog and book “Deciphering Data Architectures: Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh” (Amazon), organizations are often challenged with choosing the right data architecture to meet their business goals—especially as AI and data-driven decision-making take center stage. To help clarify, here’s a quick review of the four core architectures, followed by guidance on when to use each. Each architecture includes five stages of data movement – ingest, store, transform, model, and visualize (described here).

Click through for James’s take on how each of them works and when you might choose one over the other.

Comments closed

Writing to Microsoft FabricDelta Tables in Python via DuckDB

Published 2025-05-07 by Kevin Feasel

Gilbert Quevauvilliers does a bit of writing:

When I was exploring how to easily write to Delta Tables with a Python notebook, it took me a considerable amount of time to find out how to do this.

This is my learnings below, and from my point of view it makes it easy to write to a Lakehouse table, like what is done with a PySpark notebook.

Click through for one very important note, as well as the process.

Comments closed

Tracking Wait Classes in Oracle

Published 2025-05-07 by Kevin Feasel

Kellyn Gorman looks for performance bugbears:

When we talk about optimization in Oracle, many DBAs immediately jump into metrics like CPU utilization, I/O throughput, or specific SQL queries. But there’s a critical layer of understanding that often gets overlooked and that’s Oracle Wait Classes. These categories are essential to making sense of what’s really going on inside your database, and they’re often the starting point for diagnosing and optimizing performance.

Read on to learn what wait classes are and why they’re so important for Oracle DBAs.

Comments closed

Rounded Corners in Power BI

Published 2025-05-07 by Kevin Feasel

Elena Drakulevska breaks out the sandpaper:

I’ve been seeing all kinds of shapes in Power BI reports lately—and seems like there’s a lot of guesswork when it comes to rounded corners. Some people just throw in a number because it “feels modern”, others skip it altogether. But rounding shouldn’t be random.

Wondering what to do with your edges? Putting in a radius just because? Or maybe your designs still feel… too pointy?

Click through for some thoughts on rounding corners in Power BI, as well as how to do it.

Comments closed

SQL Server Performance Office Hours Episode 10

Published 2025-05-06 by Kevin Feasel

Erik Darling has some doozies today:

Many of our Tables have 15 plus guid s from other tables, but not declared as Foreign Key. Plus Foreigne Keys. When I add a Index on for every Foreign Key plus Indexes for declared foreign keys we end up with tables with 30 plus indexes for fks only. We don’t have write performance issues. would you index every fk for every possible join?

Click through for the answer to this and four other questions.

Comments closed

Task Flows now GA in Microsoft Fabric

Published 2025-05-06 by Kevin Feasel

Dan Liu makes an announcement:

Task flows feature is now generally available! Task flows streamline the design of your data solutions and ensure consistency between design and development efforts. It also allows you to navigate items and manage your workspace more easily, even as it becomes more complex over time.

Read on to see what’s available to us now.

Comments closed

An Overview of the Power BI Ecosystem

Published 2025-05-06 by Kevin Feasel

Kurt Buhler looks at tools:

In general, the process to build and manage both models and reports in Power BI can be simplified to a few sequential steps that describe the “lifecycle” of this content. You can see this depicted below, as well as some examples of the various tools that you might use to support each of these steps. Shameless plug, Bravo, Vertipaq Analyzer, and Analyze in Excel were created by SQLBI, and SQLBI is involved in the development of DAX Studio, DAX Optimizer, and Tabular Editor.

Click through for an interesting table showing what fits where. If all you’re using is Power BI Desktop, there’s a whole world of additional functionality you’re missing out on.

Comments closed

M	T	W	T	F	S	S
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31

Curated SQL Posts