2026-05-29 – Curated SQL

In this article, you will learn how logits, temperature, and top-p sampling work together to control next-token prediction in large language models.

Topics we will cover include:

What logits are and how they are produced by a transformer’s final linear layer.

How temperature and top-p (nucleus sampling) shape the probability distribution used for token selection.

How these three components fit into a sequential pipeline that governs LLM output generation.

Click through for that explanation.
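For a rough feel of how those three pieces chain together, here is a minimal Python sketch. It is not taken from the linked article: the function names, the four-token vocabulary, and the toy logit values are all made up for illustration, and it assumes the common ordering of temperature scaling, then softmax, then top-p filtering, then a weighted random draw.

```python
import math
import random

def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw logits into probabilities; assumes temperature > 0."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def top_p_filter(probs, top_p=0.9):
    """Keep the smallest set of most-likely tokens whose cumulative
    probability reaches top_p, then renormalize over that set."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}

def sample_next_token(logits, temperature=1.0, top_p=0.9, rng=random):
    """Logits -> temperature-scaled probabilities -> nucleus -> one token id."""
    probs = softmax_with_temperature(logits, temperature)
    nucleus = top_p_filter(probs, top_p)
    token_ids = list(nucleus)
    weights = [nucleus[i] for i in token_ids]
    return rng.choices(token_ids, weights=weights, k=1)[0]

# Hypothetical scores for a 4-token vocabulary (illustrative values only).
toy_logits = [2.0, 1.0, 0.5, -1.0]
next_token = sample_next_token(toy_logits, temperature=0.7, top_p=0.9)
```

Lowering the temperature sharpens the distribution toward the highest-scoring token, and lowering top_p shrinks the pool of candidates before the draw happens.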


Filtered Indexes in SQL Server

Published 2026-05-29 by Kevin Feasel

Erik Darling has a new video:

Now, you just can’t talk about indexing in SQL Server really without talking about filtered indexes. They are a very, very important thing. Conceptually, they are just not that hard to figure out.

It’s an index with a where clause. It only indexes the data that qualifies for the where clause. I don’t know. Like the benefits of that just seem rather apparent to me.

Benjamin Franklin highly encourages you to watch this video, even though filtered indexes are one of the most frustrating things in SQL Server. There are so many cases where I think they should work, and they actually work in approximately a third of those cases.
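To make the "index with a where clause" idea concrete, here is a small sketch. One loud caveat: it uses SQLite's partial indexes through Python's built-in sqlite3 module as a stand-in, not SQL Server, so it says nothing about the SQL Server optimizer quirks mentioned above. The table, column, and index names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, status TEXT)"
)

# 1,000 rows, of which only every 20th is 'open' (50 rows in total).
rows = [(i, i % 50, "open" if i % 20 == 0 else "closed") for i in range(1, 1001)]
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)

# The index carries a WHERE clause, so it only contains the qualifying rows.
conn.execute(
    "CREATE INDEX ix_open_orders ON orders (customer_id) WHERE status = 'open'"
)

# The query repeats the index predicate, which lets the planner consider it.
query = "SELECT id FROM orders WHERE status = 'open' AND customer_id = 20"
open_ids = sorted(r[0] for r in conn.execute(query))

# Column 3 of EXPLAIN QUERY PLAN output is the human-readable plan detail.
plan_text = " ".join(r[3] for r in conn.execute("EXPLAIN QUERY PLAN " + query))
```

The query here spells out the same predicate as the index definition. A predicate that does not match the index's WHERE clause generally means the index is not considered at all.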


Tips for a Terabyte-Sized Database

Published 2026-05-29 by Kevin Feasel

Brent Ozar recommends some actions:

You were minding your own business, and all of a sudden it happened.

You glanced at file sizes one day, and your eyes got big. The numbers got a little large while you weren’t looking. This is a great time to stop and think about a few changes to the way you’re managing this database.

These are some good recommendations on the whole. 1TB isn’t a magic number, but it’s a pretty decent dividing line.


Automatic Index Compaction in Azure SQL

Published 2026-05-29 by Kevin Feasel

Chad Callihan takes a look at a preview feature:

There isn’t one set way to manage indexes. Maybe you use Ola Hallengren scripts. Maybe it’s something you put together yourself. Either way, there might be a big shift coming for SQL Server database administrators and how index management is handled.

Last month, Microsoft announced Automatic Index Compaction, which is in preview for Azure SQL Database, Azure SQL Managed Instance, and SQL Database in Fabric. Instead of utilizing something like Ola Hallengren scripts or your own homegrown setup to monitor and rebuild indexes, the database engine will continuously run in the background and handle indexes for you, hence the “automatic” in the name.

Read on to see how it works, as well as a note around page density and index fragmentation. But Jeff Moden makes a good point in the comments, so check that out.


Polymorphic Associations in Postgres

Published 2026-05-29 by Kevin Feasel

Andrei Lepikhov has multiple types:

Planning such a query efficiently is no easy task — and in my experience, this is confirmed by user reports from the 1C world, since PostgreSQL is currently not rich in LEFT JOIN optimisations. At the same time, the properties of this pattern enable the development of various techniques to improve execution efficiency. I’ve managed to implement several straightforward optimisations of this template. But first, let’s understand what polymorphic references actually are, where they come from, and how common they really are. That’s the gap I’m trying to fill with this post.

Click through for the explanation. This isn’t the easiest problem to solve in the relational world, though I do tend to prefer the subclass/superclass solution, myself.


Semantic Similarity Search Distance Metrics

Published 2026-05-29 by Kevin Feasel

Andrew Pruski does a bit of math:

Here we are using cosine as a distance metric, but there are three available to us in SQL Server:

Euclidean Distance

Negative Dot Product

Cosine Distance

In this post we’ll run through all three, see how they are calculated, and explore how they differ from each other.

Click through for a description of each.
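As a quick reference, the three metrics can be sketched in plain Python using their textbook definitions. This is an illustration rather than Andrew's code or SQL Server's implementation, and the function names and sample vectors are made up.

```python
import math

def euclidean_distance(a, b):
    """Straight-line distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def negative_dot_product(a, b):
    """Dot product with the sign flipped, so that smaller means more similar."""
    return -sum(x * y for x, y in zip(a, b))

def cosine_distance(a, b):
    """One minus cosine similarity; ignores magnitude and compares direction.
    Assumes neither vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

# Hypothetical 2-D vectors: v1 and v3 point the same way, v2 is perpendicular.
v1 = [1.0, 0.0]
v2 = [0.0, 2.0]
v3 = [3.0, 0.0]
```

With these vectors, cosine distance treats v1 and v3 as identical because they share a direction, while Euclidean distance still separates them by their difference in length.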



Day: May 29, 2026

Next Token Selection in Language Models
