Categories: SQL Server

T-SQL Tuesday 86: SQL Server Bugs & Enhancement Requests

 Let’s kick off the new year with a T-SQL Tuesday! This month’s edition is hosted by Brent Ozar:

Announcing T-SQL Tuesday #86: SQL Server Bugs & Enhancement Requests

Brent wants us to blog about our favorite Connect items because Microsoft is currently listening to its customers and actually closing items. So it is the time to take advantage! The Connect item that I’m going to highlight was submitted by Niko Neugebauer (blog | twitter) and is – of course – about columnstore indexes:

Implement Batch Mode Support for Row Store

One of the biggest advantages of columnstore indexes – aside from the huge compression ratio – is Batch Mode. With batch mode, certain operators in a query plan work on blocks of 1,000 rows at a time instead of doing operations row by row. This is a huge performance improvement: doing something once to 1,000 rows instead of doing it 1,000 times to an individual row tends to be faster. The downside is that it only works on tables with a columnstore index (clustered or non-clustered). If the table is in row storage, batch mode is not available and that’s what the connect item is about: make batch mode available in all scenarios. This can have some serious performance impact.

With SQL Server 2016, you can have batch mode on row storage table by using a little trick (detailed by Itzik Ben-Gan in SQLMag.com):

you can create a filtered non-clustered columnstore index on a single column. For example:

CREATE NONCLUSTERED COLUMNSTORE INDEX ix_dummy ON dbo.Table1(col1)
WHERE col1 = 0 AND col1 = 1;

Since col1 cannot be 0 and 1 at the same time, the index will have no data at all. So there is no performance overhead (no writes) and the optimizer will use a constant scan in the plan. However, since a columnstore index is involved, any query on Table1 can now run in batch mode. How crazy is that? This means SQL Server can already execute batch mode just fine on tables that are not really using a columnstore index, so there is no reason for not having this connect item fixed.

At the time of writing, the Connect items is over 2 years old but has only 36 votes. Time to change this! Please go to Connect and upvote it!


------------------------------------------------
Do you like this blog post? You can thank me by buying me a beer 🙂
Koen Verbeeck

Koen Verbeeck is a Microsoft Business Intelligence consultant at AE, helping clients to get insight in their data. Koen has a comprehensive knowledge of the SQL Server BI stack, with a particular love for Integration Services. He's also a speaker at various conferences.

Recent Posts

dataMinds Saturday 2024 – Session Materials

It was great being at dataMinds Saturday 2024 this past weekend. A great crowd of…

9 hours ago

Check your regions people

Today I was having a nice discussion with some colleagues about Fabric and pricing/licensing came…

2 weeks ago

Book Review – Deciphering Data Architectures

I recently purchased and read the book Deciphering Data Architectures - Choosing Between a Modern…

3 weeks ago

Fabulous Stuff in Fabric – Part 5: Folders in Workspaces

A while ago I had a little blog post series about cool stuff in Snowflake. I’m…

4 weeks ago

Speaking at dataMinds Saturday & Techorama 2024

I have the pleasure to announce I'll be presenting at two conferences this spring. The…

1 month ago

Change Notebook Spark Session Time-Out in Microsoft Fabric

You might know the feeling: you're writing code in a Notebook in Microsoft Fabric and…

1 month ago