Star Join Optimization in SQL Server 2008

When working with a dimensionally modeled data warehouse it is common for a large number of your queries to follow a "star pattern". This pattern consist of Fact data being retrieved along with it's related dimension data from a start schema. What makes the Star Schema look like a star is the dimensions rotating around the Fact Table. See a good example of it here. Star queries also usually have most, if not all, of the following characteristics:

They follow a pattern of joining fact tables with their related dimensions
They apply some types of filtering
They usually perform some type of aggregation or additive operation
A true relational thought process can hinder the performance of this type of query, so in SQL Server 2008 Enterprise Edition (sorry standard edition folks) they have added new optimizations in the query optimizer to significantly improve query performance for queries that retrieve a larger portion of those tables.
The new additions are based on Bitmap filters, which you may be familiar with if you've played with Bitmap indexing in Analysis Services. Some good info on those can be found here:
Basically it allows SQL Server to remove non-qualifying fact table rows from any further processing, resulting in significant amount of processing time compared to comparable products. Results of 15%-30%. Some individual queries get a n even greater boost.
Below is a diagram from MSDN showing the mechanism in action.
FROM MSDN:
" Figure 1: Star join query plan with join reduction processing for efficient DW
The new star join optimization uses a series of hash joins, building a hash table for each dimension table that participates. As a byproduct of building this hash table, additional information, called a bitmap filter, is built. Bitmap filters are represented as boxes in Figure 1, labeled “Join Reduction Info.” These filters are pushed down into the scan on the fact table, and effectively eliminate almost all the rows that would be eliminated later by the joins. This eliminates the need to spend CPU time later copying the eliminated rows and probing the hash tables for them. The illustration shows the effect of this filtering within the fact table scan. The SQL Server 2008 query executor also re-orders the bitmaps during execution, putting the most selective one first, then the next most selective one, and so forth. This saves more CPU time, because once a fact table row fails a check against a bitmap, the row is skipped. "
Enjoy this new feature in your data warehousing efforts! Let me know if you have any success stories or challenges so we can work through them together! 🙂
Don't forget to post your thoughts or email me your questions to ajorgensen@pragmaticworks.com. As always, this Blog is to help you better understand the tools at your disposal ...

You can see more posts like this as well as other great content on my main blog at http://blogs.pragmaticworks.com/Adam_Jorgensen

Book Review: Big Red - Voyage of a Trident Submarine

by Andy Warren

SQLServerCentral.com

Blogs

I've grown up reading Tom Clancy and probably most of you have at least seen Red October, so this book caught my eye when browsing used books for a recent trip. It's a fairly human look at what's involved in sailing on a Trident missile submarine...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-03-10

1,439 reads

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

by Robert Davis

SQLServerCentral.com

Blogs

Question: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? This question was sent to me via email. My reply follows. Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? Databases to be mirrored are currently running on 2005 SQL instances but will be upgraded to 2008 SQL in the near future.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-23

1,567 reads

Inserting Markup into a String with SQL

by Phil Factor

SQLServerCentral.com

T-SQL

In which Phil illustrates an old trick using STUFF to intert a number of substrings from a table into a string, and explains why the technique might speed up your code...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-18

1,631 reads

Networking - Part 4

by Andy Warren

SQLServerCentral.com

Blogs

You may want to read Part 1 , Part 2 , and Part 3 before continuing. This time around I'd like to talk about social networking. We'll start with social networking. Facebook, MySpace, and Twitter are all good examples of using technology to let...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-17

1,530 reads

Speaking at Community Events - More Thoughts

by Andy Warren

SQLServerCentral.com

Blogs

Last week I posted Speaking at Community Events - Time to Raise the Bar?, a first cut at talking about to what degree we should require experience for speakers at events like SQLSaturday as well as when it might be appropriate to add additional focus/limitations on the presentations that are accepted. I've got a few more thoughts on the topic this week, and I look forward to your comments.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-13

360 reads

Star Join Optimization in SQL Server 2008

Rate

Share

Share

Rate

Star Join Optimization in SQL Server 2008

Rate

Share

Share

Rate

Related content

Book Review: Big Red - Voyage of a Trident Submarine

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

Inserting Markup into a String with SQL

Networking - Part 4

Speaking at Community Events - More Thoughts