Introduction to Parallel Data Warehousing

What are we talking about?

Recently Microsoft released a new version of SQL Server 2008 R2 called Parallel Data Warehouse Edition. there has been a lot of buzz about this new architecture because it is Microsoft’s entry into the Massive Parallel Processing (MPP) Scale out data warehousing arena. Typically Microsoft has offered SQL Server in a SMP or Symmetric Multi-processing architecture where all the CPUS memory and storage are in one physical architecture while the database operations take place entirely within on instance of SQL Server.

Parallel Data Warehousing or PDW is an appliance based architecture that provides significant scale out capabilities based on the technology the acquired from Datallegro Corp in 2008. This MPP architecture provides more scalable and predictable performance for significantly greater workloads up into the 100's of terabytes. Microsoft’s implementation is particularly exciting because PDW provides a much lower cost per terabyte since you can implement it with commodity hardware instead of a proprietary system like Teradata or Neteeza

PDW Works by controlling several different physical servers each running their own instance of SQL Server 2008 R2. The database and it’s tables are spread across these physical servers but appear as one database and table(s) to the end user. The appliance or brain of the PDW manages query execution and the meta data for what is stored and processed on what portion of the PDW.

See an overview of this in the diagram below.

Why do we need this?

PDW is important because it opens the possibilities of large scale data processing in a much more economical package? How economical you ask? Well let’s say I won’t have one in my garage. The price tag is still around $1mUS to get going, but that is a lot of hardware, licensing and processing power for the money. The entry level package is two racks of gear including storage, network, it’s own domain controller etc.. We’ll talk more about the architecture in the next article, for now we want to focus on what PDW is and why its so exciting!

Business these days are processing large volumes of data and the definition of “large volume of data” grows every day. With PDW now the SQL server community has a comparable architecture to that of Teradata or Neteeza. The major difference though is instead of adding a node to Teradata for approximately $850K, the PDW uses HP and Dell off the self hardware making expansion much more cost effective and drastically increasing ROI.

This introduction is the kick-start to a series I’m writing covering all aspects of the new PDW. If you have areas you would like to know more about, please email me or post comments so I can make sure to follow up with the product teams to get your more information. We are excited to be one of the few partners already working with PDW so we want to help you understand and see how this package could be beneficial for your enterprise.

Upcoming articles in the series

1. Architecture overview of PDW

2. Intro to new PDW Objects and Schema Features

3. Working with PDW Database Objects

4. Partitioning with PDW and Querying your PDW

5. Working with PDW Databases

6. How to get PDW in your environment

7. Fast Track Architecture vs. PDW

Thanks for checking out this introduction to the series. Please post comments and feel free to email me with questions. In the meantime, check out Microsoft’s PDW site for SQL Server at:

http://www.microsoft.com/sqlserver/2008/en/us/parallel-data-warehouse.aspx

Thanks for stopping in. Keep making your business intelligent!

Adam

Book Review: Big Red - Voyage of a Trident Submarine

by Andy Warren

SQLServerCentral.com

Blogs

I've grown up reading Tom Clancy and probably most of you have at least seen Red October, so this book caught my eye when browsing used books for a recent trip. It's a fairly human look at what's involved in sailing on a Trident missile submarine...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-03-10

1,439 reads

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

by Robert Davis

SQLServerCentral.com

Blogs

Question: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? This question was sent to me via email. My reply follows. Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup? Databases to be mirrored are currently running on 2005 SQL instances but will be upgraded to 2008 SQL in the near future.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-23

1,567 reads

Inserting Markup into a String with SQL

by Phil Factor

SQLServerCentral.com

T-SQL

In which Phil illustrates an old trick using STUFF to intert a number of substrings from a table into a string, and explains why the technique might speed up your code...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-18

1,631 reads

Networking - Part 4

by Andy Warren

SQLServerCentral.com

Blogs

You may want to read Part 1 , Part 2 , and Part 3 before continuing. This time around I'd like to talk about social networking. We'll start with social networking. Facebook, MySpace, and Twitter are all good examples of using technology to let...

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-17

1,530 reads

Speaking at Community Events - More Thoughts

by Andy Warren

SQLServerCentral.com

Blogs

Last week I posted Speaking at Community Events - Time to Raise the Bar?, a first cut at talking about to what degree we should require experience for speakers at events like SQLSaturday as well as when it might be appropriate to add additional focus/limitations on the presentations that are accepted. I've got a few more thoughts on the topic this week, and I look forward to your comments.

★ ★ ★ ★ ★ ★ ★ ★ ★ ★

You rated this post out of 5. Change rating

2009-02-13

360 reads

Introduction to Parallel Data Warehousing

Rate

Share

Share

Rate

Introduction to Parallel Data Warehousing

Rate

Share

Share

Rate

Related content

Book Review: Big Red - Voyage of a Trident Submarine

Database Mirroring FAQ: Can a 2008 SQL instance be used as the witness for a 2005 database mirroring setup?

Inserting Markup into a String with SQL

Networking - Part 4

Speaking at Community Events - More Thoughts