More Problems with Data Warehousing

Richard Gardner-291039

Hall of Fame

Points: 3603
More actions
March 12, 2007 at 10:38 am

#194490

Comments posted here are about the content posted at http://www.sqlservercentral.com/columnists/rgardner/2913.asp

Viewing 4 posts - 1 through 4 (of 4 total)

You must be logged in to reply to this topic. Login to reply

RonKyle SSC-Dedicated Points: 31582 More actions · Answer 1

I disagree that you can't reach 100% accuracy of the known data. If the bottom line of the data warehouse doesn't match the inputing systems, then the data warehouse creator needs to be able to explain why. Every possible permutation can't be checked, but a good sampling can be, and should be.

We had a consultant who built us a warehouse and said that the data shouldn't be expected to match completely. After some investigation, however, it was discovered that he had violated the "level of granularity" rule which is crucial. In this case the data didn't match because of a flaw in the data warehouse.

Data warehouse designers should consider stored procedure groups to self-document the steps. They can only be created and edited in the analyzer (at least in 2K and earlier) but hopefully if someone is building a dw using the analyzer is not a big issue.

chung-327878 SSC-Addicted Points: 432 More actions · Answer 2

This article hit the nail on its head. Data quality was the number one problem when we were building our data warehouse. I agree that your data warehouse can reach 100% accuracy of the known data (assume you meant user entries). But if the known data is not accurate, your reports are still off.

Loner SSC-Insane Points: 21279 More actions · Answer 3

I did wrote an article about data quality before I wrote the article Problems with data warehouse. But you are right, data quality is number one problem in building data warehouse. Nice Article.

http://www.sqlservercentral.com/columnists/jchan/2593.asp

Janet