http://www.sqlservercentral.com/blogs/vinaythakur/2012/05/23/denali-day-23-data-quality-services-dqs/

Printed 2014/12/18 04:51PM

Denali – Day 23: Data Quality Services (DQS)

2012/05/23

Denali – Day 23: Data Quality Services(DQS)

Data intelligence vs. human intelligence: Sql server RDBMS is a database system which has data in it, most of the time, we enter the data but as a user they enter the data assuming things are good, like “USA, US or America are same for user as a knowledgeable person but for the system it is not, so using DQS we can integrate the human intelligence to make our data become more productive and can cleanse or correct the data.

 

Not installed by default.

Custom Install/activate using sql server Engine setup, to check the option.

Once setup done, need to activate or install the DQS using GUI – Data Quality Installer or DQSInstaller.exe. to complete the installation.

Need to provide strong password for master key.

It has two types

 

Useful for BI.

 

SSIS for data cleansing or TOOL: DQSInstaller.exe

 

Metadata DATABASE: DQS_Main, DQS_Projects, DQS_Staging_data

 

Security:

Dqs_administrator

Dsq_kb_editor

Dsq_kb_operator

 

To DQS Tool have three major components in it.

  1. Knowledge Base Management
  2. Data Quality Projects:
  3. Administration

 

KB is to let DQS build the knowledge and let it declare what is correct, means educate or knowledge the system, once system is having data, we can build the project to produce the required output like correct the data or make the report or produce the duplicate data, and to administrate the same using Administration.

 

In xls sheet. Ongoing. New DQP , project

Create cleansing.

Excel file

Xls columns to kb domain.

Start …

 

Interactive cleansing.

Results…

Job Title: Suggested, new, invalid, corrected, correct -à Approve/Reject

 

We can correct/approval the correction

Lastly: exporting the cleansing data.

Destination … xls or sql data (cannot to existing table)

 

“It uses KB gathering in a data quality KB. Cleansing results can be exported to xls or sql table”

 

*Composite Domain to cleanse data

 

Matching: CTP3. :

Matching is used to check for duplicates /matching records and produced and exports it.

 

Reflects business requirement to find duplicate records

Better KB more matching results would be.

 

Ref:

 

http://msdn.microsoft.com/en-us/library/ff877925(SQL.110).aspx

http://blogs.msdn.com/b/dqs/

http://www.mssqltips.com/tipprint.asp?tip=2575

http://www.youtube.com/watch?v=jfDVG8Nf8No



Copyright © 2002-2014 Simple Talk Publishing. All Rights Reserved. Privacy Policy. Terms of Use. Report Abuse.