New and Exciting in SQL Server Code Name “Denali”: Knowledge Driven Data Quality Services (DQS)

In a world of continuously growing amounts of data, data quality is becoming crucial. In order to clean your data and get to a high quality, you need to know “things” about it. We call this “knowledge”.

Knowledge can be found in difference places:

·         Inside the business data itself

·         In the organization’s procedures, processes and policies

·         In the hands and the heads of the Organization’s Data experts

·         Through external reference data providers… (many of them already out there floating on the Azure cloud J)


SQL Server Data Quality Services (DQS) is a new innovative Knowledge Driven data quality product that is delivered as part of SQL Server “Denali” release. DQS enables you to build a knowledge base, and use it to perform a variety of critical data quality tasks – correction, enrichment, standardization and de-duplication of your data.

Building a knowledge base is easy – using your own data, DQS allows you to discover knowledge directly from samples of your data, combining computer-assisted and interactive experiences. In addition you can also extend your knowledge with 3rd party IP, using cloud-based reference data services from Windows Azure Marketplace. DQS also provides batch capabilities with a SQL Server Integration Services (SSIS) component, as well as integration with the Master Data Services (MDS) Excel add-in.

 
For more information about DQS, check out the Forum and the SQL Server “Denali” CTP3 resource center.

In addition, a full-blown DQS overview presentation from the recent TechEd conference can be found here.