SQL webinar

Transcript of SQL webinar

Page 1: SQL webinar

Quality SQL Server Data

An Introduction to Data Quality Services

Page 2: SQL webinar

Topics of Discussion

• Introduction to SQL Data Quality and its Components

• Cleansing Data

• Identifying Matching (duplicate) Data

• Demonstration

Page 3: SQL webinar

Introduction to SQL Data Quality and its Components

The Need for a Data Quality Management Solution

What does the DQS Solution Provide?

What is a Knowledge Base?

What is a Domain?

What is a Third-Party Reference Data Service?

What is a Matching Policy?

Page 4: SQL webinar

The Need for a Data Quality Management Solution

• Business “Intelligence”

• High-quality data is critical

• Incorrect data can make its way into your data warehouse

• Aggregating data from multiple sources can be problematic

• Inconsistent data

• Invalid data

• Duplicate entities

Page 5: SQL webinar

What does the DQS Solution Provide?

• Knowledge-based solution for:

•   Data Cleansing – identifying invalid or inconsistent data values and correcting them.

•   Data Matching – finding duplicate data entities

• DQS Components:

• Data Quality Services Server

• Data Quality Client

• Data Cleansing SSIS Transformation

Page 6: SQL webinar

Knowledge Base Component

• Repository of knowledge about data:

• Domains define values and rules for each field

• Matching policies define rules for identifying duplicate records

Page 7: SQL webinar

What is a Domain?

• Domains are specific to a data field (column in a dataset)

• Domains can be individual or composite

• Domains contain the rules for the data

• Valid

• Invalid

• Error
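The idea of a domain holding values and rules for one column can be sketched in a few lines of Python. This is only an illustration, not the DQS API: the `Domain` class, the `Status` enum, the `cleanse` method, and the sample "State" values are all hypothetical names chosen for the example, and the treatment of unknown values is an assumption.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, Optional, Tuple


class Status(Enum):
    VALID = "valid"      # value is acceptable as-is
    INVALID = "invalid"  # value does not belong in the domain
    ERROR = "error"      # value is a known mistake with a known correction


@dataclass
class Domain:
    """Hypothetical in-memory stand-in for a domain (one per column)."""
    name: str
    # Maps a known value (lower-cased) to (status, corrected value or None).
    values: Dict[str, Tuple[Status, Optional[str]]] = field(default_factory=dict)

    def add(self, value: str, status: Status, correct_to: Optional[str] = None) -> None:
        self.values[value.strip().lower()] = (status, correct_to)

    def cleanse(self, value: str) -> Tuple[str, Optional[Status]]:
        """Return (output value, status); status is None for unknown values."""
        status, correct_to = self.values.get(value.strip().lower(), (None, None))
        if status is Status.ERROR and correct_to is not None:
            return correct_to, status  # replace the known mistake
        return value, status           # leave everything else untouched


# Assumed sample knowledge for a "State" column.
state = Domain("State")
state.add("WA", Status.VALID)
state.add("Wash.", Status.ERROR, correct_to="WA")
state.add("N/A", Status.INVALID)

results = [state.cleanse(v) for v in ["WA", "wash.", "N/A", "Oregon"]]
for output, status in results:
    print(output, status.name if status else "UNKNOWN")
```

Values the domain has never seen are passed through with no status here, so they could be reviewed and added to the domain later.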

Page 8: SQL webinar

What is a Third-Party Reference Data Service?

• The Reference Data Service feature in Data Quality Services (DQS) enables you to subscribe to third-party reference data providers

• Has the following benefits:

• Validating your data against reference data guaranteed by a third-party company

• The reference data process is incorporated into DQS knowledge base building

• Supports using reference data from Windows Azure Marketplace

Page 9: SQL webinar

What is a Matching Policy?

• Define matching rules for business entities

• Matching policies assess the likelihood of records being duplicates.

• A matching policy can be added to a knowledge base; it contains rules that help determine whether multiple data records represent the same business entity.
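One way to picture how a matching rule might assess the likelihood that two records are duplicates is a weighted similarity score compared against a threshold. The sketch below is an assumption-laden illustration, not how DQS computes its scores: the field weights in `RULE`, the `MIN_SCORE` threshold, the sample customer records, and the use of Python's `difflib` for string similarity are all choices made for the example.

```python
from difflib import SequenceMatcher

# Hypothetical matching rule: field name -> weight (weights sum to 1.0).
RULE = {"name": 0.6, "city": 0.4}
MIN_SCORE = 0.8  # assumed threshold above which two records count as a match


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def match_score(r1: dict, r2: dict, rule: dict = RULE) -> float:
    """Weighted sum of per-field similarities for two records."""
    return sum(weight * similarity(r1[f], r2[f]) for f, weight in rule.items())


def find_matches(records: list, rule: dict = RULE, min_score: float = MIN_SCORE) -> list:
    """Compare every pair of records and keep pairs scoring at or above min_score."""
    matches = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            score = match_score(records[i], records[j], rule)
            if score >= min_score:
                matches.append((i, j, round(score, 2)))
    return matches


# Assumed sample data: the first two rows describe the same business entity.
customers = [
    {"name": "Contoso Ltd", "city": "Seattle"},
    {"name": "Contoso Ltd.", "city": "seattle"},
    {"name": "Fabrikam Inc", "city": "Portland"},
]

matches = find_matches(customers)
print(matches)
```

Comparing every pair is fine for a small sample like this; a larger dataset would need some way to limit which records are compared.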

Page 10: SQL webinar

Demonstration

• Create a knowledge base

• Use DQS to cleanse data

• Use DQS in an SSIS Package