What is the DMBoK definitiion of data quality management?
The planning, implementation, and control of activities that apply quality management techniques to data, in order to assure it is fit for consumption and meets the needs of data consumers.
What are the 4 business drivers for establishing a formal Data Quality Management program?
7 direct costs are associated with poor quality data. Name 4
What are the 4 goals Data Quality programs focus on?
Data Quality programs should be guided by these 10 principles
Which principle of Data Quality Management is to focus improvement efforts on data that is most important to the organization and its customers?
Criticality or Critical Data
What are the six core dimensions of data quality?
The _________ cycle is a problem-solving model known as “plan-do-check-act’.
Shewhart / Deming
In the ________ stage of the DQ Improvement Life Cycle, the Data Quality team assesses the scope, impact, and priority of known issues, and evaluates alternatives to address them.
Plan
In the ________ stage of the DQ Improvement Life Cycle, the DQ team leads efforts to address the root causes of issues and plan for ongoing monitoring of data.
Do
In the ________ stage of the DQ Improvement Life Cycle, the team actively monitors the quality of data as measured against requirements. As long as data meets defined thresholds for quality, additional actions are not required.
Check
In the ________ stage of the DQ Improvement Life Cycle, activities occur to address and resolve emerging data quality issues.
Act
What framework focuses on data consumers’ perceptions of data. It describes 15 dimensions across four general categories of data quality:
Strong-Wang Framework
What 4 general categories are described in the Strong-Wang framework?
In the Strong-Wang framework, What 4 dimensions are there in Intrinsic Data Quality?
o Accuracy
o Objectivity
o Believability
o Reputation
In the Strong-Wang framework, Which of these dimensions is not part of Contextual Data Quality?
o Value-added
o Interpretability
o Timeliness
o Completeness
o Appropriate amount of data
Interpretability. Should be relevancy
As part of the Strong-Wang Framework, which data quality category do these dimension belong?
o Interpretability
o Ease of understanding
o Representational consistency
o Concise representation
Representational DQ
In the Strong-Wang Framework Accessibility DQ category there are two dimensions, what are they?
o Accessibility
o Access security
There are 8 DQ issues caused by Poor System Design, name 6 of them.
________________ is a form of data analysis used to inspect data and assess quality. It uses statistical techniques to discover the true structure, content, and quality of a collection of data.
Data Profiling
Name the 2 activities prevalent in Data Quality Management
Maturity Assessment and Profiling
What are the 5 statistical techniques used to inspect data and assess quality in data profiling?
Profiling also includes __________ analysis, which can identify overlapping or duplicate columns and expose embedded value dependencies.
cross-column
_____________ analysis explores overlapping values sets and helps identify foreign key relationships.
Inter-table