GLOSSARY

Data Quality

Data quality is the degree to which a dataset fits its intended use, measured across accuracy, completeness, consistency, timeliness, validity, and uniqueness, and enforced by automated testing.

Quick answer
Data quality is how well data fits the purpose it is used for, measured across standard dimensions: accuracy, completeness, consistency, timeliness, uniqueness, and validity. Modern programs instrument rule-based checks and statistical tests in the pipeline itself — using Great Expectations, Soda, Monte Carlo, or dbt tests — and track incident mean-time-to-detect and mean-time-to-resolve per dataset.

WHAT IT IS

The DAMA-DMBOK and the Wang & Strong data quality dimensions framework (MIT, 1996) define the canonical dimensions. Modern practice encodes them as assertions (expectations, tests) that run with every pipeline; tools like Great Expectations, dbt tests, Monte Carlo, Soda, and Anomalo flag failures before they reach dashboards. Observability platforms add machine-learned anomaly detection for the issues assertions can't specify in advance.
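
A minimal sketch of the assertion pattern in plain Python with pandas (the table and column names are hypothetical; dedicated tools express the same checks declaratively and version them alongside the pipeline):

    import pandas as pd

    # Hypothetical pipeline output; table and column names are illustrative.
    orders = pd.read_parquet("orders.parquet")

    failures = []

    # Completeness: no missing customer IDs.
    if orders["customer_id"].isna().any():
        failures.append("completeness: customer_id has nulls")

    # Uniqueness: order_id must be a primary key.
    if orders["order_id"].duplicated().any():
        failures.append("uniqueness: duplicate order_id values")

    # Validity: amounts must be non-negative.
    if (orders["amount"] < 0).any():
        failures.append("validity: negative amounts")

    # Timeliness: newest record under 24 hours old (loaded_at assumed tz-aware UTC).
    if pd.Timestamp.now(tz="UTC") - orders["loaded_at"].max() > pd.Timedelta(hours=24):
        failures.append("timeliness: data is stale")

    # Stop the pipeline before bad data reaches dashboards.
    if failures:
        raise RuntimeError("data quality checks failed: " + "; ".join(failures))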

HOW IT WORKS

A data quality program defines SLAs for critical datasets, owners for each dataset, a remediation workflow, and a visible scorecard. Alerts go to the team that can fix the problem, not to a mailing list nobody reads.
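
One way to make that concrete is a small per-dataset registry pairing each SLA with its owner and alert route (a sketch; the field names, datasets, and channels are illustrative, not a specific tool's schema):

    from dataclasses import dataclass

    @dataclass
    class DatasetSLA:
        dataset: str          # fully qualified table name
        owner_team: str       # the team that can actually fix the data
        freshness_hours: int  # maximum acceptable staleness
        alert_channel: str    # where failures are routed

    REGISTRY = [
        DatasetSLA("warehouse.finance.revenue_daily", "finance-data", 6, "#finance-data-alerts"),
        DatasetSLA("warehouse.ml.feature_store", "ml-platform", 24, "#ml-platform-alerts"),
    ]

    def route_alert(dataset: str, message: str) -> str:
        # Route to the owning team's channel, never a catch-all list.
        sla = next(s for s in REGISTRY if s.dataset == dataset)
        return f"[{sla.alert_channel}] {sla.owner_team}: {dataset} - {message}"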

WHEN TO USE

Invest in data quality the first time a number reported to leadership turns out to be wrong, when ML models degrade silently, or when regulators require attestable evidence of data lineage and accuracy.

Related questions

What is data quality?
Data quality is how well data fits the purpose it is used for. The standard dimensions are accuracy (right value), completeness (no missing values), consistency (same value across systems), timeliness (current enough), uniqueness (no duplicates), and validity (conforms to business rules). Fitness is purpose-specific, not absolute.
How is data quality measured?
By running rule-based checks and statistical tests against the dataset on a schedule, reporting pass/fail per rule, and tracking quality scores at the dataset and column level over time. Modern tooling (Great Expectations, Soda, Monte Carlo, dbt tests) codifies these checks in the pipeline itself.
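A minimal sketch of scoring at the column and dataset level (the completeness rule is illustrative; production tooling persists a score per run so it can be trended over time):

    import pandas as pd

    def column_scores(df: pd.DataFrame) -> dict[str, float]:
        """Fraction of rows passing a basic completeness check, per column."""
        return {col: 1.0 - df[col].isna().mean() for col in df.columns}

    def dataset_score(df: pd.DataFrame) -> float:
        """Aggregate dataset-level score: the mean of the column scores."""
        scores = column_scores(df)
        return sum(scores.values()) / len(scores)

    # Example: one null out of six values yields an overall score of ~0.833.
    df = pd.DataFrame({"id": [1, 2, 3], "email": ["a@x.co", None, "c@x.co"]})
    print(column_scores(df), dataset_score(df))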
Who is accountable for data quality?
The domain business owner of the data, not IT. IT can provide the tools to detect issues, but the business is the only party that knows whether an anomaly represents a bug, a pipeline break, or a genuine change in the world. Quality dies in organizations that delegate ownership to IT alone.
What is the cost of poor data quality?
Gartner's long-running estimate puts the average annual cost at roughly $12.9M per organization in direct losses, before counting eroded trust in analytics and abandoned initiatives. The compound cost of lost decisions typically exceeds the direct cost by a wide margin.
How does NUUN Digital remediate data quality?
We instrument the top-priority pipelines first, wire alerts into the team that can fix them, and measure incident mean-time-to-detect and mean-time-to-resolve. We do not pursue 100% quality — we pursue enough quality for the decisions the data supports.
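The incident math is simple once occurrence, detection, and resolution timestamps are logged per dataset (a sketch with illustrative timestamps):

    from datetime import datetime
    from statistics import mean

    # Hypothetical incident log: (data went bad, alert fired, fix deployed).
    incidents = [
        (datetime(2024, 5, 1, 2, 0), datetime(2024, 5, 1, 3, 0), datetime(2024, 5, 1, 9, 0)),
        (datetime(2024, 5, 7, 0, 0), datetime(2024, 5, 7, 0, 30), datetime(2024, 5, 7, 4, 30)),
    ]

    # MTTD: average gap from occurrence to detection; MTTR: detection to fix.
    mttd = mean((detected - occurred).total_seconds() / 3600 for occurred, detected, _ in incidents)
    mttr = mean((resolved - detected).total_seconds() / 3600 for _, detected, resolved in incidents)

    print(f"MTTD: {mttd:.2f}h, MTTR: {mttr:.2f}h")  # MTTD: 0.75h, MTTR: 5.00h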

Need this term in action?