Introduction
The saying "garbage in, garbage out" is especially true for AI. Models are only as good as the data they're trained on. Data quality issues don't just reduce performance - they can embed biases, create legal liabilities, and cause real-world harm.
This part covers the critical data quality challenges in AI: labeling, bias, and the emerging threat of data poisoning. Understanding these issues is essential for anyone involved in AI governance.
The Data Labeling Challenge
Supervised learning requires labeled data - examples with the correct answers. Creating these labels is one of the most expensive and error-prone parts of AI development.
Label Quality Issues
- Inconsistency: Different labelers may apply different standards. What one person calls "urgent," another might call "high priority."
- Errors: Human labelers make mistakes, especially with ambiguous or fatiguing tasks.
- Subjectivity: Some labeling tasks are inherently subjective (e.g., "is this content offensive?").
- Expertise: Some domains require expert labelers (medical imaging, legal documents), increasing cost.
Labeling Approaches
- In-house Teams: Higher quality but expensive. Best for sensitive or specialized data.
- Crowdsourcing: Scalable and cheaper but requires quality control measures.
- Active Learning: Prioritize labeling examples where the model is most uncertain.
- Weak Supervision: Use heuristics or other models to generate noisy labels at scale.
- Self-supervision: Design tasks where labels come from data structure itself.
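As an illustration of active learning, the sketch below ranks unlabeled examples by predictive entropy so that the examples the model is least sure about get labeled first. The example IDs and class probabilities are hypothetical; in a real pipeline the probabilities would come from the current model's predictions.

```python
import math

def uncertainty_ranking(predictions):
    """Rank unlabeled examples by predictive entropy, highest first.

    `predictions` maps example IDs to class-probability lists.
    """
    def entropy(probs):
        return -sum(p * math.log(p) for p in probs if p > 0)
    return sorted(predictions, key=lambda ex: entropy(predictions[ex]), reverse=True)

# Hypothetical model outputs for three unlabeled support tickets.
preds = {
    "ticket_1": [0.98, 0.02],  # model is confident
    "ticket_2": [0.55, 0.45],  # model is uncertain -> label this one first
    "ticket_3": [0.80, 0.20],
}
print(uncertainty_ranking(preds))  # ['ticket_2', 'ticket_3', 'ticket_1']
```

Labeling budget then goes to the top of this ranking, rather than to examples the model already handles confidently.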
Quality Control Practices
Robust labeling programs include:
- Multiple labelers per example, with a process for resolving disagreements
- Gold standard examples to check labeler quality
- Regular calibration sessions
- Clear labeling guidelines with examples
- Ongoing monitoring of inter-rater reliability
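A common inter-rater reliability measure is Cohen's kappa, which corrects raw agreement between two labelers for agreement expected by chance. A minimal sketch, using invented labels:

```python
def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa: agreement between two labelers beyond chance.

    1.0 = perfect agreement, 0.0 = chance-level, negative = worse than chance.
    """
    assert len(labels_a) == len(labels_b)
    n = len(labels_a)
    # Observed agreement: fraction of examples where both labelers agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement if each labeler assigned categories independently.
    categories = set(labels_a) | set(labels_b)
    expected = sum(
        (labels_a.count(c) / n) * (labels_b.count(c) / n) for c in categories
    )
    return (observed - expected) / (1 - expected)

# Hypothetical priority labels from two labelers on six tickets.
a = ["urgent", "normal", "urgent", "normal", "urgent", "normal"]
b = ["urgent", "normal", "normal", "normal", "urgent", "urgent"]
print(round(cohens_kappa(a, b), 2))  # 0.33 -- only modest agreement
```

A low kappa on a task like this is a signal to clarify the guidelines or recalibrate the labelers, not to average away the disagreement.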
Bias in Training Data
AI systems can perpetuate and amplify biases present in their training data. Understanding the sources and types of bias is the first step toward mitigation.
Historical Bias
Data reflects past discrimination or inequality. A hiring model trained on historical decisions may learn to prefer candidates similar to those hired before - often reflecting past biases.
Representation Bias
Some groups are underrepresented in training data. Facial recognition systems trained primarily on lighter-skinned faces perform poorly on darker skin tones.
Measurement Bias
The way data is collected or measured differs across groups. Health data from populations with better healthcare access may not generalize to underserved communities.
Label Bias
Labelers bring their own biases. Content moderation labels may reflect cultural biases of the labeling workforce.
Selection Bias
Training data comes from a non-representative sample. Social media data represents only users of specific platforms.
Temporal Bias
Data from one time period may not apply to another. Consumer behavior during a pandemic differs from normal times.
Bias Amplification
AI systems don't just reflect existing biases - they can amplify them. If historical data shows a 60/40 gender split in certain roles, a model optimizing for accuracy may learn to predict the majority class at an even higher rate than 60%, widening the disparity beyond what the data itself contains.
Detecting and Mitigating Bias
Detection Approaches
- Subgroup Analysis: Compare model performance across demographic groups
- Fairness Metrics: Measure demographic parity, equal opportunity, predictive parity
- Counterfactual Testing: See if predictions change when only protected attributes change
- Disparate Impact Analysis: Check if outcomes differ significantly across groups
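Subgroup analysis and demographic parity can be sketched concretely: compute each group's selection rate and report the gap between the best- and worst-treated groups. The decisions below are hypothetical; a real analysis would also cover metrics such as equal opportunity (true-positive-rate gaps).

```python
def selection_rate(decisions):
    """Fraction of positive (1) decisions in a group."""
    return sum(decisions) / len(decisions)

def demographic_parity_gap(decisions_by_group):
    """Difference between the highest and lowest group selection rates."""
    rates = [selection_rate(d) for d in decisions_by_group.values()]
    return max(rates) - min(rates)

# Hypothetical binary hiring decisions (1 = advance to interview).
decisions = {
    "group_a": [1, 1, 0, 1, 0, 1, 1, 0],  # selection rate 5/8 = 0.625
    "group_b": [1, 0, 0, 1, 0, 0, 1, 0],  # selection rate 3/8 = 0.375
}
print(demographic_parity_gap(decisions))  # 0.25
```

A gap of 0.25 means one group is selected 25 percentage points more often than another - a red flag that warrants investigation, though not by itself proof of unfairness.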
Mitigation Strategies
- Pre-processing: Balance or re-weight training data before training
- In-processing: Add fairness constraints to the training objective
- Post-processing: Adjust decision thresholds to equalize outcomes
- Diverse Data Collection: Actively seek representative data
- Diverse Teams: Include varied perspectives in design and review
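Pre-processing can be illustrated with instance re-weighting in the style of Kamiran and Calders' "reweighing": each training example receives a weight that makes group membership and label statistically independent in the weighted data. A minimal sketch with invented groups and labels:

```python
from collections import Counter

def reweigh(groups, labels):
    """Weights that decorrelate group and label in the training data.

    weight(g, y) = P(group=g) * P(label=y) / P(group=g, label=y)
    Underrepresented (group, label) combinations get weights above 1.
    """
    n = len(labels)
    p_group = Counter(groups)
    p_label = Counter(labels)
    p_joint = Counter(zip(groups, labels))
    return [
        (p_group[g] / n) * (p_label[y] / n) / (p_joint[(g, y)] / n)
        for g, y in zip(groups, labels)
    ]

# Hypothetical data: group "a" is mostly labeled 1, group "b" mostly 0.
groups = ["a", "a", "a", "b", "b", "b"]
labels = [1, 1, 0, 1, 0, 0]
print(reweigh(groups, labels))  # [0.75, 0.75, 1.5, 1.5, 0.75, 0.75]
```

The weights are then passed to any training procedure that supports per-example weighting; the rare combinations (group "a" with label 0, group "b" with label 1) are up-weighted.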
No Silver Bullet
Bias mitigation involves trade-offs. Different fairness definitions can conflict with each other and with overall accuracy. The appropriate approach depends on the specific context, stakeholders, and potential harms.
Data Poisoning Attacks
An emerging security concern is intentional manipulation of training data to compromise AI systems. This is known as data poisoning.
What is Data Poisoning?
Attackers inject malicious examples into training data to cause the model to behave incorrectly - either generally or in specific targeted situations.
Types of Data Poisoning
Backdoor Attacks
Insert a "trigger" pattern that causes specific misbehavior. Example: A stop sign with a specific sticker is classified as a speed limit sign.
Targeted Attacks
Cause misclassification of specific inputs while maintaining overall performance. Hard to detect with standard validation.
Availability Attacks
Degrade overall model performance. May be used to sabotage competitors or undermine trust in AI systems.
Real-World Risk
Data poisoning is especially concerning when training data comes from external or user-contributed sources. Web-scraped data, crowdsourced labels, and open-source datasets are all potential attack vectors. Models trained on internet data may have already been influenced by adversarial content.
Defending Against Data Poisoning
- Data Provenance: Track sources and verify trustworthiness
- Anomaly Detection: Identify unusual training examples
- Robust Training: Use techniques that are resilient to outliers
- Data Sanitization: Clean and validate data before training
- Ensemble Methods: Use multiple models to detect inconsistencies
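As a toy illustration of anomaly detection for sanitization, the sketch below flags training points whose distance from the data centroid is more than two standard deviations above the mean. Production defenses (e.g. spectral signatures or activation clustering) are considerably more sophisticated; this only shows the screening idea.

```python
import math
import statistics

def flag_outliers(points, threshold=2.0):
    """Return indices of points unusually far from the centroid."""
    dim = len(points[0])
    centroid = [sum(p[i] for p in points) / len(points) for i in range(dim)]
    dists = [math.dist(p, centroid) for p in points]
    mean, sd = statistics.mean(dists), statistics.pstdev(dists)
    # Flag points whose distance z-score exceeds the threshold.
    return [i for i, d in enumerate(dists) if sd and (d - mean) / sd > threshold]

# Nine ordinary 2-D feature vectors plus one planted outlier at index 9.
data = [(1.0, 1.1), (0.9, 1.0), (1.1, 0.9), (1.0, 1.0), (0.8, 1.1),
        (1.2, 1.0), (1.0, 0.9), (0.9, 0.9), (1.1, 1.1), (9.0, 9.0)]
print(flag_outliers(data))  # [9]
```

Flagged examples would be reviewed or dropped before training. Note that sophisticated poisoning attacks are designed to evade exactly this kind of simple filter, which is why defense in depth (provenance plus detection plus robust training) matters.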
Data Governance Framework
Effective data governance for AI requires systematic approaches across the data lifecycle.
Data Governance Checklist
- Data sources documented with quality assessments
- Data collection consent and legal basis verified
- Sensitive data identified and appropriately protected
- Labeling processes documented with quality metrics
- Bias assessment performed on training data
- Data versioning implemented for reproducibility
- Data retention and deletion policies defined
- Third-party data agreements reviewed
- Data security measures appropriate to sensitivity
- Regular data quality monitoring in place
Data Documentation
"Datasheets for Datasets" and "Data Cards" are emerging standards for documenting training data. They capture information about collection methodology, intended use, known limitations, and ethical considerations - essential for responsible AI development.
Privacy Considerations
AI training data often contains personal information, raising privacy concerns and regulatory requirements.
Key Privacy Concerns
- Training Data Privacy: Models may memorize and potentially leak private information
- Consent: Was data collected with appropriate consent for AI training?
- Right to Deletion: How do you remove someone's data from a trained model?
- Cross-border Transfer: Data used for training may move between jurisdictions
Privacy-Preserving Techniques
- Differential Privacy: Add noise to prevent identification of individuals
- Federated Learning: Train on decentralized data without collecting it centrally
- Anonymization: Remove or mask identifiers (though re-identification is possible)
- Synthetic Data: Generate artificial data that preserves statistical properties
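Differential privacy can be illustrated with the Laplace mechanism for a counting query. A counting query has sensitivity 1 - adding or removing one person changes the count by at most 1 - so noise with scale 1/epsilon suffices. A stdlib-only sketch (the query and counts are hypothetical):

```python
import math
import random

def laplace_noise(scale):
    """Sample Laplace(0, scale) noise via inverse-CDF sampling."""
    u = random.random() - 0.5
    return -scale * math.copysign(1.0, u) * math.log(1 - 2 * abs(u))

def dp_count(true_count, epsilon):
    """Release a count with epsilon-differential privacy.

    Counting queries have sensitivity 1, so the noise scale is 1/epsilon.
    """
    return true_count + laplace_noise(1.0 / epsilon)

# Smaller epsilon = stronger privacy = noisier answer.
print(dp_count(42, epsilon=0.5))  # 42 plus Laplace noise with scale 2
```

Individual answers are noisy, but aggregate analyses over many queries or many records remain accurate, which is the trade-off differential privacy formalizes.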
Key Takeaways
- Data labeling is expensive, error-prone, and critical to model quality
- Bias in training data leads to biased models - possibly amplified
- Multiple types of bias exist: historical, representation, measurement, label, selection, temporal
- Bias mitigation involves trade-offs between different fairness definitions
- Data poisoning is an emerging security threat, especially for web-scraped data
- Comprehensive data governance is essential for responsible AI
- Privacy considerations affect both data collection and model deployment