National Science Foundation
Expeditions in Computing
AMPLab Publications
- ActiveClean: Interactive Data Cleaning For Statistical Modeling
- Data Cleaning: Overview and Emerging Challenges
- ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning (Demonstration Paper)
- PrivateClean: Data Cleaning and Differential Privacy
- Clamshell: Scaling Up Crowds for Low Latency Data Labeling
- SampleClean: Fast and Reliable Analytics on Dirty Data
- Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views
- Wisteria: Nurturing Scalable Data Cleaning Infrastructure
- Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views
- A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data
- Leveraging Transitive Relations for Crowdsourced Joins
- CrowdER: Crowdsourcing Entity Resolution