AMP Lab – UC Berkeley

National Science Foundation
Expeditions in Computing

Main menu

Skip to content
  • About
  • People
  • Papers
  • Projects
  • Software
  • Blog
  • Sponsors
  • Photos
  • Login

Tag Archives:

ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning (Demonstration Paper)

Sanjay Krishnan, Michael Franklin, Ken Goldberg, Eugene Wu, Jiannan Wang
SIGMOD, Jun. 2016.
Tags: best demo award, crowdsourcing, Data Cleaning

Argonaut: Macrotask Crowdsourcing for Complex Data Processing

Daniel Haas, Jason Ansel, Lydia Gu, Adam Marcus
VLDB 2015, Aug. 2015.
Tags: crowdsourcing, Data Cleaning, data quality

Wisteria: Nurturing Scalable Data Cleaning Infrastructure

Daniel Haas, Sanjay Krishnan, Jiannan Wang, Michael Franklin, Eugene Wu
VLDB 2015, Aug. 2015.
Tags: crowdsourcing, Data Cleaning, sampleclean

Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views

Sanjay Krishnan, Jiannan Wang, Michael Franklin, Michael Jordan, Tim Kraska
VLDB 2015 (PVLDB Vol. 8 No. 12), Aug. 2015.
Tags: crowdsourcing, Data Cleaning, data quality, Materialized Views, Sampling

A Methodology for Learning, Analyzing, and Mitigating Social Influence Bias in Recommender Systems

Sanjay Krishnan, Jay Patel, Michael Franklin, Ken Goldberg
ACM Conference on Recommender Systems, Oct. 2014.
Tags: bias mitigation, crowdsourcing, Data Cleaning

A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data

Jiannan Wang, Sanjay Krishnan, Michael Franklin, Ken Goldberg, Tim Kraska, Tova Milo
SIGMOD, Jun. 2014.
Tags: Big Data, crowdsourcing, Data Cleaning, query processing, Sampling

CrowdER: Crowdsourcing Entity Resolution

Jiannan Wang, Tim Kraska, Michael Franklin, Jianhua Feng
Proceedings of the VLDB Endowment 2012, Vol. 5, No. 10, Aug. 2012.
Tags: crowdsourcing, Data Cleaning, data quality


Tags

Akaros amp application Approximate Query Processing BDAS Best Paper Award Big Data BlinkDB Bootstrap cluster coflow consistency crowdsourcing databases Datacenters data centers Data Cleaning data quality Declarative ML distributed machine learning genomics Graphs hadoop Machine Learning Materialized Views matrix factorization mesos MLbase Optimization OS pbs PIQL query processing Sampling SCADS scalability scale independence scheduling Shark spark SQL storage Succinct transactions vldb

  • Come Visit
  • Contact
  • Open Positions


  • About
  • People
  • Publications
  • Projects
  • Seminars
  • Blog: AMP BLAB
  • Sponsors
  • Photos
  • Wiki
  • Jenkins
Copyright © 2021 AMPLab