At the AMPLab, we are constantly looking for ways to improve the performance and user experience of large scale advanced …
SparkNet
Training deep networks is a time-consuming process, with networks for object recognition often requiring multiple days to train. For this …
CoCoA: A Framework for Distributed Optimization
A major challenge in many large-scale machine learning tasks is to solve an optimization objective involving data that is distributed …
KeystoneML
KeystoneML is a research project exploring techniques to simplify the construction of large-scale, end-to-end machine learning pipelines. KeystoneML is designed around …
Splash: Efficient Stochastic Learning on Clusters
Splash is a general framework for parallelizing stochastic learning algorithms (SGD, Gibbs sampling, etc.) on multi-node clusters. It consists of a …
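One common pattern for parallelizing SGD — run it independently on each worker's data shard and then combine the local models — can be sketched in plain Python. This is a toy one-shot model-averaging sketch on a least-squares problem, not Splash's actual API (Splash runs on Spark and uses a more refined reweighted combination); all names and the objective here are illustrative:

```python
import random

def local_sgd(shard, w, lr=0.1, epochs=20):
    """Plain SGD for one-dimensional least squares (y ≈ w * x) on one worker's shard."""
    for _ in range(epochs):
        for x, y in shard:
            grad = 2 * (w * x - y) * x   # gradient of (w*x - y)^2 in w
            w -= lr * grad
    return w

# Toy data: y = 3x plus a little noise, partitioned across 4 "workers".
random.seed(0)
data = [(i / 40, 3 * (i / 40) + random.gauss(0, 0.01)) for i in range(1, 41)]
shards = [data[i::4] for i in range(4)]

# Each worker runs SGD independently (the parallelizable step);
# the local models are then combined by simple averaging,
# landing near the true slope 3.
w = sum(local_sgd(shard, w=0.0) for shard in shards) / len(shards)
```

In a real cluster the per-shard calls would run on separate machines, with only the scalar (or vector) models communicated back for averaging.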
GraphX: Large-Scale Graph Analytics
Increasingly, data-science applications require the creation, manipulation, and analysis of large graphs ranging from social networks to language …
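The vertex-centric, message-passing style of graph computation that systems like GraphX support can be illustrated with a miniature PageRank. GraphX itself exposes a Scala/Spark API (including a Pregel operator); this toy graph and plain-Python loop are purely illustrative:

```python
# Toy PageRank in the message-passing style: each vertex sends
# rank / out_degree along its out-edges, then updates its own rank.
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
rank = {v: 1.0 for v in graph}

for _ in range(30):
    msgs = {v: 0.0 for v in graph}
    for src, dsts in graph.items():
        for dst in dsts:
            msgs[dst] += rank[src] / len(dsts)
    # Standard damped update with damping factor 0.85.
    rank = {v: 0.15 + 0.85 * msgs[v] for v in graph}

# "c" receives the most links, so it ends with the highest rank;
# "d" receives none, so it ends with the lowest.
```

Vertex-centric frameworks distribute exactly this loop: vertices and their messages are partitioned across machines, and each iteration becomes a bulk-synchronous superstep.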
Concurrency Control for Machine Learning
Many machine learning (ML) algorithms iteratively transform some global state (e.g., model parameters or variable assignments), giving the illusion of …
MLbase: Distributed Machine Learning Made Easy
Implementing and consuming Machine Learning techniques at scale are difficult tasks for ML Developers and End Users. MLbase is a platform …
BLB: Bootstrapping Big Data
The bootstrap provides a simple and powerful means of assessing the quality of estimators. However, in settings involving very large …
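The classic bootstrap the excerpt describes fits in a few lines of plain Python. This is an illustrative sketch only — BLB's contribution is precisely avoiding the full-size resamples this naive version creates by working on many small subsamples instead:

```python
import random
import statistics

random.seed(0)
data = [random.gauss(10, 2) for _ in range(200)]

def bootstrap_se(data, estimator, n_boot=500):
    """Classic bootstrap: resample with replacement, recompute the estimator,
    and use the spread of the replicates to assess its quality."""
    reps = []
    for _ in range(n_boot):
        sample = [random.choice(data) for _ in data]
        reps.append(estimator(sample))
    return statistics.stdev(reps)

# Bootstrap standard error of the sample mean; analytically this is
# roughly sigma / sqrt(n) = 2 / sqrt(200) ≈ 0.14 for this data.
se = bootstrap_se(data, statistics.mean)
```

Note that each resample is the same size as the original dataset — cheap here, but prohibitive when the data itself is terabytes, which is the regime BLB targets.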
DFC — Divide-and-Conquer Matrix Factorization
Divide-Factor-Combine (DFC) is a parallel divide-and-conquer framework for noisy matrix factorization problems, e.g., matrix completion and robust matrix factorization. DFC …
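The divide-factor-combine idea can be illustrated on a toy rank-1 problem in plain Python. This is a minimal sketch under strong assumptions — exact rank-1 data, two column blocks, power iteration as the base factorizer — not the DFC algorithm itself:

```python
def rank1_factor(block):
    """Power iteration for the leading singular pair of a small block (list of rows)."""
    m, n = len(block), len(block[0])
    v = [1.0] * n
    for _ in range(50):
        u = [sum(block[i][j] * v[j] for j in range(n)) for i in range(m)]
        norm_u = sum(x * x for x in u) ** 0.5
        u = [x / norm_u for x in u]
        v = [sum(block[i][j] * u[i] for i in range(m)) for j in range(n)]
    return u, v  # block ≈ outer(u, v)

# Divide: split a rank-1 matrix M = outer(a, b) column-wise into two sub-problems.
a = [1.0, 2.0, 3.0]
b = [4.0, 5.0, 6.0, 7.0]
M = [[ai * bj for bj in b] for ai in a]
blocks = [[row[:2] for row in M], [row[2:] for row in M]]

# Factor: each block is factored independently (the parallelizable step).
u0, v0 = rank1_factor(blocks[0])
u1, v1 = rank1_factor(blocks[1])

# Combine: sign-align the second block's left factor with the first,
# then concatenate the column factors to rebuild the full matrix.
s = 1.0 if sum(x * y for x, y in zip(u0, u1)) > 0 else -1.0
v_full = v0 + [s * x for x in v1]
M_hat = [[u0[i] * v_full[j] for j in range(len(v_full))] for i in range(len(u0))]
```

Because every block of a rank-1 matrix shares the same column space, the independently computed factors agree up to sign, and the combined reconstruction matches M; DFC's analysis extends this intuition to noisy, higher-rank settings.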