AMP Camp Hands-on Big Data Mini Course now online

Error: Unable to create directory uploads/2024/04. Is its parent directory writable by the server?

In this post, I will first summarize AMP Camp Two, the Big Data Training course we recently ran at the Strata Big Data California conference, and then introduce our newly released online Big Data Mini Course.

In February we had the rare opportunity to host AMP Camp Two as part of the O’Reilly Strata Conference on Big Data in Santa Clara, CA.

AMP Camp Two was a full day event consisting of two tutorials. The first tutorial consisted of talks presenting an overview of the Berkeley Data Analytics Stack BDAS), Spark, Spark Streaming, Shark, and Machine Learning algorithms built on Spark. At the second tutorial, we provided each attendee with an EC2 cluster running Spark, Shark, and Spark Streaming, and guided them through a set of hands on exercises analyzing real Wikipedia and Twitter data.


Attendees getting hands-on, analyzing real data with Spark and Shark in our second Strata tutorial

The event was a great success! Both tutorials were packed and we received overwhelmingly positive feedback. Details about AMP Camp Two, including slides from the talks, are archived on the AMP Camp website.

Big Data Mini Course

As a follow-up to AMP Camp Two, we have posted an extensively revised and expanded hands-on AMP Camp Mini Course, which walks you through setting up a cluster on EC2 using your own Amazon credentials, then using Spark and Shark to do ad-hoc analytics on real Wikipedia data, writing a Spark Streaming job to process data collected via the Twitter API, and writing a more advanced machine learning data clustering algorithm. We hope you take the opportunity to learn more about some of the most exciting open-source data analytics tools.