Learn how to process massive amounts of streaming data in real time on a cluster, using Spark Streaming! Includes a crash course in Scala, and lots of hands-on examples of connecting to various data sources such as Kafka, Flume, TCP ports, Cassandra, and more.
Learn the technology that started it all – MapReduce! MapReduce is at the heart of Hadoop, and offers a programming model for processing massive data sets on a cluster in the cloud. Get hands-on with lots of examples using the Python programming language, ranging from simple tasks all the way to anlayzing social networks and making movie recommendations with real data sets.
Welcome! I’m excited to share my 9 years of experience at Amazon.com and IMDb.com with you, where I managed and developed features such as top sellers, “people who bought also bought”, personalized recommendations for products and movies, and homepage optimization. Thanks to the revolution in online learning, I can teach you the same data mining, machine learning, and big data processing techniques needed to land a job as a data scientist or software engineer in these very hot fields.
Hit the “get course” buttons above, and you’ll be taken to my course descriptions at Udemy.com along with a coupon code for a steep discount! All of my courses are very hands-on, and dive right into real exercises using the Python or Scala programming languages. We’ll mine big data to find relationships between movies, recommend movies, analyze social graphs of super-heroes, detect spam emails, search Wikipedia, and much more!