Learn the technology that started it all – MapReduce! MapReduce is at the heart of Hadoop, and offers a programming model for processing massive data sets on a cluster in the cloud. Get hands-on with lots of examples using the Python programming language, ranging from simple tasks all the way to analyzing social networks and making movie recommendations with real data sets.
Author: Frank Kane
Learn Big Data on AWS / Elastic MapReduce at Safari Books Online
Check out my one-hour screencast courses at O’Reilly’s Safari Books Online! Learn the ins and outs of “Analyzing Big Data with Hadoop, AWS, and EMR”, “Analyzing Big Data with Spark and Amazon EMR”, or “Introduction to the Amazon Elasticsearch Service”, all available to Safari subscribers.
About Frank Kane
Welcome! I’m excited to share with you my 9 years of experience at Amazon.com and IMDb.com, where I managed and developed features such as top sellers, “people who bought also bought”, personalized recommendations for products and movies, and homepage optimization. Thanks to the revolution in online learning, I can teach you the same data mining, machine learning, and big data processing techniques needed to land a job as a data scientist or software engineer in these very hot fields.
Hit the “get course” buttons above, and you’ll be taken to my course descriptions at Udemy.com along with a coupon code for a steep discount! All of my courses are very hands-on, and dive right into real exercises using the Python or Scala programming languages. We’ll mine big data to find relationships between movies, recommend movies, analyze social graphs of super-heroes, detect spam emails, search Wikipedia, and much more!