Become a data scientist in the tech industry! Learn how to use Python for a wide variety of data science, machine learning, and data mining application, with hands-on code and real-world examples. My most popular course!
Author Archives: fkane
Learn the larger Hadoop ecosystem and the distributed computing technologies it works with. 15 hours of video content covers over 25 different systems, with hands-on practice and exercises. My biggest course yet!
“Elasticsearch 6 and Elastic Stack: In-Depth and Hands-On!” is here! This comprehensive online course covers using Elasticsearch, Logstash, Beats, Kibana, and X-Pack with lots of hands-on examples and exercises, including importing data into Elasticsearch in many different ways. Enroll now to learn these very hot and very valuable skills. (Also available: Elasticsearch 5)
Learn the hottest technology in wrangling big data on a cluster: Apache Spark! Spark works best with the Scala programming language, so this course will get you up to speed on Scala before diving into everything Spark can do – with lots of hands-on examples and exercises. Now also available in book form!
Prefer to learn Spark using the more-familiar Python programming language? Get hands-on with the concepts of Apache Spark, and you’ll be computing similar movies using a million movie ratings on a real Hadoop cluster by the end of the course – all just using Python.
Learn how to process massive amounts of streaming data in real time on a cluster, using Spark Streaming! Includes a crash course in Scala, and lots of hands-on examples of connecting to various data sources such as Kafka, Flume, TCP ports, Cassandra, and more.
Learn the technology that started it all – MapReduce! MapReduce is at the heart of Hadoop, and offers a programming model for processing massive data sets on a cluster in the cloud. Get hands-on with lots of examples using the Python programming language, ranging from simple tasks all the way to anlayzing social networks and making movie recommendations with real data sets.
Check out my one-hour screencast courses at O’Reilly’s Safari Books Online! Learn the ins and outs of Analyzing Big Data with Hadoop, AWS, and EMR, Analyzing Big Data with Spark and Amazon EMR, or Introduction to the Amazon Elasticsearch Service available to subscribers of Safari.
Welcome! I’m excited to share my 9 years of experience at Amazon.com and IMDb.com with you, where I managed and developed features such as top sellers, “people who bought also bought”, personalized recommendations for products and movies, and homepage optimization. Thanks to the revolution in online learning, I can teach you the same data mining, machine learning, and big data processing techniques needed to land a job as a data scientist or software engineer in these very hot fields.
Hit the “get course” buttons above, and you’ll be taken to my course descriptions at Udemy.com along with a coupon code for a steep discount! All of my courses are very hands-on, and dive right into real exercises using the Python or Scala programming languages. We’ll mine big data to find relationships between movies, recommend movies, analyze social graphs of super-heroes, detect spam emails, search Wikipedia, and much more!