Blog

Apache Spark Tutorial for Beginners Part 1 – Installing Spark

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 2 – Introduction to Spark

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 3 – Resilient Distributed Dataset

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 4 – Using RDDs in Spark

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 5 – Spark SQL

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 6 – Using DataSets

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 7 – Using MLLib

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

Apache Spark Tutorial for Beginners Part 8 – Project Solution

Apache Spark is arguably the hottest technology in the field of big data right now. It allows you to process and extract meaning from massive data sets on a cluster, whether it is a Hadoop cluster you administer or a...

How to Choose the Right Database? – MongoDB, Cassandra, MySQL, HBase

Choosing the right database for your application is no easy task. You have a wide variety of options relational databases such as MySQL, or distributed NoSQL solutions such as MongoDB, Cassandra, and HBase. NoSQL has come to mean not only...

Kafka Tutorial for Beginners

Learn to stream big data with Kafka, starting from scratch. Kafka is a powerful data streaming technology and a very hot technical skill to have right now. With Kafka, you can publish streams of data from web logs, sensors, or...