2:45 PM Sunday Room: SC-127
In this session, I will introduce the lightning-fast Apache Spark framework and discuss why people are replacing Hadoop MapReduce with Spark. I will discuss the areas where Spark really shines and cover some real-world Spark scenarios. In addition, I will review some misconceptions about Spark.
In case you’re not familiar with Spark….
Spark is a cluster-computing framework for processing a large amount of data. It has taken the big data world by fire. It is one of the most active open-source projects. All major Hadoop distributors including Cloudera, Hortonworks, and MapR include Spark in their distributions.