Silicon Valley Code Camp : October 7 & 8, 2017session
Using Apache Spark To Determine Whether San Francisco Restaurants Are Clean
Apache Spark is a powerful and popular data processing engine due to its speed, ease of use, and flexibility. This session will demonstrate some of Spark's advanced features through the analysis San Francisco Restaurant inspection data.
About This Session
Apache Spark has become one of the must-know big data technologies due to its speed, ease of use, and flexibility. With each newer version, Spark is even faster, provides more powerful new features to make it even easier than before to build intelligent and scalable data processing infrastructure and applications. This session will start with a quick introduction of Spark advanced features and then proceeds to demonstrate some of those advanced features through the analysis San Francisco Restaurant inspection data. The data analysis part will help answering the important question of whether San Francisco restaurants are clean. By attending this session, attendees will gain a good understanding of some of Spark’s advanced capabilities and see how Spark’s features make it easy to perform exploratory data analysis.