Mastering Apache Spark 2.0

Updated 8 hours ago

Mastering Apache Spark 2.0

Welcome to Mastering Apache Spark 2.0 (aka #SparkNotes)!

I’m Jacek Laskowski, an independent consultant who is passionate about software development and teaching people in effective use of Apache Spark, Scala, sbt, and Apache Kafka (with a bit of Hadoop YARN, Apache Mesos, and Docker). I lead Warsaw Scala Enthusiasts and Warsaw Spark meetups in Warsaw, Poland.

If you like the Apache Spark notes you should seriously consider participating in my own, very hands-on Spark Workshops for Developers, Administrators and Operators.

This collections of notes (what some may rashly call a "book") serves as the ultimate place of mine to collect all the nuts and bolts of using Apache Spark. The notes aim to help me designing and developing better products with Apache Spark. It is also a viable proof of my understanding of Apache Spark. I do eventually want to reach the highest level of mastery in Apache Spark.

It may become a book one day, but surely serves as the study material for trainings, workshops, videos and courses about Apache Spark. Follow me on twitter @jaceklaskowski to know it early. You will also learn about the upcoming events about Apache Spark.

Expect text and code snippets from Spark’s mailing lists, the official documentation of Apache Spark, StackOverflow, blog posts, books from O’Reilly, press releases, YouTube/Vimeo videos, Quora, the source code of Apache Spark, etc. Attribution follows.