Spark Structured Streaming (Apache Spark 2.2+)

Updated 10 hours ago

Spark Structured Streaming (Apache Spark 2.2+)

Welcome to Spark Structured Streaming gitbook!

I’m Jacek Laskowski, an independent consultant, developer and trainer who is specializing in Apache Spark, Apache Kafka, Scala and sbt (with a bit of Apache Mesos, Hadoop YARN, and DC/OS). I lead Warsaw Scala Enthusiasts and Warsaw Spark meetups in Warsaw, Poland.

If you like the gitbook you should seriously consider participating in my own, very hands-on, in-depth Apache Spark Workshops and Webinars.

Spark Structured Streaming gitbook serves as the ultimate place of mine to collect all the nuts and bolts of using Spark Structured Streaming in the most effective way. The notes aim to help me designing and developing better products with Apache Spark. It is also a viable proof of my current understanding of Apache Spark. I do eventually want to reach the highest level of mastery in Apache Spark (as do you!)

The collection of notes serves as the study material for my trainings, workshops, videos and courses about Apache Spark. Follow me on twitter @jaceklaskowski to know it early. You will also learn about the upcoming events about Apache Spark.

Expect text and code snippets from Spark’s mailing lists, the official documentation of Apache Spark, StackOverflow, blog posts, books from O’Reilly (and other publishers), press releases, conferences, YouTube or Vimeo videos, Quora, the source code of Apache Spark, etc. Attribution follows whenever possible.