This is a collections of notes about Apache Spark's best practices. The notes aim to help me design and develop better programs with Apache Spark.
Last updated 2 months ago
This book will introduce the core concepts in Apache Spark 1.x via hands-on coding to solve problems using real-world datasets over databricks notebooks.
Last updated 3 months ago
Last updated a year ago
A quick to devour series, specially made for fast learners...
Last updated 7 months ago
Notes made while solving various challenges encountered at work and personal study.
Last updated 8 months ago
Parse and validate [Gnip PowerTrack Rules](http://support.gnip.com/apis/powertrack/rules.html).
Last updated 2 years ago