Notes on Apache Spark Performance Optimization & Tuning, Part 1


TL;DR: these are personal study notes on Apache Spark optimization, focusing especially on the basics but also covering features added after version 3.0.


Some Background on Adaptive Query Execution

Performance Optimization on Spark: Cost-Based Optimization