Tag Archives: Shuffle

Understand the shuffle component in spark-core

This blog has moved to a new address: http://www.trongkhoanguyen.com. Shuffle is one of the most expensive operations in Spark and can significantly affect the performance of a job. Even though Spark tries to avoid shuffling whenever it can, some operations require shuffle …
