Databricks

Shuffle Bottleneck from Network Saturation

Resource Contention

Spark shuffle operations (joins, aggregations) saturate network bandwidth when spark_executor_shuffleread and spark_executor_shufflewrite volumes spike, causing executor task delays. Network becomes bottleneck before CPU or memory pressure appears.

Databricks insight details requires a free account. Sign in with Google or GitHub to access the full knowledge base.

Sign in to access