datafusion.operator.spill_count
Number of spill eventsDimensions:None
Interface Metrics (3)
Dimensions:None
Dimensions:None
Sources
Technical Annotations (50)
Configuration Parameters (16)
memory_pool.soft_limitrecommended: for spillable operators onlymemory_pool.hard_limitrecommended: soft_limit * 120%datafusion.execution.max_spill_file_size_bytesrecommended: 134217728datafusion.execution.sort_spill_reservation_bytesrecommended: 10485760with_disk_manager_osrecommended: enabledwith_fair_spill_poolrecommended: 100000000batch_sizerecommended: increase from small values (e.g., 2)memory_pool.limitrecommended: increase from 1600 bytes minimumRuntimeConfig.memory_poolrecommended: FairSpillPool with 2-3x expected memoryMemoryConsumer.with_can_spillrecommended: truesort_spill_reservation_bytesmemory_limitprefer_hash_joinhash_join_single_partition_thresholdhash_join_single_partition_threshold_rowscollect_left_thresholdError Signatures (5)
ResourcesExhausted("Additional allocation failedexceptionFailed to allocate additionallog patternArrowError(InvalidArgumentError("number of columns(3) must match number of fields(2) in schema"), None)exceptionnumber of columns must match number of fields in schemalog patternResourcesExhaustedexceptionTechnical References (29)
multi_level_merge.rsfile pathnum_spillcomponentfair poolcomponentspillable consumerconceptRepartitionExeccomponentExternalSortercomponentFairSpillPoolcomponenttry_growcomponentMemoryPoolcomponentRuntimeConfigcomponentGroupValuesColumncomponentvectorized_interncomponentGroupOrdering::FullcomponentGroupedHashAggregateStreamcomponentAggregateExeccomponentdatafusion/physical-plan/src/aggregates/row_hash.rsfile pathHashAggregationExeccomponentsort_batchcomponentgroup_valuescomponentspillingconceptmemory_poolcomponenthybrid hash joinconceptspill thresholdconceptMemoryReservationcomponentpartitioned hash joinconceptJoinPlanner traitcomponenthash joincomponentsort-merge joincomponentmemory spillingconceptRelated Insights (18)
Multi-partition sorting hits memory bugs causing spill coordination failurescritical
▸
Fair pool unfairly allocates memory between spillable and non-spillable operatorswarning
▸
ExternalSort fails when non-spillable input operators exhaust memory poolcritical
▸
FairSpillPool allows premature OOM failures on non-spillable operatorswarning
▸
Excessive spill files created when max_spill_file_size_bytes is too lowwarning
▸
Sort operations run out of memory when sort_spill_reservation_bytes is insufficientcritical
▸
Missing disk spill manager prevents large query executionwarning
▸
Spillable aggregation produces duplicate group keys due to internal state mismatchcritical
▸
Schema mismatch causes GroupedHashAggregateStream spill failure with multiple aggregationscritical
▸
Hash aggregation spill doubles memory usage due to sort_batch copycritical
▸
Batch size checks prevent small spills under memory pressurewarning
▸
Out of memory on large table joins with 16GB RAMcritical
▸
Hybrid hash join spill threshold misconfiguration causes premature spillingwarning
▸
Partitioned hash join memory coordination failure with shared poolwarning
▸
Memory exhaustion from insufficient memory pool limits on resource-constrained systemscritical
▸
Join algorithm selection impacts memory usage vs CPU performance tradeoffswarning
▸
Memory pressure during joins triggers disk spilling with performance impactwarning
▸
Peak memory consumption during hash joins with poor cardinality estimateswarning
▸