datafusion.query.execution_time
Total query execution time
Dimensions: None
Interface Metrics (2)
Dimensions: None
Dimensions: None
Sources
Technical Annotations (54)
Configuration Parameters (10)
datafusion.execution.parquet.pushdown_filters (recommended: false)
datafusion.execution.target_partitions (recommended: 1)
datafusion.execution.parquet.binary_as_string (recommended: true)
datafusion.optimizer.prefer_hash_join (recommended: true)
datafusion.optimizer.top_down_join_key_reordering (recommended: true)
datafusion.optimizer.filter_null_join_keys (recommended: true)
datafusion.optimizer.max_passes (recommended: 3)
DATAFUSION_OPTIMIZER_REPARTITION_JOINS (recommended: true)
DATAFUSION_OPTIMIZER_HASH_JOIN_SINGLE_PARTITION_THRESHOLD (recommended: 0)
DATAFUSION_OPTIMIZER_HASH_JOIN_SINGLE_PARTITION_THRESHOLD_ROWS (recommended: 0)
CLI Commands (3)
SELECT * FROM lineitem, orders WHERE l_orderkey = o_orderkey AND o_orderkey = 1 AND l_quantity < (SELECT avg(l_quantity) FROM lineitem WHERE l_orderkey = o_orderkey);
(diagnostic)
cargo run --profile release-nonlto --bin dfbench tpcds --query 99 --iterations 3 --path benchmarks/data/tpcds_sf1 --query_path datafusion/core/tests/tpc-ds --prefer_hash_join true
(diagnostic)
datafusion-cli -c "select sum(l_extendedprice) / 7.0 as avg_yearly from lineitem, part where p_partkey = l_partkey and p_brand = 'Brand#23' and p_container = 'MED BOX' and l_quantity < (select 0.2 * avg(l_quantity) from lineitem where l_partkey = p_partkey);"
(diagnostic)
Technical References (41)
ArrowPredicate API (component)
late materialization (concept)
metrics collection (component)
data batches (concept)
array_has (component)
array_has_any (component)
branch-49 (component)
f43df3f2ae3aafb347996c58e852cc378807095b (component)
CrossJoin (component)
Inner Join (component)
logical_plan (component)
SessionConfig (component)
RuntimeConfig (component)
optd (component)
cardinality estimation (concept)
prost (component)
gogo/protobuf (component)
protobuf serialization (concept)
NestedLoopJoin (component)
HashJoin (component)
selectivity (concept)
IMDB benchmark (concept)
join parameterization (concept)
predicate pushdown (concept)
TPC-H (concept)
external join (component)
TreeNode API (component)
LogicalPlan (component)
CollectLeft (component)
Partitioned (component)
HashJoinExec (component)
AggregateExec (component)
Final (component)
FinalPartitioned (component)
GroupedHashAggregateStream (component)
concat_batches (component)
build side (concept)
probe side (concept)
star schema (concept)
right deep tree (concept)
EXPLAIN ANALYZE (component)
Related Insights (19)
Parquet filter pushdown causes query slowdowns for specific query patterns (warning)
Metrics collection overhead degrades query performance on small batches (warning)
Array membership filter performance degrades linearly with array size in DataFusion 50 (warning)
DataFusion 49 nested loop join underutilizes CPU for array membership queries (info)
Query optimizer regression causes cross join instead of inner join (warning)
Undefined pipeline success rate and duration thresholds delay detection of data issues (warning)
Default parallelism settings limit out-of-box query performance (warning)
Join order cardinality estimation failures cause query performance disasters (critical)
prost protobuf serialization bottleneck degrades throughput by 40%+ (warning)
Batch splitting in joins may cause performance regression (warning)
Nested loop join batch size fix causes performance regression on certain query patterns (warning)
Native DataFusion scan performance optimization opportunities identified (info)
Join parameterization missing causes full table scans on selective queries (critical)
TPC-H queries fail under fuzzed memory limits with external joins (critical)
Cardinality estimation errors cause suboptimal plans for queries with 3+ joins (warning)
Hash join optimizer selects non-partitioned mode causing 52x slower query execution (critical)
Aggregation operations under-partition causing multi-fold performance degradation (warning)
concat_batches overhead causes 33x query performance degradation (critical)
Suboptimal join order causes 60% query performance degradation on multi-table joins (warning)
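The Configuration Parameters list above mixes dotted keys (datafusion.optimizer.max_passes) with environment-variable names (DATAFUSION_OPTIMIZER_REPARTITION_JOINS). DataFusion derives the environment-variable name from the dotted key by upper-casing it and replacing dots with underscores. The sketch below is a minimal illustration of that mapping, with the recommended values copied from the list. The helper names are hypothetical, and the dotted forms of the three upper-case entries are an assumption obtained by reversing the naming convention.

```python
# Recommended values as given in the Configuration Parameters list.
# Assumption: the last three dotted keys are inferred from the
# upper-case environment-variable names shown in that list.
RECOMMENDED = {
    "datafusion.execution.parquet.pushdown_filters": "false",
    "datafusion.execution.target_partitions": "1",
    "datafusion.execution.parquet.binary_as_string": "true",
    "datafusion.optimizer.prefer_hash_join": "true",
    "datafusion.optimizer.top_down_join_key_reordering": "true",
    "datafusion.optimizer.filter_null_join_keys": "true",
    "datafusion.optimizer.max_passes": "3",
    "datafusion.optimizer.repartition_joins": "true",
    "datafusion.optimizer.hash_join_single_partition_threshold": "0",
    "datafusion.optimizer.hash_join_single_partition_threshold_rows": "0",
}


def env_var_name(key: str) -> str:
    """Hypothetical helper: dotted config key -> environment-variable name."""
    return key.upper().replace(".", "_")


def set_statement(key: str, value: str) -> str:
    """Hypothetical helper: render a SQL SET statement for one option."""
    return f"SET {key} = {value};"


if __name__ == "__main__":
    # Print both forms for each option so they can be pasted into
    # a SQL session or a shell environment.
    for key, value in RECOMMENDED.items():
        print(set_statement(key, value))
        print(f"  env: {env_var_name(key)}={value}")
```

The SET statements can be issued in a datafusion-cli session before running the diagnostic queries. The environment-variable form is read when the session configuration is created, so it has to be exported before the process starts.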