Technologies/LlamaIndex/llama_index.query_engine.duration
LlamaIndexLlamaIndexMetric

llama_index.query_engine.duration

Query engine duration
Dimensions:None
Available on:OpenTelemetryOpenTelemetry (1)
Interface Metrics (1)
OpenTelemetryOpenTelemetry
Duration of query engine operations in milliseconds
Dimensions:None
Related Insights (2)
LlamaIndex Query Latency P95 Degradationwarning

LlamaIndex query response times degrade at P95/P99 percentiles due to slow LLM calls, inefficient retrieval, or tool execution bottlenecks without granular latency breakdown.

LlamaIndex Query Engine Request Failurecritical

Query engine failures prevent users from receiving answers due to LLM API errors, retrieval failures, or agent execution errors without proper error handling and fallback mechanisms.