Technologies/LlamaIndex/llama_index.retrieval.duration
LlamaIndexLlamaIndexMetric

llama_index.retrieval.duration

Retrieval operation duration
Dimensions:None
Available on:OpenTelemetryOpenTelemetry (1)
Interface Metrics (1)
OpenTelemetryOpenTelemetry
Duration of document retrieval operations in milliseconds
Dimensions:None
Related Insights (3)
LlamaIndex Query Latency P95 Degradationwarning

LlamaIndex query response times degrade at P95/P99 percentiles due to slow LLM calls, inefficient retrieval, or tool execution bottlenecks without granular latency breakdown.

LlamaIndex Retrieval Result Quality Degradationwarning

LlamaIndex retrieval returns insufficient or irrelevant documents, degrading answer quality due to poor index coverage, misconfigured similarity thresholds, or index staleness.

LlamaIndex Query Engine Request Failurecritical

Query engine failures prevent users from receiving answers due to LLM API errors, retrieval failures, or agent execution errors without proper error handling and fallback mechanisms.