llama_index.embedding.requests

Embedding requests count

Dimensions:None

Available on:

Datadog (1)

Interface Metrics (1)

Datadog

llamaindex.embedding.requests

Number of embedding requests made to the embedding model

Dimensions:None

Sources

llamaindex.embedding.requestsdocs.datadoghq.com

Related Insights (3)

LlamaIndex Embedding Token Inefficiencywarning

LlamaIndex embedding operations consume excessive tokens due to redundant document processing, lack of caching, or inefficient chunking strategies, increasing costs and latency.

▸

LlamaIndex Embedding Request Failurecritical

Embedding API failures during document indexing or query time cause incomplete indexes or degraded retrieval quality without proper error handling and retry logic.

▸

LlamaIndex Embedding Batch Processing Inefficiencyinfo

Document embedding during indexing is inefficient due to small batch sizes or lack of batching, causing excessive API calls, increased latency, and higher costs compared to batch processing.

▸