llama_index.embedding.tokens

Total tokens in embeddings

Dimensions:None

Available on:

OpenTelemetry (1)

Datadog (1)

Interface Metrics (2)

OpenTelemetry

llama_index.embedding.tokens

Number of tokens processed during embedding operations

Dimensions:None

Datadog

llamaindex.embedding.tokens.total

Total number of tokens used in embedding operations (prompt + completion)

Dimensions:None

Sources

llama_index.embedding.tokensgithub.com

llamaindex.embedding.tokens.totaldocs.datadoghq.com

Related Insights (2)

LlamaIndex Embedding Token Inefficiencywarning

LlamaIndex embedding operations consume excessive tokens due to redundant document processing, lack of caching, or inefficient chunking strategies, increasing costs and latency.

▸

LlamaIndex Embedding Batch Processing Inefficiencyinfo

Document embedding during indexing is inefficient due to small batch sizes or lack of batching, causing excessive API calls, increased latency, and higher costs compared to batch processing.

▸