llama_index.embedding.tokens
Total tokens in embeddingsDimensions:None
Interface Metrics (2)
Dimensions:None
Dimensions:None
Related Insights (2)
LlamaIndex Embedding Token Inefficiencywarning
LlamaIndex embedding operations consume excessive tokens due to redundant document processing, lack of caching, or inefficient chunking strategies, increasing costs and latency.
▸
LlamaIndex Embedding Batch Processing Inefficiencyinfo
Document embedding during indexing is inefficient due to small batch sizes or lack of batching, causing excessive API calls, increased latency, and higher costs compared to batch processing.
▸