OpenAI

Token Usage Growth Driving Latency Increase

latency

Latency can degrade without any code changes when the average number of output tokens per response grows over time. Because output tokens are generated sequentially, longer responses directly increase Request Time. A rising trend can indicate shifting user behavior or prompt patterns.
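One way to check for this is to compare average output tokens and average latency per period. The following is a minimal sketch, not a definitive implementation: the `RequestLog` record, its `day` index, and the helper names are hypothetical, and it assumes each request's output token count (for example the `usage.completion_tokens` value returned by the API) and measured latency have already been logged.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class RequestLog:
    """Hypothetical log record; field names are assumptions for illustration."""
    day: int                # index of the day the request was made
    completion_tokens: int  # output tokens reported for the request
    latency_s: float        # measured request time in seconds


def daily_averages(logs):
    """Return {day: (avg_output_tokens, avg_latency_s)} ordered by day."""
    by_day = {}
    for record in logs:
        by_day.setdefault(record.day, []).append(record)
    return {
        day: (
            mean(r.completion_tokens for r in rows),
            mean(r.latency_s for r in rows),
        )
        for day, rows in sorted(by_day.items())
    }


def token_growth_ratio(averages):
    """Ratio of the last day's average output tokens to the first day's."""
    days = sorted(averages)
    return averages[days[-1]][0] / averages[days[0]][0]


if __name__ == "__main__":
    # Illustrative sample values only, not real measurements.
    logs = [
        RequestLog(day=0, completion_tokens=100, latency_s=2.0),
        RequestLog(day=0, completion_tokens=200, latency_s=4.0),
        RequestLog(day=1, completion_tokens=300, latency_s=6.0),
        RequestLog(day=1, completion_tokens=600, latency_s=12.0),
    ]
    averages = daily_averages(logs)
    for day, (tokens, latency) in averages.items():
        print(f"day {day}: avg output tokens={tokens}, avg latency={latency}s")
    print("output token growth ratio:", token_growth_ratio(averages))
```

If the growth ratio climbs while the deployed code stays the same, output length is a likely contributor to the latency increase and worth investigating before looking elsewhere.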
