Response latency degradation indicates prompt performance issues
warningperformanceUpdated Mar 17, 2026(via Exa)
Technologies:
How to detect:
Average response times increase, 95th percentile latency degrades, or timeout rates spike across different prompt configurations. Various error types occur including API failures, content policy violations, and parsing errors
Recommended action:
Monitor average response times, 95th percentile latency, and timeout rates. Create latency distribution charts showing performance consistency over time. Set up correlation analysis between prompt complexity, response length, and processing time. Use data to optimize prompt structure and identify performance bottlenecks