Task success rate below 99.95% indicates systemic issues
warningperformanceUpdated Dec 17, 2025(via Exa)
Technologies:
How to detect:
Task success rate falls below 99.95%, indicating systemic reliability issues. Pre-deployment baseline should be >99.95%, with alerts configured for <99%.
Recommended action:
Monitor task success rate continuously. Alert if rate drops below 99%. Analyze failure patterns to identify which tasks fail most frequently. Review the 3 most problematic tasks causing 85% of failures. Implement automatic retry with exponential backoff (configured 3x retry). Escalate to human intervention after retry exhaustion.