Celery

Task success rate below 99.95% indicates systemic issues

warning
performanceUpdated Dec 17, 2025(via Exa)
Technologies:
How to detect:

Task success rate falls below 99.95%, indicating systemic reliability issues. Pre-deployment baseline should be >99.95%, with alerts configured for <99%.

Recommended action:

Monitor task success rate continuously. Alert if rate drops below 99%. Analyze failure patterns to identify which tasks fail most frequently. Review the 3 most problematic tasks causing 85% of failures. Implement automatic retry with exponential backoff (configured 3x retry). Escalate to human intervention after retry exhaustion.