Service down detection requires immediate action
critical | availability | Updated Mar 16, 2026 (via Exa)
How to detect:
The service health check fails (up == 0) continuously for 1 minute, which indicates that the service is completely unavailable.
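The detection condition can be illustrated with a minimal Python sketch. It assumes the health check is available as a series of (timestamp in seconds, up value) samples; the function name down_long_enough and the hold_seconds parameter are hypothetical and not part of any monitoring tool. It only shows the "0 for a full minute" logic, not how a real monitoring system evaluates alerts.

```python
from typing import Iterable, Tuple


def down_long_enough(samples: Iterable[Tuple[float, float]],
                     hold_seconds: float = 60.0) -> bool:
    """Return True if `up` has been 0 without interruption for at least
    `hold_seconds`, measured up to the most recent sample.

    `samples` is an iterable of (timestamp_seconds, up_value) pairs.
    This helper is hypothetical and only illustrates the condition.
    """
    down_since = None  # timestamp at which the current outage started
    last_ts = None     # timestamp of the most recent sample seen

    for ts, value in sorted(samples):
        if value == 0:
            # Start of an outage, or continuation of one already in progress.
            if down_since is None:
                down_since = ts
        else:
            # Any healthy sample resets the outage window.
            down_since = None
        last_ts = ts

    return down_since is not None and (last_ts - down_since) >= hold_seconds
```

A single healthy sample inside the window resets the timer, so a brief failed check does not trigger the alert; only an uninterrupted minute of failures does.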
Recommended action:
Verify that the service process is running, check the logs for the cause of any crash, restart the service if needed, and follow the service-down runbook. Every alert must have an attached runbook so that incident response is consistent.
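The runbook requirement lends itself to an automated check. The sketch below assumes alert rules have already been loaded as dictionaries with an "alert" name and an "annotations" mapping, and that the runbook link lives under an annotation key named runbook_url; both the data shape and the key name are assumptions for illustration, so adjust them to match your own rule files.

```python
from typing import Any, Dict, List


def alerts_missing_runbook(rules: List[Dict[str, Any]],
                           key: str = "runbook_url") -> List[str]:
    """Return the names of alert rules that have no non-empty runbook annotation.

    `rules` is a list of dicts shaped like
    {"alert": "ServiceDown", "annotations": {"runbook_url": "..."}}.
    The shape and the `runbook_url` key are assumptions, not a fixed standard.
    """
    missing = []
    for rule in rules:
        annotations = rule.get("annotations") or {}
        value = annotations.get(key) or ""
        # Treat absent, None, and whitespace-only values as "no runbook".
        if not str(value).strip():
            missing.append(rule.get("alert", "<unnamed>"))
    return missing
```

Running a check like this in CI, and failing the build when the returned list is not empty, is one way to keep the "every alert has a runbook" rule from drifting.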