Service Component Health Monitoring Chain
criticalreliabilityUpdated Nov 5, 2025
Monitor the operational state of critical OpenStack service components (API servers, agents, schedulers) to detect cascading failures before user impact.
Sources
How to detect:
Track service state for nova-api, nova-scheduler, nova-compute, neutron-server, neutron-l3-agent, neutron-dhcp-agent, cinder-api, glance-api. Alert when services transition to DOWN state or when agent counts drop unexpectedly.
Recommended action:
Investigate service logs for failure reasons. Check systemd unit status. Verify database connectivity and message queue health. Review resource exhaustion on service nodes (RAM, CPU, file descriptors).