OpenStack

Service Component Health Monitoring Chain

critical
reliabilityUpdated Nov 5, 2025

Monitor the operational state of critical OpenStack service components (API servers, agents, schedulers) to detect cascading failures before user impact.

How to detect:

Track service state for nova-api, nova-scheduler, nova-compute, neutron-server, neutron-l3-agent, neutron-dhcp-agent, cinder-api, glance-api. Alert when services transition to DOWN state or when agent counts drop unexpectedly.

Recommended action:

Investigate service logs for failure reasons. Check systemd unit status. Verify database connectivity and message queue health. Review resource exhaustion on service nodes (RAM, CPU, file descriptors).