Apache ZooKeeper

Leader Election Delays Signal Network Partitions

critical
reliabilityUpdated Feb 6, 2026

Frequent or prolonged leader elections indicate network connectivity issues between ensemble members. During elections, the cluster cannot process write requests, causing application-level outages.

How to detect:

Monitor zookeeper.server_state for transitions and track time spent in election state. Alert when no leader is elected for >30 seconds, or when elections occur more than once per hour outside maintenance windows.

Recommended action:

Check network connectivity between all ensemble members using ping and traceroute. Review firewall rules and network equipment logs for packet drops. Verify inter-node ZooKeeper ports (typically 2888, 3888) are accessible. Check for split-brain scenarios where network partition prevents quorum formation.