cassandra_client_request_error
Number of request errors by operation type (timeout, unavailable, failure)Interface Metrics (1)
Knowledge Base (2 documents, 0 chunks)
Related Insights (3)
When heap usage climbs above 80-90% without GC recovery, or when GC pause times exceed 500ms, Cassandra cannot process requests during stop-the-world pauses. This manifests as WriteTimeoutException at the application layer despite healthy disk and network.
Rising cassandra_storage_count_hints while all nodes appear up in nodetool status indicates silent network partitions or zombie nodes that accept gossip but fail write requests. This causes eventual consistency drift between replicas.
cassandra_dropped_messages represents Cassandra's load-shedding mechanism when requests can't be processed within timeout. This is a leading indicator of cluster overload — messages drop before clients see timeouts, revealing capacity problems earlier than error rates.