CPU Throttling Under Query Load
criticalResource ContentionUpdated Feb 23, 2026
Container CPU throttling increases when vector search query volume spikes, indicating the cluster is CPU-constrained and unable to meet query demand efficiently.
Technologies:
How to detect:
Monitor container_cpu_cfs_throttled_periods_total counter for increases correlated with qdrant_collection_number_of_grpc_requests or qdrant_collection_number_of_rest_requests spikes. CPU throttling indicates demand exceeds allocated CPU.
Recommended action:
Scale cluster vertically by increasing CPU limits in kube_pod_container_resource_limits, or horizontally by adding replicas. Review query patterns to optimize top_k values and filter complexity.