Prometheus Metric

ray_serve_count_http_requested

The number of HTTP requests processed.
Dimensions: None
Related Insights (2)
Ray Memory Limiter Shedding Under Traffic Spikes (critical)

An OpenTelemetry Collector that processes Ray telemetry is OOMKilled and restarts during traffic spikes when the memory_limiter processor is not configured or is placed incorrectly in the pipeline.
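
The placement issue can be checked mechanically. The sketch below is a minimal illustration, not part of any Collector tooling: it assumes the Collector configuration has already been parsed into a Python dict, and it follows the general Collector guidance that memory_limiter should be the first processor in each pipeline. The helper name check_memory_limiter, the pipeline contents, and the limit values are hypothetical.

```python
# Hypothetical, already-parsed Collector configuration (normally loaded from YAML).
# The limit values are illustrative, not recommendations.
config = {
    "processors": {
        "memory_limiter": {
            "check_interval": "1s",
            "limit_mib": 400,
            "spike_limit_mib": 80,
        },
        "batch": {},
    },
    "service": {
        "pipelines": {
            "metrics": {
                "receivers": ["prometheus"],
                # memory_limiter is deliberately misplaced after batch here.
                "processors": ["batch", "memory_limiter"],
                "exporters": ["otlp"],
            }
        }
    },
}


def check_memory_limiter(config):
    """Return one human-readable problem per misconfigured pipeline."""
    problems = []
    pipelines = config.get("service", {}).get("pipelines", {})
    for name, pipeline in pipelines.items():
        processors = pipeline.get("processors", [])
        # Component IDs may carry a name suffix, e.g. "memory_limiter/ray".
        positions = [
            i for i, p in enumerate(processors)
            if p.split("/")[0] == "memory_limiter"
        ]
        if not positions:
            problems.append(f"{name}: memory_limiter missing")
        elif positions[0] != 0:
            problems.append(f"{name}: memory_limiter is not the first processor")
    return problems


print(check_memory_limiter(config))
```

Moving memory_limiter to the front of the processors list makes the check return an empty list.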

Ray Telemetry High-Cardinality Cost Explosion (warning)

Attaching Kubernetes pod-level attributes (k8s.pod.id, k8s.node.ip) to Ray metrics dramatically increases cardinality and observability costs, especially in autoscaling environments where pods are ephemeral.
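
The cardinality effect can be illustrated with a small simulation. This is a rough sketch under stated assumptions: every scaling cycle replaces all pods with new ones, each pod gets a fresh ID, and the "deployment" label, its value, and the function names are hypothetical. It counts distinct label sets (time series) with and without the pod-level attributes.

```python
def simulate_samples(cycles, replicas_per_cycle, nodes=3):
    """Generate one label set per pod across several autoscaling cycles."""
    samples = []
    for cycle in range(cycles):
        for replica in range(replicas_per_cycle):
            samples.append({
                "deployment": "my_app",                  # hypothetical stable label
                "k8s.pod.id": f"pod-{cycle}-{replica}",  # new ID for every pod
                "k8s.node.ip": f"10.0.0.{replica % nodes}",
            })
    return samples


def distinct_series(samples, keep_labels):
    """Count unique label combinations, restricted to keep_labels."""
    return len({tuple(sample[label] for label in keep_labels) for sample in samples})


samples = simulate_samples(cycles=20, replicas_per_cycle=5)

without_pod_labels = distinct_series(samples, ["deployment"])
with_pod_labels = distinct_series(
    samples, ["deployment", "k8s.pod.id", "k8s.node.ip"]
)

print(without_pod_labels, with_pod_labels)
```

In this toy setup the series count grows with the total number of pods ever created rather than with the number running at once, which is why ephemeral pods make pod-level attributes expensive.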