Technologies/Prometheus/ray_serve_count_http_requested

ray_serve_count_http_requested

The number of HTTP requests processed.

Dimensions:None

Related Insights (2)

Ray Memory Limiter Shedding Under Traffic Spikescritical

OpenTelemetry collector processing Ray telemetry experiences OOMKills and restarts during traffic spikes when memory_limiter processor is not configured or placed incorrectly in pipeline.

▸

Ray Telemetry High-Cardinality Cost Explosionwarning

Attaching Kubernetes pod-level attributes (k8s.pod.id, k8s.node.ip) to Ray metrics dramatically increases cardinality and observability costs, especially in autoscaling environments where pods are ephemeral.

▸