ray_serve_count_http_requested
The number of HTTP requests processed.Dimensions:None
Related Insights (2)
Ray Memory Limiter Shedding Under Traffic Spikescritical
OpenTelemetry collector processing Ray telemetry experiences OOMKills and restarts during traffic spikes when memory_limiter processor is not configured or placed incorrectly in pipeline.
▸
Ray Telemetry High-Cardinality Cost Explosionwarning
Attaching Kubernetes pod-level attributes (k8s.pod.id, k8s.node.ip) to Ray metrics dramatically increases cardinality and observability costs, especially in autoscaling environments where pods are ephemeral.
▸