http.server.active_requests
Number of active HTTP server requestsInterface Metrics (1)
Knowledge Base (1 documents, 0 chunks)
Technical Annotations (48)
Configuration Parameters (19)
workersrecommended: cpu_countsleeprecommended: greater than 0.001DATABASES['default']['CONN_MAX_AGE']recommended: 10DATABASES['default']['ATOMIC_REQUESTS']recommended: Falsepool_moderecommended: transactiondefault_pool_sizerecommended: 20max_client_connrecommended: 1000listen_portrecommended: 6432CONN_MAX_AGErecommended: 0PORTrecommended: 6432--workersrecommended: 4 for single CPU; 16 for multi-coreCELERY_BROKER_URLrecommended: amqp://guest@localhost//db_pool_sizerecommended: 32metrics[].pods.target.averageValuerecommended: 10maxReplicasrecommended: 40spec.minAvailablerecommended: 6MetricInstruments.HTTP_SERVER_ACTIVE_REQUESTS.unitrecommended: {request}MetricInstruments.HTTP_SERVER_ACTIVE_REQUESTS.descriptionrecommended: Number of active HTTP server requests.starlette.versionrecommended: >=0.40.0Error Signatures (5)
OperationalError: sorry, too many clients alreadyexceptionFATAL: sorry, too many clients alreadylog patternFATAL: memory quota exceededlog patternOOM errorerror codeCVE-2024-47874error codeCLI Commands (2)
sudo apt-get install pgbouncerremediationcurl http://localhost:8000 -F 'big=</dev/urandom'diagnosticTechnical References (22)
Server-Sent EventsprotocolSSEprotocolTaskGroupcomponentasync context managerconceptexit stackconceptrequest queuecomponentconcurrency-based autoscalingconceptworkerscomponent@bentoml.servicecomponentCONN_MAX_AGEconfiguration parameterASGIcomponentPgBouncercomponentmax_connectionsconceptprocess workerconceptCelerycomponentAMQPprotocolStreamingResponse.stream_responsecomponentcreate_http_server_active_requestscomponentsemconv packagecomponentMetricInstruments.HTTP_SERVER_ACTIVE_REQUESTScomponentmultipart/form-dataprotocolfilenamecomponentRelated Insights (22)
FastAPI async endpoints exhibit serial-like behavior and inflated tail latency when synchronous operations (ORM calls, CPU-heavy tasks, blocking SDKs) execute directly on the event loop. Throughput plateaus while p95/p99 latencies climb despite moderate CPU usage.
FastAPI services experiencing event loop blocking show counterintuitive metrics: moderate CPU utilization (50-60%), healthy dependency performance, but rising tail latency and timeouts. This pattern indicates worker starvation rather than resource exhaustion.
DataHub backend API experiencing elevated error rates impacting metadata ingestion, UI operations, and external integrations, potentially indicating service degradation or infrastructure issues.
Request queue times increase at load balancer during traffic bursts despite moderate server resource utilization, indicating insufficient concurrency handling or event loop saturation.
Default concurrent execution limits for serverless functions trigger 429 errors under traffic spikes, causing request failures and degraded user experience when new function instances cannot be spawned fast enough to handle load.