anthropic_request_time
Duration of API requestsKnowledge Base (7 documents, 0 chunks)
Related Insights (6)
Invalid or expired API keys generate 'unable to connect' errors that appear identical to network failures, leading teams to troubleshoot network/DNS when the root cause is authentication. Error response codes distinguish these cases.
Console.anthropic dashboard can be inaccessible while the API remains fully operational (or vice versa), creating false alarms. Teams waste time troubleshooting local networks when only the console component is affected.
Anthropic console dashboard becomes inaccessible while API endpoints remain functional (or vice versa), causing teams to misdiagnose complete outages when only one service layer is affected. Creates deployment delays and unnecessary troubleshooting.
Initial response latency (TTFT) increases when backend processing saturates, creating poor user experience even when total request time remains acceptable. Critical for streaming applications where perceived responsiveness depends on first token delivery.
High Anthropic API latency (>500ms) signals backend strain or network issues. Early detection prevents cascading failures in AI-powered applications.
Elevated anthropic_time_time_to_first_token indicates backend strain, throttling, or network issues. Latency above 500ms may signal infrastructure problems. This metric is distinct from total request time and specifically captures model initialization and first response delays.