Monitor Workflow Failure Rates for Execution Health
criticalWorkflow failures indicate potential issues with Workflow execution health and should be monitored proactively. The Datadog integration provides built-in monitors specifically for detecting Workflow failure patterns.
Workflow failure rate increases above baseline. The integration includes pre-configured monitors that alert on Workflow failures to enable rapid response.
1. Access the pre-built Temporal Cloud metrics dashboard in Datadog to view Workflow success rates and failure patterns. 2. Review alerts from the built-in Workflow failure monitor. 3. Use metrics tagged by namespace and operation type to identify which Workflows or namespaces are experiencing failures. 4. Investigate correlated metrics such as service latency and replication lag to determine if infrastructure issues are contributing to failures. 5. Analyze the dashboard for detailed insights into Workflow efficiency and system health to diagnose root causes.