Presto insights

4mo ago

Network Timeout in Hive Metastore Communication (warning)

Presto queries fail sporadically with SocketTimeoutException when communicating with the Hive metastore, often under high query concurrency or when the metastore is under-resourced relative to query load.

presto_execution_external_failures_one_minute_rate presto_execution_running_queries

4mo ago

Insufficient Resources Leading to Query Queueing (critical)

The Presto coordinator is unable to find nodes to run queries, indicated by 'No nodes available to run the query' errors combined with a growing number of queued queries while running queries remain stable or decrease.

presto_execution_insufficient_resources_failures_one_minute_rate presto_execution_running_queries presto_memory_blocked_nodes
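
As a rough illustration, the queueing pattern described here can be sketched as a check over recent metric samples. This is a minimal sketch under stated assumptions, not part of Presto or any monitoring tool: the function names are hypothetical, the inputs are assumed to be values already scraped from the metrics listed here, and the queued-query series is assumed to come from a separate queued-queries gauge.

```python
from typing import Sequence


def is_trending_up(samples: Sequence[float]) -> bool:
    """True when the series ends higher than it starts and never steps down."""
    return (
        len(samples) >= 2
        and samples[-1] > samples[0]
        and all(b >= a for a, b in zip(samples, samples[1:]))
    )


def looks_resource_starved(
    queued: Sequence[float],
    running: Sequence[float],
    insufficient_failure_rate: float,
) -> bool:
    """Hypothetical check: queue grows, running count does not, failures occur.

    queued / running are recent samples, oldest first (assumed inputs).
    insufficient_failure_rate is the latest one-minute failure rate.
    """
    queued_growing = is_trending_up(queued)
    running_flat_or_down = len(running) >= 2 and running[-1] <= running[0]
    return queued_growing and running_flat_or_down and insufficient_failure_rate > 0


if __name__ == "__main__":
    # Queue climbs while running queries drop and failures are non-zero.
    print(looks_resource_starved([2, 5, 9], [10, 10, 8], 0.5))
```

Requiring all three conditions together keeps the check from firing on a healthy cluster that is simply ramping up, where queued and running queries rise at the same time.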

1y ago

Task Execution Backlog from Thread Pool Saturation (warning)

Growing queues of tasks waiting for execution across executor pools, indicating thread pool saturation and potential query slowdown as splits wait for processing resources.

presto_execution_executor_queued_task presto_execution_executor_active presto_execution_executor_waiting_splits

1y ago

User Error Rate Spike Indicating Query Syntax or Schema Issues (info)

Elevated user error failures suggesting widespread issues with query syntax, schema changes, or permission problems affecting multiple queries from users.

presto_execution_user_error_failures_one_minute_rate presto_execution_started_queries_one_minute_rate
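
One way to read these two metrics together is as a ratio of user-error failures to started queries, which separates a genuine spike from a plain increase in traffic. The sketch below is illustrative only: the function names and the 0.2 threshold are assumptions, not values taken from Presto or from this insight.

```python
def user_error_ratio(user_error_rate: float, started_rate: float) -> float:
    """Fraction of started queries that end in a user error (0.0 if idle)."""
    if started_rate <= 0:
        # No queries started in the window, so there is nothing to compare against.
        return 0.0
    return user_error_rate / started_rate


def is_user_error_spike(
    user_error_rate: float,
    started_rate: float,
    threshold: float = 0.2,  # hypothetical threshold; tune per workload
) -> bool:
    """True when the user-error share meets or exceeds the threshold."""
    return user_error_ratio(user_error_rate, started_rate) >= threshold


if __name__ == "__main__":
    # 5 user-error failures/min against 20 started queries/min.
    print(user_error_ratio(5.0, 20.0), is_user_error_spike(5.0, 20.0))
```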

1y ago

Query Abandonment Pattern Indicating Timeout or Client Issues (warning)

High rate of abandoned queries where clients disconnect before completion, potentially indicating query timeouts, impatient users, or client application crashes.

presto_execution_abandoned_queries_one_minute_rate presto_execution_execution_time_one_minute_p95

1y ago

Query Memory Exhaustion and Distributed Join Imbalance (critical)

Presto queries fail with 'Query exceeded max memory size' or 'Query exceeded local memory limit' errors. A common cause is inefficient join ordering: Presto builds the in-memory hash table from the right-side table, so placing the larger table on the right forces a large, memory-hungry build instead of a cheap broadcast of the smaller table.

presto_execution_failed_queries_one_minute_rate presto_memory_reserved_size presto_memory_max_size (+1 more)
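
The reserved and maximum memory metrics can be combined into a utilization figure to show how close a memory pool is to its limit before queries start failing. This is a minimal sketch: the function names and the 0.9 threshold are hypothetical, and both inputs are assumed to be in the same unit (for example, bytes).

```python
def memory_utilization(reserved_size: float, max_size: float) -> float:
    """Share of the memory pool currently reserved, as a value from 0.0 up."""
    if max_size <= 0:
        raise ValueError("max_size must be positive")
    return reserved_size / max_size


def is_near_memory_limit(
    reserved_size: float,
    max_size: float,
    threshold: float = 0.9,  # hypothetical threshold; tune per cluster
) -> bool:
    """True when reserved memory meets or exceeds the threshold share."""
    return memory_utilization(reserved_size, max_size) >= threshold


if __name__ == "__main__":
    # 95 units reserved out of a 100-unit pool.
    print(memory_utilization(95, 100), is_near_memory_limit(95, 100))
```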