add prometheus rule for 3 rabbitmq alarm states (#1823)
* rabbitmq_alarms_memory_used_watermark
* rabbitmq_alarms_file_descriptor_limit
* rabbitmq_alarms_free_disk_space_watermark
awoimbee authored Feb 20, 2025
1 parent f218c0d commit df2d045
Showing 1 changed file with 60 additions and 0 deletions.
60 changes: 60 additions & 0 deletions observability/prometheus/rules/rabbitmq/cluster-alarms.yml
@@ -0,0 +1,60 @@
---
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: rabbitmq-cluster-alarms
  # If labels are defined in spec.ruleSelector.matchLabels of your deployed Prometheus object, make sure to include them here.
  labels:
    role: alert-rules
spec:
  groups:
  - name: rabbitmq
    rules:
    - alert: MemoryAlarm
      expr: |
        max by(rabbitmq_cluster) (
          max_over_time(rabbitmq_alarms_memory_used_watermark[5m])
          * on(instance) group_left(rabbitmq_cluster, rabbitmq_node, pod) rabbitmq_identity_info
        ) > 0
      keep_firing_for: 5m
      annotations:
        description: |
          RabbitMQ cluster `{{ $labels.rabbitmq_cluster }}` memory alarm active. Publishers are blocked.
        summary: |
          A RabbitMQ node reached the `vm_memory_high_watermark` threshold.
          See https://www.rabbitmq.com/docs/alarms#overview, https://www.rabbitmq.com/docs/memory.
      labels:
        rulesgroup: rabbitmq
        severity: warning
    - alert: RabbitmqDiskAlarm
      expr: |
        max by(rabbitmq_cluster) (
          max_over_time(rabbitmq_alarms_free_disk_space_watermark[5m])
          * on(instance) group_left(rabbitmq_cluster, rabbitmq_node, pod) rabbitmq_identity_info
        ) > 0
      keep_firing_for: 5m
      annotations:
        description: |
          RabbitMQ cluster `{{ $labels.rabbitmq_cluster }}` disk alarm active. Publishers are blocked.
        summary: |
          A RabbitMQ node reached the `disk_free_limit` threshold.
          See https://www.rabbitmq.com/docs/alarms#overview, https://www.rabbitmq.com/docs/disk-alarms.
      labels:
        rulesgroup: rabbitmq
        severity: warning
    - alert: RabbitmqFileDescriptorAlarm
      expr: |
        max by(rabbitmq_cluster) (
          max_over_time(rabbitmq_alarms_file_descriptor_limit[5m])
          * on(instance) group_left(rabbitmq_cluster, rabbitmq_node, pod) rabbitmq_identity_info
        ) > 0
      keep_firing_for: 5m
      annotations:
        description: |
          RabbitMQ cluster `{{ $labels.rabbitmq_cluster }}` file descriptor alarm active. Publishers are blocked.
        summary: |
          A RabbitMQ node ran out of file descriptors.
          See https://www.rabbitmq.com/docs/alarms#file-descriptors.
      labels:
        rulesgroup: rabbitmq
        severity: warning
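
Reviewer note: all three rules share one expression shape, so it may help to see what that expression selects. The sketch below is a simplified Python model, not Prometheus code: `active_cluster_alarms`, the sample layout, and the instance addresses are hypothetical, the range window is treated as left-open, and staleness handling and `keep_firing_for` are ignored. It takes the peak alarm value per instance over the last 5 minutes, joins it to a cluster name (standing in for `rabbitmq_identity_info`), keeps the maximum per cluster, and drops clusters whose peak is not above 0.

```python
WINDOW_SECONDS = 300  # mirrors the [5m] range selector used in the rules


def active_cluster_alarms(alarm_samples, identity, now, window=WINDOW_SECONDS):
    """Model of: max by(cluster) (max_over_time(alarm[5m]) * on(instance) identity) > 0.

    alarm_samples: {instance: [(timestamp_seconds, value), ...]}  (hypothetical layout)
    identity:      {instance: cluster_name}, standing in for rabbitmq_identity_info
    """
    per_cluster = {}
    for instance, samples in alarm_samples.items():
        cluster = identity.get(instance)
        if cluster is None:
            continue  # no identity series for this instance: the join drops it
        in_window = [value for ts, value in samples if now - window < ts <= now]
        if not in_window:
            continue  # no samples in range: nothing to aggregate
        peak = max(in_window)  # max_over_time for this instance
        per_cluster[cluster] = max(per_cluster.get(cluster, peak), peak)
    # The trailing "> 0" keeps only clusters where some node raised the alarm.
    return {cluster: peak for cluster, peak in per_cluster.items() if peak > 0}


# One node raises the alarm briefly at t=60s; the other never does.
samples = {
    "10.0.0.1:15692": [(0, 0), (60, 1), (120, 0), (180, 0)],
    "10.0.0.2:15692": [(0, 0), (60, 0), (120, 0), (180, 0)],
}
identity = {"10.0.0.1:15692": "demo", "10.0.0.2:15692": "demo"}

still_active = active_cluster_alarms(samples, identity, now=300)
cleared = active_cluster_alarms(samples, identity, now=360)
```

In this model the short alarm at t=60s keeps the cluster in the result until the sample leaves the 5-minute window. In the real rules, `keep_firing_for: 5m` then holds the alert for a further 5 minutes after the expression stops returning results, which should reduce flapping when an alarm toggles quickly.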
