Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ATMOSPHERE-103] chore: Add loki rule to delect Nova cell down #495

Merged
merged 8 commits into from
Aug 31, 2024
Next Next commit
Fix lint error
  • Loading branch information
okozachenko1203 committed Aug 27, 2024
commit b6c755d6c777267a3ff07c10a5c147e5db28e4b4
17 changes: 17 additions & 0 deletions roles/loki/vars/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -39,6 +39,23 @@ _loki_helm_values:
period: 24h
test:
enabled: false
monitoring:
selfMonitoring:
enabled: false
grafanaAgent:
installOperator: false
lokiCanary:
enabled: false
rules:
additionalGroups:
- name: additional-loki-rules
rules:
- alert: NovaCellNotResponding
expr: 'kubernetes.labels.component="compute" and log =~ "not responding and hence is being omitted from the results"'
labels:
severity: critical
annotations:
summary: Nova Cell is not responding. It can cause port deletion in CAPI.
singleBinary:
replicas: 1
nodeSelector:
Expand Down