-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Helm chart #3
Helm chart #3
Conversation
Thanks @patrickmslatteryvt I'll review this ASAP :) |
This looks promising! What's next to help this moving forward? 😊 |
Can this be tested out locally? (without Tiller, ofcourse). If someone can help validating the chart and reviewing it, would be great! @MPV |
I'm trying this out here now. So far I've found 2 issues:
1. Hardcoded configmap:As I'm deploying this chart as a subchart as part of my "cluster-monitoring" chart, when the configmap is named after the release, I get ✅ The solution to this is just changing the referenced name of the configmap in the deployment: - name: calert-config
+ name: {{ template "calert.fullname" . }}-config ...as this is how the configmap is being named: 2. Broken alertmanager configI get this error: And here is the rendered alertmanager config, after using the supplied example: global:
resolve_timeout: 5m
receivers:
- name: google-chat
webhook_configs:
- url: http://calert.cluster-monitoring.svc.cluster.local:6000/create?room_name=my-room-is-here
route:
group_by:
- job
group_interval: 5m
group_wait: 30s
receiver: "null" # <--- could this be giving the issue?
repeat_interval: 12h
routes:
- group_by:
- alertname
match:
severity: critical
receiver: google-chat This is probably due to the fact that the prometheus-operator's default config for alertmanager includes a route with the "null" recevier (and also such a receiver): ❓ I'm not yet sure how to solve this. Could be that we should just add that "null" receiver to the |
2. Fixed example Alertmanager configHere's how I got it working by adapting the example alertmanager config: alertmanager:
config:
route:
+ receiver: "null"
routes:
- match:
severity: critical
receiver: google-chat
group_by: [alertname]
receivers:
+ - name: "null"
- name: 'google-chat'
webhook_configs:
- url: "http://calert.clu-inf-all.svc.cluster.local:6000/create?room_name=<room>" The second change adds back the receiver that was missing from the example, while the first change just makes it explicitly used (as it would still be set like this while merging with the prometheus-operator default values). |
I've opened PRs to @patrickmslatteryvt's fork with the fixes: |
@mr-karan ...but my changes above aren't any blockers for using this, so I would consider this PR tested and okay to merge, unless you have any other thoughts? If @patrickmslatteryvt isn't available at the moment, I could re-contribute those changes into this repo after merging this PR. |
Thanks for your contribution @MPV. Much appreciated. |
Hi, this is an initial Helm chart for calert.
It implements a similar deployment to the one you have in the k8s folder but without Redis.