Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increased CortexRequestErrors alert severity #279

Merged
merged 2 commits into from
Apr 21, 2021

Conversation

pracucci
Copy link
Collaborator

What this PR does:
The CortexRequestErrors alert looks to be quite critical. If it triggers, it means > 1% of requests to a specific route are failing for 15 consecutive minutes.

What's the sentiment if we raise it to critical?

Which issue(s) this PR fixes:
N/A

Checklist

  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@pracucci pracucci requested a review from a team as a code owner March 23, 2021 10:49
Signed-off-by: Marco Pracucci <[email protected]>
@pracucci pracucci requested review from pstibrany and gouthamve March 23, 2021 10:49
Copy link
Contributor

@jtlisi jtlisi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm 100% on board with this change. Before the the SLO alerts were added this alert was critical. One issue with the SLO alerts is it's difficult to tel if it was triggered by latency or errors. That means you can't really silence it for an intractable latency issue without being concerned an error issue could trigger later. This alert helps fill the gap.

@pracucci pracucci merged commit 1fb3caa into main Apr 21, 2021
@pracucci pracucci deleted the increase-cortex-request-error-level branch April 21, 2021 07:46
simonswine pushed a commit to grafana/mimir that referenced this pull request Oct 18, 2021
…ortex-request-error-level

Increased CortexRequestErrors alert severity
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants