Change Harvest metrics to use a one for healthy #306

Closed
cgrinds opened this issue Jul 14, 2021 · 1 comment · Fixed by #319 or #320
Labels
bug Something isn't working status/done

Comments

@cgrinds
Collaborator

cgrinds commented Jul 14, 2021

Raised by hashi825 in #262

A few notes from a chat with the team:

  • We map label values to a numeric value (0 or 1) because Grafana can't generate alerts from labels, only from numbers
  • Originally 0 represented ok in Harvest because many statuses are enums that were mapped like this:

| enum      | int |
|-----------|-----|
| ok        | 0   |
| fail      | 1   |
| missing   | 2   |
| error     | 3   |
| not_found | 4   |

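
To make the original scheme concrete, here is a minimal sketch of an enum-position mapping in which `ok` lands on 0. The names `LEGACY_STATUS_CODES` and `legacy_status_value` are hypothetical and chosen only for illustration. This is not Harvest's actual code.

```python
# Hypothetical illustration of the original enum-style mapping.
# The values mirror the table in this issue; the names are assumptions.
LEGACY_STATUS_CODES = {
    "ok": 0,
    "fail": 1,
    "missing": 2,
    "error": 3,
    "not_found": 4,
}


def legacy_status_value(status: str) -> int:
    """Return the enum position of a status, so 0 means healthy."""
    return LEGACY_STATUS_CODES[status]
```

Under this scheme a healthy object reports 0, so an alert has to fire on any value greater than 0.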
@Hardikl
Contributor

Hardikl commented Jul 22, 2021

earlier:

cluster_status{cluster="C2_sti7-vsim-ucs569c_cluster", datacenter="DC-02", instance="localhost:12992", job="prometheus2", status="ok"} 0

now:

cluster_new_status{cluster="C2_sti7-vsim-ucs569c_cluster", datacenter="DC-02", instance="localhost:12992", job="prometheus2", status="ok"} 1

Node new status: [image]
Cluster new subsystem: [image]
Cluster new status: [image]
Aggregate new status: [image]
Shelf new status: [image]
Volume new status: [image]

The values normal, online, and ok are mapped to 1; all other (non-normal) values are mapped to 0.
With these details, moving to status/done.
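
The new mapping can be sketched roughly as follows. It assumes an exact string match against the three healthy values named above. `HEALTHY_STATUSES` and `health_value` are hypothetical names, not identifiers from the Harvest code base.

```python
# Hypothetical sketch of the "one for healthy" mapping described above.
# Assumption: statuses are compared by exact string match.
HEALTHY_STATUSES = frozenset({"normal", "online", "ok"})


def health_value(status: str) -> int:
    """Return 1 for a healthy status and 0 for anything else."""
    return 1 if status in HEALTHY_STATUSES else 0
```

With this mapping, `health_value("ok")` gives 1 and `health_value("fail")` gives 0, so an alert can fire when the metric equals 0.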
