Add add Automated Dashboard for Kubernetes Node metrics #64

jeffkreeftmeijer · 2024-04-04T14:45:38Z

AppSignal for Kubernetes extracts node metrics for each node running in a Kubenetes cluster. This automated dashboard displays some of those metrics in an automated dashboard named "Kubernetes Nodes".

tombruijn · 2024-04-05T11:42:00Z

dashboards/kubernetes/node.json

+            "name": "node_fs_inodes",
+            "fields": [
+              {
+                "field": "GAUGE"
+              }
+            ],
+            "tags": [
+              {
+                "key": "node",
+                "value": "*"
+              }
+            ]
+          },
+          {
+            "name": "node_fs_inodes_free",
+            "fields": [
+              {
+                "field": "GAUGE"
+              }
+            ],
+            "tags": [
+              {
+                "key": "node",
+                "value": "*"
+              }
+            ]
+          },
+          {
+            "name": "node_fs_inodes_used",


Let's use tags for different states like free and used instead of reporting them as different metrics. We do this for other (host) metrics too. It would help that we don't have to show the full metric name then for every line in the graph, freeing up valuable space in the hover box.

For example:

Metric name: node_fs_inodes

Tags:

state

Values:

free

used

I also see this in some other graphs in this dashboard. We should update those as well.

I'm trying to keep this dashboard as close to what's reported from Kubernetes as I can. I think this is a good idea for the future, when we know what we'd like to report exactly, but let's get this out of the door and get users to try it first.

tombruijn · 2024-04-05T11:45:24Z

dashboards/kubernetes/node.json

+      {
+        "title": "Node CPU Usage",
+        "description": "node_cpu_usage_nano_cores",
+        "line_label": "%name% %node%",


We shouldn't include the metric name unless there are multiple metrics in a graph, and for those graphs we're better off using tags on one metric. See my other comment.

Suggested change

"line_label": "%name% %node%",

"line_label": "%node%",

tombruijn · 2024-04-05T11:45:51Z

dashboards/kubernetes/node.json

+    "visuals": [
+      {
+        "title": "Node CPU Usage",
+        "description": "node_cpu_usage_nano_cores",


These descriptions don't explain anything to me. Let's remove them if it's just for testing.

Or if we can do human-readable descriptions, let's do those instead!

These are the internal metric names, so they'll probably make sense to Kubernetes users.

I can't the names of these metrics anywhere, so I wouldn't be so sure that it's clear. The same way I can't find that you're using the kubernetes metric names, like you say here. In the library I see it being mapped from a JSON struct that doesn't use the same naming: https://github.com/appsignal/appsignal-kubernetes/blob/0b3f39d65ba99622ab3e647e8e4c012ee944baca/src/main.rs#L97-L167

Do you have any links to docs or source code that mentions these metric names?

roytomeij · 2024-04-05T12:29:05Z

I'm not sure what I should review :)

unflxw

If this is just a quick test with Jim and select users, then this is fine.

[But then, maybe we don't need a magic dashboard at all? Maybe we can just give them a JSON dashboard definition that they can manually import into AppSignal?]

Otherwise, what @tombruijn said.

And also maybe we could consider prefixing these metric names with something like k8s_? Otherwise it might be confusing with the Node.js Heap Statistic metrics (there's no overlap, though -- those are nodejs_, not node_)

jeffkreeftmeijer · 2024-04-08T12:41:02Z

Accidentally merged, reverted and continuing in #65.

jeffkreeftmeijer · 2024-04-08T12:46:53Z

If this is just a quick test with Jim and select users, then this is fine.

[But then, maybe we don't need a magic dashboard at all? Maybe we can just give them a JSON dashboard definition that they can manually import into AppSignal?]

We're adding a dashboard to give users a bit of an easier time getting their Kubernetes setup working. This is mostly an example dashboard at this point, but it saves users from having to write their own.

Otherwise, what @tombruijn said.

And also maybe we could consider prefixing these metric names with something like k8s_? Otherwise it might be confusing with the Node.js Heap Statistic metrics (there's no overlap, though -- those are nodejs_, not node_)

You're right, this could be a bit confusing. I'm putting this on the list for later, as we're going to add more metrics that also warrant namespacing (think pod metrics, for example). I'll pick this one up then.

Add add autmated dashboard for Kubernetes Node metrics

2f4dddc

AppSignal for Kubernetes extracts node metrics for each node running in a Kubenetes cluster. This automated dashboard displays some of those metrics in an automated dashboard named "Kubernetes Nodes".

jeffkreeftmeijer added the feature label Apr 4, 2024

jeffkreeftmeijer requested review from matsimitsu, roytomeij, luismiramirez and unflxw April 4, 2024 14:45

jeffkreeftmeijer self-assigned this Apr 4, 2024

jeffkreeftmeijer changed the title ~~Add add autmated dashboard for Kubernetes Node metrics~~ Add add Automated Dashboard for Kubernetes Node metrics Apr 4, 2024

luismiramirez approved these changes Apr 4, 2024

View reviewed changes

tombruijn requested changes Apr 5, 2024

View reviewed changes

jeffkreeftmeijer requested review from thijsc and removed request for roytomeij April 5, 2024 17:48

unflxw approved these changes Apr 5, 2024

View reviewed changes

jeffkreeftmeijer merged commit 2f4dddc into main Apr 8, 2024
1 check passed

jeffkreeftmeijer deleted the kubernetes-nodes branch April 8, 2024 12:38

jeffkreeftmeijer restored the kubernetes-nodes branch April 8, 2024 12:39

jeffkreeftmeijer mentioned this pull request Apr 8, 2024

Add add autmated dashboard for Kubernetes Node metrics #65

Merged

jeffkreeftmeijer mentioned this pull request Apr 25, 2024

Add dashboard for Kubernetes pods #66

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add add Automated Dashboard for Kubernetes Node metrics #64

Add add Automated Dashboard for Kubernetes Node metrics #64

jeffkreeftmeijer commented Apr 4, 2024

tombruijn Apr 5, 2024

jeffkreeftmeijer Apr 8, 2024

tombruijn Apr 5, 2024

tombruijn Apr 5, 2024

unflxw Apr 5, 2024

jeffkreeftmeijer Apr 5, 2024

tombruijn Apr 10, 2024 •

edited

Loading

roytomeij commented Apr 5, 2024

unflxw left a comment •

edited

Loading

jeffkreeftmeijer commented Apr 8, 2024

jeffkreeftmeijer commented Apr 8, 2024

Add add Automated Dashboard for Kubernetes Node metrics #64

Add add Automated Dashboard for Kubernetes Node metrics #64

Conversation

jeffkreeftmeijer commented Apr 4, 2024

tombruijn Apr 5, 2024

Choose a reason for hiding this comment

jeffkreeftmeijer Apr 8, 2024

Choose a reason for hiding this comment

tombruijn Apr 5, 2024

Choose a reason for hiding this comment

tombruijn Apr 5, 2024

Choose a reason for hiding this comment

unflxw Apr 5, 2024

Choose a reason for hiding this comment

jeffkreeftmeijer Apr 5, 2024

Choose a reason for hiding this comment

tombruijn Apr 10, 2024 • edited Loading

Choose a reason for hiding this comment

roytomeij commented Apr 5, 2024

unflxw left a comment • edited Loading

Choose a reason for hiding this comment

jeffkreeftmeijer commented Apr 8, 2024

jeffkreeftmeijer commented Apr 8, 2024

tombruijn Apr 10, 2024 •

edited

Loading

unflxw left a comment •

edited

Loading