Push metric name sanitization to backends #110

mheffner · 2012-06-26T14:46:10Z

Opening this as a general RFC.

Currently the main StatsD daemon performs a set of regex replacements to sanitize metric names:

        var key = bits.shift()
                      .replace(/\s+/g, '_')
                      .replace(/\//g, '-')
                      .replace(/[^a-zA-Z_\-0-9\.]/g, '');

These regular expressions are largely based on the acceptable name formats that Graphite (being the only backend) was able to handle. Now that we have a pluggable backend system, I'm proposing that we make this task a function of each backend to handle as is appropriate for its acceptable metric name character set.

One advantage of pushing this down to the backend level would be that different backends could handle the special characters as optional control sequences. For example, in the case of the Librato backend we would like a way to specify a custom source parameter on a stat by stat basis. One thought would be to use a prefix character, like (/), to separate the source name from the metric name:

db1.acme.com/gorets:1c

The Librato backend would split the source out based on the (/) character, while backends like Graphite would simply turn that into a metric named "db1.acme.com.gorets", like it does today.

The text was updated successfully, but these errors were encountered:

mrtazz · 2012-06-26T19:28:26Z

I think this is a good idea. I'd love to see more code pushed into the backends, where it makes sense.

Dieterbe · 2013-04-10T18:18:00Z

what's the status of this? histogram support (#162) suffers from keys like 'bin_0.5' not being sanitized to 'bin_0_5' (or whatever)

draco2003 · 2013-04-11T21:30:47Z

Closing this, since we'll pull #155 after the config flag is added.

Thanks!

mjr5749 mentioned this issue Sep 14, 2012

Relax metric name sanitation based on a configuration flag #154

Closed

mheffner mentioned this issue Sep 24, 2012

Move stat key name sanitization to Graphite backend. #155

Merged

ghost assigned mrtazz Oct 10, 2012

draco2003 closed this as completed Apr 11, 2013

joshbuddy mentioned this issue Jan 10, 2014

Sanitize statsd path mLewisLogic/saddle#10

Merged

mnadel mentioned this issue Jan 3, 2016

Column naming mnadel/metrics.net.influxdb#4

Closed

bobzoller mentioned this issue Apr 27, 2016

sanitize * to - in graphite serializer influxdata/telegraf#1110

Closed

nicolargo mentioned this issue Mar 28, 2017

Some FS and LAN metrics fail to export correctly to StatsD nicolargo/glances#1068

Closed

shakuzen mentioned this issue Oct 26, 2021

'?' in tag causes it not being forwarded to Telegraf via statsd micrometer-metrics/micrometer#2840

Closed

rudigerlove mentioned this issue Jan 19, 2022

blank spaces not parsed in statsd data influxdata/telegraf#10372

Closed

powersj mentioned this issue Feb 24, 2022

fix: sanitize stasd names influxdata/telegraf#10466

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Push metric name sanitization to backends #110

Push metric name sanitization to backends #110

mheffner commented Jun 26, 2012

mrtazz commented Jun 26, 2012

Dieterbe commented Apr 10, 2013

draco2003 commented Apr 11, 2013

Push metric name sanitization to backends #110

Push metric name sanitization to backends #110

Comments

mheffner commented Jun 26, 2012

mrtazz commented Jun 26, 2012

Dieterbe commented Apr 10, 2013

draco2003 commented Apr 11, 2013