Log flooded with string to non-string with op EQ #14467

Closed

atoulme opened this issue Sep 24, 2022 · 13 comments
Labels: bug (Something isn't working), priority:p2 (Medium), processor/transform (Transform processor)

Comments

@atoulme (Contributor) commented Sep 24, 2022

What happened?

Description

After upgrading to 0.60.0, I am using a transform processor with a clause such as:

    transform:
      metrics:
        queries:
          - set(attributes["foo"], "bar") where attributes["foo"] == "foobar"

Steps to Reproduce

Set up a pipeline with a transform processor whose where clause uses an equality (==) comparison on an attribute that is not always present in the incoming data.

Expected Result

The pipeline should process data correctly without flooding the logs.

Actual Result

Logs are flooded with:

string to non-string with op EQ
string to non-string with op EQ
string to non-string with op EQ
string to non-string with op EQ
string to non-string with op EQ
string to non-string with op EQ
string to non-string with op EQ
string to non-string with op EQ

Collector version

0.60.0

Environment information

Environment

OS: (e.g., "Ubuntu 20.04")
Compiler (if manually compiled): (e.g., "go 14.2")

OpenTelemetry Collector configuration

receivers:
    splunk_hec:

exporters:
    splunk_hec/metrics:
        # Splunk HTTP Event Collector token.
        token: "00000000-0000-0000-0000-0000000000000"
        # URL to a Splunk instance to send data to.
        endpoint: "https://splunk:8088/services/collector"
        # Optional Splunk source: https://docs.splunk.com/Splexicon:Source
        source: "app"
        # Optional Splunk source type: https://docs.splunk.com/Splexicon:Sourcetype
        sourcetype: "telegraf"
        # Splunk index, optional name of the Splunk index targeted.
        index: "metrics"
        # Maximum HTTP connections to use simultaneously when sending data. Defaults to 100.
        max_connections: 20
        # Whether to disable gzip compression over HTTP. Defaults to false.
        disable_compression: false
        # HTTP timeout when sending data. Defaults to 10s.
        timeout: 10s
        # Whether to skip checking the certificate of the HEC endpoint when sending data over HTTPS. Defaults to false.
        # For this demo, we use a self-signed certificate on the Splunk docker instance, so this flag is set to true.
        insecure_skip_verify: true

processors:
    batch:
    transform:
      metrics:
        queries:
          - set(attributes["foo"], "bar") where attributes["foo"] == "foobar"
extensions:
    health_check:
      endpoint: 0.0.0.0:13133
    pprof:
      endpoint: :1888
    zpages:
      endpoint: :55679

service:
    telemetry:
      logs:
        level: "debug"
    extensions: [pprof, zpages, health_check]
    pipelines:
      metrics:
        receivers: [splunk_hec]
        processors: [batch, transform]
        exporters: [splunk_hec/metrics]

Log output

No response

Additional context

No response

@atoulme added the bug (Something isn't working) and needs triage (New item requiring triage) labels on Sep 24, 2022
@dmitryax added the priority:p2 (Medium) and processor/transform (Transform processor) labels and removed the needs triage label on Sep 26, 2022
@github-actions (Contributor) commented:

Pinging code owners: @TylerHelmuth @kentquirk @bogdandrutu. See Adding Labels via Comments if you do not have permissions to add labels yourself.

@TylerHelmuth (Member) commented:

Ha, in trying to help users we destroyed their logs. This should definitely be logger.Debug.

@dmitryax (Member) commented Sep 26, 2022

@atoulme please add more context about the incoming data. I assume the metrics just don't always have the "foo" attribute, right? I don't think the output is incorrect in that case. Moving all of these logs to the debug level can also hide other potential issues that users should be aware of, so I'm not sure that's the right solution.

I believe the expression should have an additional condition verifying that metrics have foo attribute. Something like this:

set(attributes["foo"], "bar") where "foo" in attributes && attributes["foo"] == "foobar"

@TylerHelmuth does OTTL currently support something like this?

@atoulme (Contributor, Author) commented Sep 26, 2022

Yes: to reproduce the issue, specifically set up a pipeline whose incoming data does not have this attribute set.
At the very least, this is a breaking behavior change that wasn't documented as part of the release, and it produces surprising output for end users by flooding their logs. I have not identified a workaround to address this behavior; please help.

@kentquirk (Member) commented:

OTTL does short-circuit evaluation, and so something like

    set(attributes["foo"], "bar") where attributes["foo"] != nil && attributes["foo"] == "foobar"

should prevent the error for the time being.

@TylerHelmuth (Member) commented:

@dmitryax In this situation the invalidComparison func exists to ensure that a boolean always bubbles up based on the comparison rules. We didn't want to silently use the "default" return value of false in these invalid situations, which is where the print statement came from.

I was initially thinking this log, and others like it, would be Debug level so that customers can set their log level to debug when trying to troubleshoot issues and then see this problem. I thought Debug log level printed all log statements; does it exclude Error, Warning, and Info?

@dmitryax (Member) commented:

> @dmitryax In this situation the invalidComparison func exists to ensure that a boolean always bubbles up based on the comparison rules. We didn't want to silently use the "default" return value of false in these invalid situations, which is where the print statement came from.
>
> I was initially thinking this log, and others like it, would be Debug level so that customers can set their log level to debug when trying to troubleshoot issues and then see this problem.

Just using print is not the best option. I agree that this particular error should be at debug level. But we need to bring a logger in there, as @atoulme did.

> I thought Debug log level printed all log statements; does it exclude Error, Warning, and Info?

If a user sets the telemetry log level to Debug in their config, logs of all levels equal to Debug or higher will be printed; that's everything except the Trace level.
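
For illustration only, here is a minimal sketch of what routing this message through a zap logger at debug level could look like. The Parser type, its logger field, and the invalidComparison signature below are assumptions made for the sake of the example, not the actual collector code:

package ottl

import "go.uber.org/zap"

// Parser is a stand-in type for this sketch; the piece under discussion
// in this thread is the logger field.
type Parser struct {
	logger *zap.Logger
}

// invalidComparison reports a comparison between mismatched operand types
// at debug level instead of printing to stdout, then falls back to false
// as the comparison result. Hypothetical sketch, not the real implementation.
func (p *Parser) invalidComparison(msg string) bool {
	// Only emitted when service.telemetry.logs.level is set to debug,
	// so normal runs are not flooded.
	p.logger.Debug(msg)
	return false
}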

@TylerHelmuth (Member) commented:

> Just using print is not the best option. I agree that this particular error should be at debug level. But we need to bring a logger in there, as @atoulme did.

Agreed, his PR is what we need. @kentquirk and I have already provided feedback on next steps for that PR (member functions).

@atoulme (Contributor, Author) commented Sep 27, 2022

> OTTL does short-circuit evaluation and so something like set(attributes["foo"], "bar") where attributes["foo"] != nil && attributes["foo"] == "foobar" should prevent the error for the time being.

We tried this; it appears && is not supported and must be replaced with and.

With set(attributes["foo"], "bar") where attributes["foo"] != nil and attributes["foo"] == "foobar", the workaround works properly. Thanks!
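
For reference, a minimal sketch of how this workaround slots into the transform processor configuration from the original report (the processor, attribute names, and values are the ones used above):

processors:
    transform:
      metrics:
        queries:
          # The nil guard short-circuits before the == comparison runs on missing attributes.
          - set(attributes["foo"], "bar") where attributes["foo"] != nil and attributes["foo"] == "foobar"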

@atoulme (Contributor, Author) commented Sep 27, 2022

A further update on the workaround: with it applied, the test case above works and no more of these logs are generated.

However, please see this example:

receivers:
    splunk_hec:

exporters:
    logging:

processors:
    batch:
    attributes/example:
      actions:
        - key: foo
          action: upsert
          value: "bar"
    transform:
      logs:
        queries:
          - set(attributes["foo"], "foobar") where attributes["foo"] != nil and attributes["foo"] == "bar"

extensions:
    health_check:
      endpoint: 0.0.0.0:13133
    pprof:
      endpoint: :1888
    zpages:
      endpoint: :55679

service:
    extensions: [pprof, zpages, health_check]
    pipelines:
      logs:
        receivers: [splunk_hec]
        processors: [batch, attributes/example, transform]
        exporters: [logging]

In this example, we make sure that the value of attributes["foo"] is not nil, yet this still generates logs:

string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE
string to non-string with op NE

This is because the first check now compares a string against nil, which is treated as an invalid comparison between types.

@dmitryax (Member) commented:

OK, it looks like != nil is not the right way to handle this use case. We need "key in map" or "map has key" functionality in OTTL.
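
Purely as a sketch of the idea, reusing the condition dmitryax suggested earlier in the thread; this is hypothetical syntax that OTTL did not support at the time of this issue:

          # Hypothetical syntax only; OTTL had no key-existence check at this point.
          - set(attributes["foo"], "bar") where "foo" in attributes and attributes["foo"] == "foobar"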

@TylerHelmuth (Member) commented:

Yeah, or we could relax what we consider an invalid comparison. @kentquirk, what do you think?

@atoulme (Contributor, Author) commented Sep 30, 2022

This is now fixed in 0.61.0. Closing.

@atoulme closed this as completed on Sep 30, 2022