fix: Second attempt at fixing trace propagation in Celery 4.2+ #831
Conversation
-    kwarg_headers = kwargs.setdefault("headers", {})
+    # Note: kwargs can contain headers=None, so no setdefault!
+    # Unsure which backend though.
+    kwarg_headers = kwargs.get("headers") or {}
     kwarg_headers.update(headers)
This line is currently crashing all the time (inside capture_internal_exceptions) because the old code could not deal with kwargs == {"headers": None}, only with kwargs == {}.
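A minimal sketch of the difference (placeholder header values, not the PR's exact code): with setdefault, a pre-existing headers=None is returned as-is and the following update crashes, while the get-or-{} form tolerates both shapes of kwargs.

kwargs = {"headers": None}

# old approach: the key already exists, so setdefault hands back the stored None
kwarg_headers = kwargs.setdefault("headers", {})
# kwarg_headers.update({"sentry-trace": "..."})  # AttributeError: 'NoneType' object has no attribute 'update'

# new approach: falls back to a fresh dict for both {} and {"headers": None}
kwarg_headers = kwargs.get("headers") or {}
kwarg_headers.update({"sentry-trace": "..."})  # placeholder header value
kwargs["headers"] = kwarg_headers  # written back explicitly in this sketch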
    kwarg_headers.update(headers)

    # https://github.com/celery/celery/issues/4875
    #
    # Need to setdefault the inner headers too since other
    # tracing tools (dd-trace-py) also employ this exact
    # workaround and we don't want to break them.
    #
It turns out this is perfectly reproducible. The bug we're working around lives in celery.app.amqp, but it seems that module is also used with the Redis backend; to be honest, I no longer understand how Celery separates its concerns.
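For context, a hedged illustration of the double-"headers" workaround referenced in the diff above (illustrative function and variable names, not the SDK's exact code):

def add_trace_headers(kwargs, trace_headers):
    # outer headers on the apply_async call; may be missing or None
    kwarg_headers = kwargs.get("headers") or {}
    kwarg_headers.update(trace_headers)

    # Celery's amqp layer can end up exposing a nested "headers" dict to the
    # worker (celery/celery#4875); other tracing tools such as dd-trace-py
    # write their keys there as well, hence setdefault instead of overwriting
    kwarg_headers.setdefault("headers", {}).update(trace_headers)

    kwargs["headers"] = kwarg_headers
    return kwargs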
@@ -42,6 +42,7 @@ def inner(propagate_traces=True, backend="always_eager", **kwargs):

     # this backend requires capture_events_forksafe
     celery.conf.worker_max_tasks_per_child = 1
+    celery.conf.worker_concurrency = 1
This speeds up the tests; otherwise Celery forks 20 times.
Speeding up the tests sounds good, but won't this affect which bugs and code paths the tests surface?
We haven't had any bugs related to this so far, and my gut feeling says no. There is no direct communication between the forked processes, so I'd say it's unlikely, and the tests never send high event volumes anyway.
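For reference (a hedged gloss, not taken from the PR itself), what the two settings from the hunk above do:

# each prefork child is replaced after executing a single task
celery.conf.worker_max_tasks_per_child = 1

# the prefork pool spawns a single child; the default is one per CPU core,
# which is presumably where the "forks 20 times" came from
celery.conf.worker_concurrency = 1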
@request.addfinalizer
def _():
    assert not in_process_events

capture_events()
This assertion needs to be removed because we now actually do get events in the same process.
We need to call capture_events, otherwise our transport will raise an error (this fixture setup is overdue for a refactor...).
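A rough sketch of the fixture tail after this change (assumed shape; only the capture_events() call and the dropped assertion come from the diff and comment above):

@request.addfinalizer
def _():
    # the old `assert not in_process_events` is intentionally gone:
    # with this setup, events can legitimately arrive in the test process
    pass

# capture_events() wires up the test transport; without it the transport
# raises as soon as an event is sent
capture_events()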
It turns out the bug is perfectly reproducible with the Redis backend; I just messed up the test.
apply_async was basically patched at the wrong point in time, and as such the trace propagation didn't work out (a rough sketch of the patch shape follows below).
Sorry, y'all, for dropping this on you with so little context; we can go through it tomorrow.
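A hedged sketch of the general patching idea being described, with illustrative names; the real integration derives the headers from the current scope and handles more edge cases, so treat this as a shape, not the SDK's code:

from celery.app.task import Task

old_apply_async = Task.apply_async

def sentry_apply_async(self, args=None, kwargs=None, **options):
    # placeholder for whatever trace headers the SDK would propagate
    trace_headers = {"sentry-trace": "<trace-id>-<span-id>-<sampled>"}

    # tolerate both a missing "headers" option and headers=None
    kwarg_headers = options.get("headers") or {}
    kwarg_headers.update(trace_headers)

    # same nested-"headers" workaround as in the diff above
    kwarg_headers.setdefault("headers", {}).update(trace_headers)
    options["headers"] = kwarg_headers

    return old_apply_async(self, args, kwargs, **options)

# the "patched at the wrong point in time" remark above refers to this
# assignment happening too late relative to the test setup
Task.apply_async = sentry_apply_async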
Follow-up to #824 #825