DEV-46209 - Fix evaluation headers concurrency issue #37

yasmin-tr · 2024-08-14T09:28:20Z

What is this feature?
This is a fix to bug introduced on the code adding logzio headers to the alert evaluation.
We got the exception of: fatal error: concurrent map iteration and map write that was crashing the service.
This happened more often the more alert rules were evaluated at the same time, and the exception was originating at the following place in eval.go : 321

The error means that the map object (the headers) had access and write operations at the same time.

Root Cause
The root cause for this is that the headers were added from the api request context and since the evaluate API is done in bulk for multiple alert rules at the same time, they were sharing the same headers object - which is passed as reference into the evaluation.

What Was the Fix?
To fix the concurrency issue of accessing the same object we needed to clone it.
So instead of passing same headers reference we just cloned it to be a separate reference for each alert evaluation that is sent.

** Also added the following changes:**

small refactor on how headers are built based on similar existing code
added / changed logs to contain rule and org information for easier investigations
removed in eval.go some redundant logz changes (reverted to original code)

Why do we need this feature?
The obvious reason - server should not crash because of error when running multiple alert evaluations.
We need to support big scale and evaluation of a lot of alerts a the same time.

also added the following changes: * small refactor on how headers are built based on similar existing code * added / changed logs to contain rule and org information for easier investigations * removed in eval.go some redundant logz changed (reverted to original code)

yasmin-tr · 2024-08-14T09:30:03Z

run-test

yasmin-tr requested a review from ohadza August 14, 2024 09:30

yasmin-tr self-assigned this Aug 15, 2024

ohadza approved these changes Aug 20, 2024

View reviewed changes

yasmin-tr merged commit 73b2d9e into v10.4.x-logzio Aug 20, 2024
16 of 18 checks passed

yasmin-tr deleted the DEV-46209-fix-evaluation-headers-concurrency-issue branch August 20, 2024 11:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DEV-46209 - Fix evaluation headers concurrency issue #37

DEV-46209 - Fix evaluation headers concurrency issue #37

yasmin-tr commented Aug 14, 2024 •

edited

Loading

yasmin-tr commented Aug 14, 2024

DEV-46209 - Fix evaluation headers concurrency issue #37

DEV-46209 - Fix evaluation headers concurrency issue #37

Conversation

yasmin-tr commented Aug 14, 2024 • edited Loading

yasmin-tr commented Aug 14, 2024

yasmin-tr commented Aug 14, 2024 •

edited

Loading