processor/stream: server blocks on stream end when batch size < 10 #3265

axw · 2020-01-31T03:28:28Z

The processor/stream code reads in batches of 10 events at a time:

apm-server/processor/stream/processor.go

Line 279 in 17e0f7a

    
           transformables, done := p.readBatch(ctx, ipRateLimiter, batchSize, jsonReader, res)

Once a batch is received, they are dispatched to the publisher, which transforms and sends them through the libbeat pipeline to be recorded in Elasticsearch.

By default, agents will close the stream after 10 seconds, or after it reaches a certain size (~750K). So if an agent sends fewer than 10 events, the processor/stream code will generally block waiting for the stream to end before it dispatches to the publisher.

We should consider adding a timeout (or context with timeout) to the StreamReader.Read method to avoid this.

The text was updated successfully, but these errors were encountered:

axw · 2020-09-10T09:28:38Z

master...axw:processor-concurrent-read

In this branch I have modified processor/stream to:

decode into map[string]interface{}s concurrently with validation and translation into model types (would partially address Intake v2: investigate parallelizing decode and validate #1285, but see below)
report events in batches when either: we have a minimum of 10 events, 1 second passes, or the stream ends (addresses this issue)

On master with heavy.ndjson the benchmark I get ~19MB/s, with this branch I get ~27MB/s. Once #3551 is done, as mentioned in #1285 (comment), it would no longer be possible to parallelise decode/validate; but I expect validation will be so fast that it won't matter.

We can come back to this once #3551 is done.

graphaelli added [zube]: Inbox enhancement and removed [zube]: Inbox labels Feb 5, 2020

zube bot added [zube]: Blocked and removed [zube]: Backlog labels Feb 5, 2020

simitt removed the [zube]: Backlog label Dec 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

processor/stream: server blocks on stream end when batch size < 10 #3265

processor/stream: server blocks on stream end when batch size < 10 #3265

axw commented Jan 31, 2020

axw commented Sep 10, 2020 •

edited

Loading

processor/stream: server blocks on stream end when batch size < 10 #3265

processor/stream: server blocks on stream end when batch size < 10 #3265

Comments

axw commented Jan 31, 2020

axw commented Sep 10, 2020 • edited Loading

axw commented Sep 10, 2020 •

edited

Loading