
(batchprocessor): forking batchprocessor #79

Merged: 3 commits into open-telemetry:main, Nov 3, 2023

Conversation

@moh-osman3 (Contributor) commented Nov 3, 2023

Splitting #71 into two PRs. This first PR simply forks the batchprocessor into this repository before further changes are applied on top.

Part of #80.

@moh-osman3 marked this pull request as ready for review on November 3, 2023, 16:37
errors = multierr.Append(errors, err)

bpt.timeoutTriggerSend, err = meter.Int64Counter(
processorhelper.BuildCustomMetricName(typeStr, "timeout_trigger_send"),
Contributor

I'm not very familiar with the batch processor, so my comment might be off-base. According to the instrumentation, the batch processor has two types of triggers: timeout and max batch size reached. I'm wondering if that's sufficient to control the memory usage of this processor. What happens if the number of distinct metadata value combinations is very high? Could we introduce a third type of trigger that sends the current batches when the total number of entries across all batches reaches a specific threshold in order to keep the overall memory usage under control?
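For illustration only, here is a hypothetical sketch of what such a third trigger could look like; none of these names come from the forked batchprocessor code, and this is not a proposal of actual API:

```go
// Hypothetical sketch of the proposed third trigger: flush all pending batches
// when the total number of entries across every per-metadata batch reaches a cap.
// All identifiers here are illustrative, not part of the forked code.
package triggersketch

type sendTrigger int

const (
	triggerTimeout      sendTrigger = iota // the batch timeout elapsed
	triggerBatchSize                       // a single batch reached its configured max size
	triggerTotalEntries                    // total entries across all batches hit the cap
)

// shouldFlushAll reports whether the proposed global trigger should fire.
func shouldFlushAll(totalEntries, maxTotalEntries int) bool {
	return maxTotalEntries > 0 && totalEntries >= maxTotalEntries
}
```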

Contributor

Please see #80.

Contributor (Author)

Thanks for the review! Yes, this is the first in a series of PRs to enhance the processor. One way of controlling in-flight memory usage is that the processor blocks requests from being added (https://github.com/open-telemetry/opentelemetry-collector/blob/main/processor/batchprocessor/batch_processor.go#L271) when the processing queue is full. In a later PR, memory efficiency will be improved by controlling admission to the queue with a semaphore based on the size (in bytes) of the request. I'm not sure a new trigger will be necessary, but we'll keep it in mind as we implement our enhancements and observe memory usage in testing.
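As a rough illustration of the semaphore idea described above (a minimal sketch under assumed names, not the eventual implementation in the later PR):

```go
// Sketch of size-based admission control using a weighted semaphore.
// maxInFlightBytes and requestBytes are illustrative names, not from the PR.
package admission

import (
	"context"

	"golang.org/x/sync/semaphore"
)

type admissionController struct {
	sem *semaphore.Weighted // capacity expressed in bytes
}

func newAdmissionController(maxInFlightBytes int64) *admissionController {
	return &admissionController{sem: semaphore.NewWeighted(maxInFlightBytes)}
}

// admit blocks until requestBytes of capacity is available (or ctx is canceled),
// back-pressuring callers when too much data is already in flight.
func (a *admissionController) admit(ctx context.Context, requestBytes int64) error {
	return a.sem.Acquire(ctx, requestBytes)
}

// release returns capacity once the request has been exported.
func (a *admissionController) release(requestBytes int64) {
	a.sem.Release(requestBytes)
}
```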

"go.opentelemetry.io/collector/pdata/plog"
)

// splitLogs removes logrecords from the input data and returns a new data of the specified size.
Contributor

Just curious: what happens to the log records located beyond the specified size?

Contributor

The current behavior of this code, i.e., what is being forked in this PR by copying the core batchprocessor, is behavior we want to change. Currently, the batch processor can send at most one batch in parallel, so it requires certain downstream behavior (i.e., the exporterhelper's queue sender) to function well.

You'll see more about this in a subsequent PR, where we fix that behavior. The high-level picture that's missing from this dump of copied code is that it uses pdata objects to accumulate points in FIFO order. Incoming data is assembled into a single pending pdata object; when the pending size grows too large, it is split by taking the first points out into a new batch, leaving a residual pdata object. We will ensure that the points in the residual are always the first to be sent next, which means each point waits up to, but no more than, one timeout.
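A minimal sketch of the split-and-residual idea described above; the real code operates on pdata objects such as plog.Logs rather than plain slices, and the names here are illustrative:

```go
// Illustrative sketch of FIFO batch splitting with a residual.
package batchsketch

// splitPending takes up to sendSize items from the front of pending and
// returns them as the outgoing batch, leaving the remainder as the residual.
// The residual stays at the head of the queue, so its items are always the
// first to go out on the next trigger (size or timeout).
func splitPending[T any](pending []T, sendSize int) (batch, residual []T) {
	if len(pending) <= sendSize {
		return pending, nil
	}
	return pending[:sendSize], pending[sendSize:]
}
```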

@jmacd merged commit c6161e5 into open-telemetry:main on Nov 3, 2023
2 checks passed