
Long lines are unexpectedly split into multiple OTEL Log records #35042

Closed
cwegener opened this issue Sep 5, 2024 · 6 comments · Fixed by #37596

Comments

cwegener (Contributor) commented Sep 5, 2024

Component(s)

pkg/stanza/fileconsumer, receiver/filelog

What happened?

Description

When using the filelog receiver, any lines that are longer than the default buffer size (16KB) will be split into multiple log records instead of one single log record.

Steps to Reproduce

  1. Create a simple default filelog receiver config
  2. Ingest a file containing a line longer than 16KB
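A quick way to produce such an input (a sketch; the path matches the `include` pattern in the config below, and the 20KB size is illustrative) is to write a single long JSON line with no trailing newline:

```shell
# Reproduce: write one ~20KB JSON line with NO trailing newline
# (printf's format string deliberately omits \n)
printf '{"msg":"%s"}' "$(head -c 20000 /dev/zero | tr '\0' 'a')" \
  > /tmp/metrics_101089747.json
wc -c /tmp/metrics_101089747.json   # ~20KB, larger than the 16KB default buffer
```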

Expected Result

Lines longer than 16KB are emitted as one single OTEL Log record

Actual Result

Lines longer than 16KB get emitted as multiple OTEL Log records, each record getting split at the 16KB boundary.

Collector version

v0.108.0

Environment information

Environment

OS: Archlinux
Compiler(if manually compiled): go 1.23.0

OpenTelemetry Collector configuration

receivers:
  filelog:
    start_at: beginning
    include:
      - "/tmp/metrics_101089747.json"

exporters:
  debug:
    verbosity: detailed
service:
  pipelines:
    logs/raw:
      receivers:
        - filelog
      exporters:
        - debug

Log output

No response

Additional context

No response

@cwegener cwegener added bug Something isn't working needs triage New item requiring triage labels Sep 5, 2024
github-actions bot commented Sep 5, 2024

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.

@VihasMakwana (Contributor) commented:

Although the buffer size is 16KB, the maximum log size is 1024KB.
The incomplete token issue might be due to the flush timeout expiring. Could you try setting force_flush_period to 0 and see if that resolves the problem?
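Applied to the config from the issue, the suggestion would look roughly like this (a sketch; `force_flush_period` and `max_log_size` are existing filelog receiver options, but the comments reflect the behavior under discussion rather than verified documentation for this version):

```yaml
receivers:
  filelog:
    start_at: beginning
    include:
      - "/tmp/metrics_101089747.json"
    force_flush_period: 0  # disable the flush timeout, per the suggestion above
    # max_log_size defaults to 1MiB; unterminated lines up to this size
    # should then be emitted as a single record
```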

cwegener (Contributor, Author) commented Sep 6, 2024

The bug still behaves the same when using different values for the flush timeout.

Although, I have not actually tested a value of 0 ... I wonder what 0 means?

cwegener (Contributor, Author) commented Sep 6, 2024

Ok.

Did another quick test and the issue is:

  • the nature of the input file is causing the problem
  • the input files I am using (generated by the awss3exporter contrib component) are missing a \n at the end of the line, which triggers the behaviour I originally observed above

Adding a \n at the end of the line fixes the issue as expected.

I'll start investigating why the \n is missing in the files generated by https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/exporter/awss3exporter

cwegener (Contributor, Author) commented Sep 6, 2024

Closing this issue, as the filelog receiver is behaving as expected (logs are force-flushed if no log line separator token is seen within the force flush period).

djaglowski (Member) commented Jan 21, 2025

I'm reopening this issue as I've stumbled onto the same behavior.

To clarify, the issue occurs when a log is:

  1. Longer than the default buffer size (16KB)
  2. Shorter than the max log size
  3. NOT terminated

What happens is that the flush function kicks in and ejects the token.

What I do not understand yet is why the buffer size does not increase up to max log size before the ejection occurs. If it did, then the ejected token would be the entire log (even though it is not explicitly terminated).

@djaglowski djaglowski reopened this Jan 21, 2025
@andrzej-stencel andrzej-stencel removed the needs triage New item requiring triage label Feb 3, 2025
chengchuanpeng pushed a commit to chengchuanpeng/opentelemetry-collector-contrib that referenced this issue Feb 8, 2025
…pen-telemetry#37596)

Fixes open-telemetry#35042 (and open-telemetry#32100 again)

The issue affected unterminated logs of particular lengths.
Specifically, longer than our internal `scanner.DefaultBufferSize`
(16kB) and shorter than `max_log_size`.

The failure mode was described in open-telemetry#32100 but was apparently only fixed
in some circumstances. I believe this is a more robust fix. I'll
articulate the exact failure mode again here:
1. During a poll cycle, `reader.ReadToEnd` is called. Within this, a
scanner is created which starts with a default buffer size. The buffer
is filled, but no terminator is found. Therefore the scanner resizes the
buffer to accommodate more data, hoping to find a terminator.
Eventually, the buffer is large enough to contain all content until EOF,
but still no terminator was found. At this time, the flush timer has not
expired, so `reader.ReadToEnd` returns without emitting anything.
2. During the _next_ poll cycle, `reader.ReadToEnd` creates a new
scanner, also with the default buffer size. The first time it looks for a
terminator, it of course doesn't find one, but at this time the flush
timer has expired. Therefore, instead of resizing the buffer and
continuing to look for a terminator, it just emits what it has.

What should happen instead is the scanner continues to resize the buffer
to find as much of the unterminated token as possible before emitting
it. Therefore, this fix introduces a simple layer into the split func
stack which allows us to reason about unterminated tokens more
carefully. It captures the length of unterminated tokens and ensures
that when we recreate a scanner, we will start with a buffer size that
is appropriate to read the same content as last time, plus one
additional byte. The extra byte allows us to check if new content has
been added, in which case we will resume resizing. If no new content is
found, the flusher will emit the entire unterminated token as one.