[adapters] Fix data interleaving in HTTP ingress connector. #5498
Conversation
Thanks to Bruno Rucy @brurucy and Abhinav Gyawali @abhizer for help with this issue.
Fixes: #3495
Signed-off-by: Ben Pfaff <[email protected]>
Pull request overview
This PR fixes a data interleaving issue in the HTTP ingress connector by moving the StreamSplitter from shared endpoint state to per-request local scope.
Changes:
- Removed `StreamSplitter` from the `HttpInputEndpointDetails` struct to prevent concurrent requests from sharing state (sketched below)
- Refactored the `push` method to accept parsed chunks directly instead of managing splitting logic
- Created a new `StreamSplitter` instance within the request handling loop to ensure each request has its own isolated splitter
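As a rough illustration of that scoping change, here is a minimal, self-contained sketch. The type and function names are invented stand-ins, not the connector's actual API, and the newline-based splitting is a simplification of what `StreamSplitter` does for JSON with default settings.

```rust
// Schematic only: `SharedEndpointState` and `handle_request` are invented
// stand-ins for illustration, not the real connector types.

// Before: one splitter buffer lived on the shared endpoint state, so every
// in-flight request appended its bytes into the same buffer.
#[allow(dead_code)]
struct SharedEndpointState {
    splitter_buffer: Vec<u8>, // shared across requests (the bug)
}

// After: the endpoint keeps no splitter; each request builds its own and
// hands only complete, newline-terminated records to `push`.
fn handle_request(body_chunks: Vec<Vec<u8>>, mut push: impl FnMut(&[u8])) {
    let mut splitter_buffer: Vec<u8> = Vec::new(); // fresh per request
    for chunk in body_chunks {
        splitter_buffer.extend_from_slice(&chunk);
        // Emit every complete record; keep any partial record buffered.
        while let Some(pos) = splitter_buffer.iter().position(|&b| b == b'\n') {
            let rest = splitter_buffer.split_off(pos + 1);
            let record = std::mem::replace(&mut splitter_buffer, rest);
            push(record.as_slice());
        }
    }
}
```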
    while let Some(chunk) = splitter.next(eoi) {
        num_errors += self.push(chunk, &mut errors, timestamp);
    }
Copilot (AI) commented on Jan 23, 2026:
The timestamp variable is captured once before the loop but reused for all chunks. If the splitter produces multiple chunks, they will all receive the same timestamp even though they may be processed at slightly different times. Consider capturing a fresh timestamp inside the loop for each chunk to ensure accurate temporal ordering of events.
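If that suggestion were taken, the loop might look roughly like the sketch below. `now_millis` is an invented stand-in for however the connector actually obtains event timestamps, and the loop itself is quoted in comment form because its surrounding types are not reproduced here.

```rust
use std::time::{SystemTime, UNIX_EPOCH};

// Stand-in timestamp source, for illustration only.
fn now_millis() -> u64 {
    SystemTime::now()
        .duration_since(UNIX_EPOCH)
        .expect("system clock is before the Unix epoch")
        .as_millis() as u64
}

// Sketch of the suggested variant (as comments, since the real `splitter`,
// `push`, and error types are not reproduced here):
//
//     while let Some(chunk) = splitter.next(eoi) {
//         let timestamp = now_millis(); // fresh timestamp per chunk
//         num_errors += self.push(chunk, &mut errors, timestamp);
//     }
```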
    }
    Ok(None) => true,
    };
    while let Some(chunk) = splitter.next(eoi) {
So the fix is because this loop cannot be interrupted?
It's more than that. Each call to complete_request receives a payload, which gets read chunk by chunk. If the request is short, there's only one chunk total, and if that chunk ends in a newline (which is what the splitter looks for in the case of JSON with default settings), then push will split it properly. But if the request is long and the chunks do not end in newlines (they would end that way only by luck), then push will feed in all the full records in the current chunk and leave the start of the next record in its buffer. Then, the next chunk from any request will be appended. Obviously, the start of one record followed by part of another record from a different source will cause something bad to happen, and that's what was happening.
By making each request break its input data into full records, and then only passing the full records to the parser, we avoid the problem.
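As a concrete illustration of that failure mode, here is a small, self-contained toy model (not the connector's code; the newline splitting below is a simplification of what StreamSplitter does for JSON with default settings). Two requests each send one record split across chunks that do not end in a newline; with a single shared buffer the emitted "records" are interleaved garbage, while per-request buffers emit clean records.

```rust
fn main() {
    // Two concurrent requests, each sending one JSON record split across
    // two body chunks that do not end in a newline.
    let request_a = [&b"{\"name\": \"al"[..], &b"ice\"}\n"[..]];
    let request_b = [&b"{\"name\": \"bo"[..], &b"b\"}\n"[..]];

    // Buggy shape: one splitter buffer shared by both requests. Chunks
    // arrive interleaved, so the tail of one record gets glued onto bytes
    // from the other request.
    let mut shared = Vec::new();
    for chunk in [request_a[0], request_b[0], request_a[1], request_b[1]] {
        shared.extend_from_slice(chunk);
        emit_records(&mut shared, "shared splitter");
    }

    // Fixed shape: each request owns its splitter, so only its own chunks
    // are ever appended to its buffer and every record comes out intact.
    for (name, request) in [("request A", request_a), ("request B", request_b)] {
        let mut own = Vec::new();
        for chunk in request {
            own.extend_from_slice(chunk);
            emit_records(&mut own, name);
        }
    }
}

// Pop every newline-terminated record out of `buffer` and print it.
fn emit_records(buffer: &mut Vec<u8>, source: &str) {
    while let Some(pos) = buffer.iter().position(|&b| b == b'\n') {
        let rest = buffer.split_off(pos + 1);
        let record = std::mem::replace(buffer, rest);
        println!("{source}: {:?}", String::from_utf8_lossy(&record[..record.len() - 1]));
    }
}
```

Running this, the shared-splitter half prints a corrupted record such as `{"name": "al{"name": "boice"}` followed by the orphaned fragment `b"}`, while the per-request half prints `{"name": "alice"}` and `{"name": "bob"}` cleanly.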
I will test this as soon as it is merged. In my use case I routinely see tens of parsing failures every day.
It's easy to overlook an issue if it's not solved quickly. Please ping us again if you have bugs like this one so we can assign them higher priority.