
Conversation

@edeandrea
Contributor

  • Introduced S3Source and S3Target for S3-based document processing.
  • Refactored chunking request hierarchy for better reusability.
  • Added test coverage for S3 workflow validation.
  • Updated DoclingServeApi to support S3 credentials and configuration.
  • Enhanced documentation to include S3-based examples.

Fixes #254

@edeandrea
Contributor Author

edeandrea commented Dec 28, 2025

There is no way to exercise the S3 source/target against a "real" docling-serve instance without a full Docling environment that has Kubeflow set up (see docling-project/docling-serve#462).

Because of this, the tests I wrote use WireMock only to verify that the request payload "looks" correct. Unlike all the other tests, I have no way to check whether the feature actually works end to end.

I tried using Testcontainers and LocalStack, but docling itself requires a Kubeflow setup running on Kubernetes, which I can't replicate in a simple test.
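
For readers unfamiliar with this kind of payload-only check, here is a minimal sketch of the idea, not the tests from this PR. It uses the JDK's built-in `HttpServer` as a stand-in for WireMock so that it runs without extra dependencies. The class name, the `/convert` path, and every JSON field name (`kind`, `bucket`, `key_prefix`, `target`) are assumptions made for illustration; they are not confirmed to match the docling-serve schema or the `S3Source`/`S3Target` classes added here.

```java
import com.sun.net.httpserver.HttpServer;
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.util.concurrent.atomic.AtomicReference;

class S3PayloadStubSketch {

    // Hypothetical request body: field names are illustrative assumptions,
    // not the confirmed docling-serve request schema.
    static final String REQUEST_JSON =
        "{\"sources\":[{\"kind\":\"s3\",\"bucket\":\"in-bucket\",\"key_prefix\":\"docs/\"}],"
        + "\"target\":{\"kind\":\"s3\",\"bucket\":\"out-bucket\",\"key_prefix\":\"converted/\"}}";

    /** Starts a throwaway stub server, POSTs the body to it, and returns what the stub received. */
    static String postAndCapture(String body) throws IOException, InterruptedException {
        AtomicReference<String> captured = new AtomicReference<>();
        // Port 0 asks the OS for any free port on the loopback interface.
        HttpServer stub = HttpServer.create(
            new InetSocketAddress(InetAddress.getByName("127.0.0.1"), 0), 0);
        stub.createContext("/convert", exchange -> {
            // Record the raw request body so the caller can inspect it afterwards.
            captured.set(new String(exchange.getRequestBody().readAllBytes(), StandardCharsets.UTF_8));
            byte[] reply = "{}".getBytes(StandardCharsets.UTF_8);
            exchange.sendResponseHeaders(200, reply.length);
            try (var out = exchange.getResponseBody()) {
                out.write(reply);
            }
        });
        stub.start();
        try {
            HttpClient client = HttpClient.newBuilder()
                .version(HttpClient.Version.HTTP_1_1)
                .build();
            HttpRequest request = HttpRequest.newBuilder(
                    URI.create("http://127.0.0.1:" + stub.getAddress().getPort() + "/convert"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
            client.send(request, HttpResponse.BodyHandlers.ofString());
            return captured.get();
        } finally {
            stub.stop(0);
        }
    }

    /** Shape check only: says nothing about whether a real server would accept the request. */
    static boolean looksLikeS3Request(String json) {
        return json.contains("\"kind\":\"s3\"") && json.contains("\"target\"");
    }

    public static void main(String[] args) throws Exception {
        String received = postAndCapture(REQUEST_JSON);
        if (!looksLikeS3Request(received)) {
            throw new AssertionError("unexpected payload: " + received);
        }
        System.out.println("payload looks correct");
    }
}
```

As the comment above notes, a check like this only confirms what the client sends. It cannot tell whether a real docling-serve instance would accept the request or read from and write to S3.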

@github-actions

github-actions bot commented Dec 28, 2025

JaCoCo coverage report

Overall Project 45.72% 🔴

There is no coverage information present for the Files changed

@github-actions

github-actions bot commented Dec 28, 2025

| | Tests | Passed ✅ | Skipped | Failed |
|---|---|---|---|---|
| Gradle Test Results (all modules & JDKs) | 927 ran | 927 passed | 0 skipped | 0 failed |

| Test | Result |
|---|---|
| No test annotations available | |

@github-actions

HTML test reports are available as workflow artifacts (zipped HTML).

• Download: Artifacts for this run

- Introduced `S3Source` and `S3Target` for S3-based document processing.
- Refactored chunking request hierarchy for better reusability.
- Added test coverage for S3 workflow validation.
- Updated `DoclingServeApi` to support S3 credentials and configuration.
- Enhanced documentation to include S3-based examples.

Fixes docling-project#254

Signed-off-by: Eric Deandrea <[email protected]>
@edeandrea edeandrea enabled auto-merge (squash) December 29, 2025 13:43
@edeandrea edeandrea merged commit fcf586f into docling-project:main Dec 29, 2025
20 checks passed

@docling-java-ops docling-java-ops bot added the released Issue has been released label Jan 1, 2026
@docling-java-ops
Contributor

🎉 This issue has been resolved in v0.4.2 (Release Notes)

Development

Successfully merging this pull request may close these issues.

Feature Request: Support Additional Data Sources (S3, SharePoint, etc.(native support)) other than FileSource and HTTPSource