1 unstable release
| 0.1.0 | Mar 17, 2026 |
|---|
#2616 in Text processing
Used in cairn-extract
120KB
3K
SLoC
Artifact loading and normalization for Cairn ingestion.
Phase 3A keeps runtime behavior markdown-first while introducing the
DocumentArtifact -> ExtractedText/Layout -> NormalizedDocument boundary
that later OCR/PDF/image backends can plug into.
Dependencies
~25–48MB
~759K SLoC