-
kreuzberg
High-performance document intelligence library for Rust. Extract text, metadata, and structured data from PDFs, Office documents, images, and 50+ formats with async/sync APIs.
-
ocrs
OCR engine
-
tesseract
Higher-level bindings for Tesseract OCR
-
nameback
Rename files based on their metadata with multi-language OCR, HEIC support, and video frame extraction
-
subtile-ocr
Converts DVD VOB subtitles to SRT subtitles with Tesseract OCR
-
usls
integrated with ONNXRuntime, providing a collection of ML models
-
macocr
An OCR Tool using Apple's Vision Framework API
-
tesseract-plumbing
Safe wrapper of tesseract-sys
-
oar-ocr
A comprehensive OCR library built in Rust with ONNX Runtime for efficient inference
-
kalosm
interface for pretrained AI models
-
oar-ocr-vl
Vision-Language models for oar-ocr
-
paddle-ocr-rs
call Paddle OCR models via ONNX Runtime for image text recognition
-
udataframe_rs
A pure Rust library for data frame operations, particularly useful for processing data extracted from PDF files or recognized via OCR
-
cooklang-import
importing recipes into Cooklang format
-
ruvector-scipix
Rust OCR engine for scientific documents - extract LaTeX, MathML from math equations, research papers, and technical diagrams with ONNX GPU acceleration
-
oneocr-rs
binding for OneOCR, an embedded OCR engine in Windows 11 Snipping Tool
-
ghostai
Your second brain at the computer
-
rustocr
High-performance Rust CLI for EasyOCR with 80+ language support, featuring server mode and batch processing. Fastly Built by FastBuilder.AI
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
xycut-plus-plus
High-performance reading order detection for document layout analysis using XY-Cut++ algorithm
-
leptess
Productive Rust binding for Tesseract and Leptonica
-
pure-onnx-ocr-sync
【Sync Version】Pure Rust OCR pipeline that runs PaddleOCR DBNet + SVTR ONNX models without C/C++ dependencies
-
koharu
Manga translation tools
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
chrome_lens_ocr
Port of chrome-lens-py used in Mangatan
-
ocrs-cli
OCR CLI tool for extracting text from images
-
ddddocr
OCR for captcha recognition, ported from Python ddddocr
-
herzfeld
High-fidelity Epigraphic Rendering for Zonated Feature Extraction and Labelled Datasets
-
br-ocr
ocr
-
winocr
An OCR Tool using Windows.Media.Ocr.OcrEngine API
-
image-anonymizer
A command-line tool to detect and mask sensitive content in images
-
pure-onnx-ocr
Pure Rust OCR pipeline that runs PaddleOCR DBNet + SVTR ONNX models without C/C++ dependencies
-
kalosm-ocr
interface for pretrained OCR models
-
uni-ocr
Native OCR for macOS, Windows, Linux
-
tesseract-sys
Rust Bindings for Tesseract OCR
-
lisudoku-ocr
Detecting sudoku grids from images
-
kreuzberg-tesseract
Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements
-
kalosm-vision
A set of pretrained vision models
-
koharu-renderer
Manga translation tools
-
tesseract-static
STATICALLY LINKED tesseract + leptonica bindings for easy inclusion of tesseract-ocr in binary applications
-
cuda-rt
Manga translation tools
-
fx-mistral
leverage the Mistral API for OCR and data extraction from PDFs
-
bpm-ocr
attempting to extract a blood pressure monitor reading from an image using opencv
-
koharu-models
Manga translation tools
-
parser-core
extracting text from various file formats including PDF, DOCX, XLSX, PPTX, images via OCR, and more
-
ocrmypdf-rs
An SDK for the ocrmypdf command line tool
-
yas_scanner
Genshin Impact item scanner
-
surya
multilingual document OCR toolkit, original implementation in Python and PyTorch
-
lama
Manga translation tools
-
koharu-runtime
Manga translation tools
-
comic-text-detector
Manga translation tools
-
manga-ocr
Manga translation tools
-
oar-ocr-core
Core types and predictors for oar-ocr
-
nameback-core
Core library for nameback - intelligent file renaming based on metadata
-
imgthin
A fast parallel algorithm for thinning digital patterns
-
vobsubocr
Converts DVD VOB subtitles to SRT subtitles with Tesseract OCR
-
ocr_b_checksum
Generates OCR B Checksums
-
koharu-core
Manga translation tools
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
stroke-width-transform
Stroke Width Transform for OCR image preprocessing
-
veryfi
Module for communicating with the Veryfi OCR API
-
paddleocr_rs
PaddleOCR v4 inference with ONNX Runtime
-
advent-of-code-ocr
Small crate to help convert AoC text screens to strings
-
bce-ocr
unofficial rust OCR SDK of Baidu AI Cloud
-
win_ocr
do OCR on Windows
-
hocr-parser
A parser for the hOCR format
-
pic2txt
OCR using the Windows library
-
ocr_latin_vocabulary
Convert OCR Latin GCSE Defined vocabulary lists to a simple format
-
spellcheck-rs
A fast spellchecker written in Rust with Python bindings, optimized for OCR error correction
-
ConExpression
My first Rust API
-
tesseract-native
Native tesseract-ocr library for executable application. Rebuild from: https://github.com/fschutt/tesseract-static-rs by fschutt
-
retrochoir
A match result retrospection tool for games that uses OCR