Codestin Search App

English | 简体中文 |

OnnxOCR

A High-Performance Multilingual OCR Engine Based on ONNX

🚀 Version Updates

2025.12.29
1. 服务层重构为FastAPI，支持ASGI高并发架构
2. 保持v1接口100%兼容，新增v2多文件处理接口
3. 新增健康检查、监控日志、并发控制等生产级功能
4. 支持多种输出格式：JSON、文本、TSV、hOCR
2025.05.21
1. Added PP-OCRv5 model, supporting 5 language types in a single model: Simplified Chinese, Traditional Chinese, Chinese Pinyin, English, and Japanese.
2. Overall recognition accuracy improved by 13% compared to PP-OCRv4.
3. Accuracy is consistent with PaddleOCR 3.0.

🌟 Core Advantages

Deep Learning Framework-Free: A universal OCR engine ready for direct deployment.
Cross-Architecture Support: Uses PaddleOCR-converted ONNX models, rebuilt for deployment on both ARM and x86 architecture computers with unchanged accuracy under limited computing power.
High-Performance Inference: Faster inference speed on computers with the same performance.
Multilingual Support: Single model supports 5 language types: Simplified Chinese, Traditional Chinese, Chinese Pinyin, English, and Japanese.
Model Accuracy: Consistent with PaddleOCR models.
Domestic Hardware Adaptation: Restructured code architecture for easy adaptation to more domestic GPUs by modifying only the inference engine.

🛠️ Environment Setup

FastAPI 服务 (推荐)

python>=3.7  

# 安装FastAPI版本依赖
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple -r requirements-fastapi.txt

传统Flask服务 (兼容)

python>=3.6  

# 安装原版依赖
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple -r requirements.txt

Note:

The Mobile version model is used by default; the PP-OCRv5_Server-ONNX model offers better performance.
The Mobile model is already in onnxocr/models/ppocrv5 and requires no download;
The PP-OCRv5_Server-ONNX model is large and uploaded to Baidu Netdisk (extraction code: wu8t). After downloading, place the det and rec models in ./models/ppocrv5/ to replace the existing ones.

🚀 One-Click Run

python test_ocr.py

📡 API Service

FastAPI 服务 (生产推荐)

启动服务

# Linux/Mac
./start_fastapi.sh

# Windows
start_fastapi.bat

# 或者手动启动
gunicorn app.main:app -k uvicorn.workers.UvicornWorker --bind 0.0.0.0:5005 --workers 4

v1 兼容接口 (与原版100%兼容)

curl -X POST http://localhost:5005/ocr \  
-H "Content-Type: application/json" \  
-d '{"image": "base64_encoded_image_data"}'

v2 新接口 (推荐)

# 单文件上传
curl -X POST http://localhost:5005/api/v2/ocr \
  -F "file=@test_image.jpg" \
  -F "model_name=PP-OCRv5" \
  -F "conf_threshold=0.6" \
  -F "output_format=json" \
  -F "bbox=true"

# 多文件上传
curl -X POST http://localhost:5005/api/v2/ocr \
  -F "[email protected]" \
  -F "[email protected]" \
  -F "output_format=text"

健康检查

curl http://localhost:5005/health        # 基本健康检查
curl http://localhost:5005/api/v2/readyz # 模型就绪检查

传统Flask服务 (兼容模式)

启动服务

python app-service.py

Test Example

Request

curl -X POST http://localhost:5005/ocr \  
-H "Content-Type: application/json" \  
-d '{"image": "base64_encoded_image_data"}'

Response

{  
  "processing_time": 0.456,  
  "results": [  
    {  
      "text": "Name",  
      "confidence": 0.9999361634254456,  
      "bounding_box": [[4.0, 8.0], [31.0, 8.0], [31.0, 24.0], [4.0, 24.0]]  
    },  
    {  
      "text": "Header",  
      "confidence": 0.9998759031295776,  
      "bounding_box": [[233.0, 7.0], [258.0, 7.0], [258.0, 23.0], [233.0, 23.0]]  
    }  
  ]  
}

🐳 Docker Deployment

FastAPI服务 (推荐)

构建镜像

docker build -t onnxocr-fastapi .

运行容器

# CPU版本（默认）
docker-compose up -d

# GPU版本（自动检测，失败时回退CPU）
docker-compose -f docker-compose.gpu.yml up -d

# 基础运行
docker run -itd --name onnxocr-service -p 5005:5005 onnxocr-fastapi

环境变量配置

docker run -itd --name onnxocr-service -p 5005:5005 \
  -e WORKERS=4 \
  -e THREADS=2 \
  -e LOG_LEVEL=INFO \
  -e DEFAULT_MODEL=PP-OCRv5 \
  -e MODEL_CONCURRENCY=8 \
  -e USE_GPU=true \
  -e MAX_UPLOAD_MB=50 \
  onnxocr-fastapi

GPU部署要求

NVIDIA Docker Runtime
CUDA兼容GPU
onnxruntime-gpu==1.14.1 (已包含在requirements中)
自动检测GPU可用性，失败时自动回退CPU推理

传统Flask服务 (兼容)

Build Image

# 使用原版Dockerfile
docker build -f Dockerfile.flask -t ocr-service .

Run Image

docker run -itd --name onnxocr-service-v3 -p 5006:5005 onnxocr-service:v3

POST Request

url: ip:5006/ocr

Response Example

{  
  "processing_time": 0.456,  
  "results": [  
    {  
      "text": "Name",  
      "confidence": 0.9999361634254456,  
      "bounding_box": [[4.0, 8.0], [31.0, 8.0], [31.0, 24.0], [4.0, 24.0]]  
    },  
    {  
      "text": "Header",  
      "confidence": 0.9998759031295776,  
      "bounding_box": [[233.0, 7.0], [258.0, 7.0], [258.0, 23.0], [233.0, 23.0]]  
    }  
  ]  
}

🌟 Effect Demonstration

Example 1	Example 2

Example 3	Example 4

Example 5	Example 6

👨💻 Contact & Communication

Career Opportunities

I am currently seeking job opportunities. Welcome to connect!

OnnxOCR Community

WeChat Group

QQ Group

🎉 Acknowledgments

Thanks to PaddleOCR for technical support!

🌍 Open Source & Donations

I am passionate about open source and AI technology, believing they can bring convenience and help to those in need, making the world a better place. If you recognize this project, you can support it via Alipay or WeChat Pay (please note "Support OnnxOCR" in the remarks).

📈 Star History

🤝 Contribution Guidelines

Welcome to submit Issues and Pull Requests to improve the project together!

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
.claude		.claude
.github/workflows		.github/workflows
app		app
docs		docs
onnxocr		onnxocr
result_img		result_img
static		static
templates		templates
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Readme.md		Readme.md
Readme_cn.md		Readme_cn.md
app-service.py		app-service.py
docker-compose.gpu.yml		docker-compose.gpu.yml
docker-compose.yml		docker-compose.yml
draw_ocr.jpg		draw_ocr.jpg
requirements-fastapi.txt		requirements-fastapi.txt
test_model_loading.py		test_model_loading.py
webui.py		webui.py

License

ding113/OnnxOCR

Folders and files

Latest commit

History

Repository files navigation

OnnxOCR

🚀 Version Updates

🌟 Core Advantages

🛠️ Environment Setup

FastAPI 服务 (推荐)

传统Flask服务 (兼容)

🚀 One-Click Run

📡 API Service

FastAPI 服务 (生产推荐)

启动服务

v1 兼容接口 (与原版100%兼容)

v2 新接口 (推荐)

健康检查

传统Flask服务 (兼容模式)

启动服务

Test Example

Request

Response

🐳 Docker Deployment

FastAPI服务 (推荐)

构建镜像

运行容器

环境变量配置

GPU部署要求

传统Flask服务 (兼容)

Build Image

Run Image

POST Request

Response Example

🌟 Effect Demonstration

👨💻 Contact & Communication

Career Opportunities

OnnxOCR Community

WeChat Group

QQ Group

🎉 Acknowledgments

🌍 Open Source & Donations

📈 Star History

🤝 Contribution Guidelines

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages