Handwritten Essay Annotator

A computer vision and text matching-based tool for automatically identifying and annotating errors and highlights in handwritten text.

Example

The image above shows an annotated handwritten essay with error markers (red underlines) and highlights (green backgrounds).

Features

OCR-based handwritten text recognition
Fuzzy text matching algorithm for improved recognition accuracy
Intelligent annotation system with error marking and content highlighting
Dynamic text position adjustment to avoid annotation overlap
API interface support for easy integration into other systems

Tech Stack

Python
OpenCV
FastAPI
PIL (Python Imaging Library)
FuzzyWuzzy (Text Matching)

Installation

Clone the repository

git clone [repository-url]
cd handwritten-essay-annotator

Install dependencies

pip install -r requirements.txt

Set up configuration

cp config.py.example config.py
# Edit config.py and fill in your Baidu OCR API credentials
# Get your API credentials at: https://cloud.baidu.com/product/ocr

Usage

Main Annotation Service

The project provides two versions of the annotation service:

Option 1: Modular Version (Recommended)

# Runs on port 8006
python main1.py

Option 2: Standalone Version

# Runs on port 8005
python main.py

Testing the API

# Test the annotation service (connects to port 8006 by default)
python api_test.py

Image Preprocessing Service (Optional)

# Runs on port 8002 - for image rotation and alteration removal
python image_process.py

Input Format

You need to provide:

Handwritten text image for annotation
Reference text for text matching
- Error text snippets
- Error types
- Correction explanations
- Highlight content markers

Output

Annotated text image with:
- Error markers (with numbering)
- Correction explanations
- Content highlights

Configuration

API Credentials

Before using this tool, you need to configure your Baidu OCR API credentials:

Copy config.py.example to config.py
Register for a Baidu Cloud account and create an OCR application at https://cloud.baidu.com/product/ocr
Fill in your client_id and client_secret in config.py

Service Ports

The default ports used by different services:

main.py: Port 8005
main1.py: Port 8006 (default for api_test.py)
image_process.py: Port 8002

You can modify the ports in each file's if __name__ == '__main__': section if needed.

Examples

The examples/ folder contains sample images.

License

MIT

Contributing

Issues and Pull Requests are welcome to help improve this project.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
examples		examples
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
annotation.png		annotation.png
api_test.py		api_test.py
config.py.example		config.py.example
draw.py		draw.py
draw2.py		draw2.py
image_process.py		image_process.py
main.py		main.py
main1.py		main1.py
ocr.py		ocr.py
requirements.txt		requirements.txt
text_matching.py		text_matching.py
text_rendering.py		text_rendering.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Handwritten Essay Annotator

Example

Features

Tech Stack

Installation

Usage

Main Annotation Service

Testing the API

Image Preprocessing Service (Optional)

Input Format

Output

Configuration

API Credentials

Service Ports

Examples

License

Contributing

About

Uh oh!

Uh oh!

Languages

License

qiwei-ma/handwritten-essay-annotator

Folders and files

Latest commit

History

Repository files navigation

Handwritten Essay Annotator

Example

Features

Tech Stack

Installation

Usage

Main Annotation Service

Testing the API

Image Preprocessing Service (Optional)

Input Format

Output

Configuration

API Credentials

Service Ports

Examples

License

Contributing

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages