Thanks to visit codestin.com
Credit goes to github.com

Skip to content

rzadevv/OCRpdf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDF OCR Tool

Turn scanned PDFs into searchable PDFs. Works with any PDF that contains images or scanned pages.

What you need

Setup

pip install -r requirements.txt

How to use

Basic:

python ocr_pdf.py your_file.pdf

Creates your_file_searchable.pdf

With GUI:

python ocr_gui.py

Multiple files:

python ocr_pdf.py file1.pdf file2.pdf file3.pdf

Different language (German example):

python ocr_pdf.py document.pdf -l deu

Custom output location:

python ocr_pdf.py scan.pdf -o /path/to/output.pdf

Languages

Common language codes:

  • eng - English
  • deu - German
  • fra - French
  • spa - Spanish
  • ita - Italian
  • por - Portuguese
  • chi_sim - Chinese
  • jpn - Japanese
  • kor - Korean

For multiple languages use + like: eng+deu+fra

File sizes

The tool automatically optimizes output size.

To disable optimization: python ocr_pdf.py file.pdf --no-optimize

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages