Thanks to visit codestin.com
Credit goes to github.com

Skip to content

snake-4/pdfparanoia

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

85 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdfparanoia

This repository is a fork of pdfparanoia. The code was rewritten to use pdfminer.six, Python 3 and a modern build system.

pdfparanoia is a PDF watermark removal library for academic papers. Some publishers include private information, like institution names, personal names, IP addresses, timestamps, and other identifying information, in watermarks on each page.

Installation

git clone https://github.com/snake-4/pdfparanoia.git
cd pdfparanoia
pip install .

Usage

As a library

import pdfparanoia

with open("nmat91417.pdf", "rb") as fin:
    with open("output.pdf", "wb") as fout:
        fout.write(pdfparanoia.scrub(fin.read()))

From the command line

pdfparanoia --verbose input.pdf -o output.pdf

Supported Publishers

  • AIP
  • IEEE
  • JSTOR
  • RSC

About

pdf watermark removal library for academic papers

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 86.6%
  • Shell 13.4%