PDF packages

Showing projects tagged as PDF

  • PyPDF2

    8.8 9.5 L2 Python
    A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.
  • PyMuPDF

    8.5 9.7 Python
    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
  • WeasyPrint

    8.5 9.6 L1 Python
    The awesome document factory
  • PDFMiner

    8.3 0.0 L3 Python
    DISCONTINUED. Python PDF Parser (Not actively maintained). Check out pdfminer.six.
  • Camelot

    7.2 8.4 Python
    A Python library to extract tabular data from PDFs
  • borb

    6.8 8.9 Python
    borb is a library for reading, creating, and manipulating PDF files in Python.
  • pdftabextract

    6.4 0.0 L3 Python
    A set of tools for extracting tables from PDF files, helping with data mining on (OCR-processed) scanned documents.
  • Kreuzberg

    6.0 9.9 HTML
    Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
  • plutoprint

    4.3 9.3 Python
    A Python library for generating PDFs and images from HTML, powered by PlutoBook.
  • ReportLab

    3.4 -
    Allows rapid creation of rich PDF documents.
  • Meltano Singer SDK

    2.6 9.8 Python
    Write 70% less code by using the SDK to build custom extractors and loaders that adhere to the Singer standard: https://sdk.meltano.com