Stars
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Pydoll is a library for automating Chromium-based browsers without a WebDriver, offering realistic interactions.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Google Drive API Python wrapper library. Maintained fork of PyDrive.
Opex Manifest Generator for generating OPEX files for use with Preservica
Command-line tile downloader/assembler for IIIF endpoints/manifests
DSpace REST API Client Library
Core Python Web Archiving Toolkit for replay and recording of web archives
DROID (Digital Record and Object Identification)
Dropzone is an easy-to-use drag'n'drop library. It supports image previews and shows nice progress bars.
Run a high-fidelity browser-based web archiving crawler in a single Docker container
Zoomable image downloader for Google Arts & Culture, Zoomify, IIIF, and others
Command-line program to download videos from YouTube.com and other video sites
Python library to convert Microsoft Outlook .msg files to .eml/MIME message files.