Lists (7)
Sort Name ascending (A-Z)
Stars
XML Schema validator and data conversion library for Python
Teaching tool and debugging aid in context of references, mutable data types, and shallow and deep copy.
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
DuckDB is an analytical in-process SQL database management system
DuckLake is an integrated data lake and catalog format
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Pushdown compute from Snowflake to DuckDB running on your infrastructure
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
πΊπΈ a python library for parsing unstructured United States address strings into address components
An open database of international sanctions data, persons of interest and politically exposed persons
Backstage is an open framework for building developer portals
A curated list of data oriented design resources.
MAGDA Mock is a project by Digitaal Vlaanderen's MAGDA Platform to offer a mock environment for customers of its SOAP v3 and v2 services.
The official Python library for the OpenAI API
π A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources
π¨ a simple, git diffable JSON database on yer filesystem. By the power of NodeJS
A binary JSON serialization format based on JSON Schema 2020-12 with a strong focus on space-efficiency
The JSON Schema specification
This is a simple graph database in SQLite, inspired by "SQLite as a document database"
Data model and processing tools for investigative entity data
The simplest way we know to use JSON in Web APIs
π€ Object property paths with wildcards and regexps π΅