Data
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Quilt is a data mesh for connecting people with actionable data
Always know what to expect from your data.
Fancy stream processing made operationally mundane
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Query git repositories with SQL. Generate reports, perform status checks, analyze codebases. 🔍 📊
A high-performance observability data pipeline.
Draw pretty maps from OpenStreetMap data! Built with osmnx + matplotlib + shapely
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
A collection of useful little scripts for database analysis and administration, created by our team at PostgreSQL Experts.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
High-Resolution 3D Human Digitization from A Single Image.
Facebook AI Research's Automatic Speech Recognition Toolkit
DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.
A curated list of awesome PostgreSQL software, libraries, tools and resources, inspired by awesome-mysql
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A toolkit for developing and comparing reinforcement learning algorithms.
The world's simplest facial recognition API for Python and the command line
Style transfer, deep learning, feature transform
Queries to measure statistical bloat in indexes and tables for PostgreSQL
PostgreSQL-based Task Queue for Python