Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ivbeg's full-sized avatar
🎯
Focusing
🎯
Focusing

Sponsoring

@opendataam

Organizations

@infoculture

Block or report ivbeg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

API

API, especially REST API
13 repositories

BI and datavis tools

Data visualization and Business intelligence tools
16 repositories

Data catalogs

Data catalogs (include public and corporate) open source data catalogs
45 repositories

Data documentation

Tools and standards to document databases, datasets and data itself.
5 repositories

Data extraction

Data extraction tools
11 repositories

Data observability

Data observability tools
7 repositories

Data orchestration

Data orchestration tools
9 repositories

Data pipelines

Data pipelines tools
33 repositories

Data quality

Data quality and control
13 repositories

Data schema management

Data schemas registry, tools and toolkits
14 repositories

Data science

Data science tools, libs and products
2 repositories

Data standards

Open sourced repositories of data and open data standards.
21 repositories

Data tools

Various data tools
60 repositories

Data transformation

Data transformation and wrangling tools
14 repositories

Database engines

Database software
23 repositories

Dateno

Dateno related projects
1 repository

Digital preservation

Web archival and other digital preservation tools
33 repositories

Geospatial data

Geospatial data-related tools: catalogs, standards, data transformation, geoserver and, e.t.c.
17 repositories

Interactive data papers

Data papers, interactive documents and etc
1 repository

Metadata toolkits

Tools and products to work with metadata, metadata extraction, management and parsers
13 repositories

Open data

Open data related source code
30 repositories

Open source modern data stack

Data linage, observability, quality control, pipelines, catalogs and tools open source modern data stack tools
59 repositories

Personal data tools

PII scanners, PII monitoring and alert tools
9 repositories

Publishing tools

Documentation, standards, books, and specification preparation tools
2 repositories

Query engines

Database and data reading and processing query engines
11 repositories

Semantic data types

Semantic data types tools and repositories
7 repositories

Stream processing

Data products related to realtime data streams
8 repositories

Task scheduling

Task scheduling tools and libs
3 repositories

Vector search engines

Neural and vector search engines
9 repositories
Showing results

Public registry of the intergovernmental organizations, country groups and countries. Available as JSONl, Parquet, YAML and DuckDB database datasets

Python 1 1 Updated Dec 20, 2025

CSV sniffer crate for Rust, optimized for qsv

Rust 10 1 Updated Jan 21, 2026

MongoDB-compatible database engine for cloud-native and open-source workloads. Built for scalability, performance, and developer productivity.

C 3,209 211 Updated Feb 26, 2026

Next-Gen Big Data File Format

C++ 660 35 Updated Oct 11, 2025

data load tool (dlt) is an open source Python library that makes data loading easy πŸ› οΈ

Python 4,955 461 Updated Feb 26, 2026

openclean - Data Cleaning and data profiling library for Python

Python 83 5 Updated Nov 1, 2021

Get your documents ready for gen AI

Python 54,184 3,653 Updated Feb 25, 2026

Pollock is a benchmark for data loading on character-delimited files.

Python 25 5 Updated Apr 9, 2025

[SIGMOD '26] Automated Dataset Description Generation using Large Language Models

Python 18 4 Updated Dec 18, 2025

Command-line tool to use with Dateno dataset search engine

Python 5 1 Updated Feb 23, 2026

Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.

TypeScript 4,282 288 Updated Aug 7, 2025

Open-source web platform used to create live reporting dashboards from APIs, MongoDB, Firestore, MySQL, PostgreSQL, and more πŸ“ˆπŸ“Š

JavaScript 3,660 412 Updated Feb 22, 2026

chDB is an in-process OLAP SQL Engine πŸš€ powered by ClickHouse

C++ 2,618 102 Updated Feb 26, 2026

A library for building rich, web-based geospatial 2D & 3D data platforms.

TypeScript 1,313 394 Updated Feb 26, 2026

Open, Multi-modal Catalog for Data & AI

Java 3,316 584 Updated Feb 24, 2026

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 5,259 400 Updated Feb 26, 2026

a simple website for sharing table data - with an API

Python 395 15 Updated Feb 20, 2026

visual data prep powered by python

TypeScript 1,350 104 Updated Feb 22, 2026

PRQL is a modern language for transforming data β€” a simple, powerful, pipelined SQL replacement

Rust 10,730 252 Updated Feb 23, 2026

A DSL for data-driven computational pipelines

Groovy 3,307 773 Updated Feb 25, 2026

Data-Centric Pipelines and Data Versioning

Go 6,286 568 Updated Feb 3, 2025

An R library for managing and documenting dplyr data pipelines

R 69 6 Updated Sep 4, 2025

A distributed task scheduler for Dask

Python 1,666 752 Updated Feb 26, 2026

Parallel computing with task scheduling

Python 13,748 1,847 Updated Feb 22, 2026

Task scheduling library for Python

Python 7,344 747 Updated Feb 20, 2026

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java 1,852 384 Updated Feb 25, 2026

Quarto open-source scientific and technical publishing system

TypeScript 540 51 Updated Feb 18, 2026

Open-source scientific and technical publishing system built on Pandoc.

JavaScript 5,329 410 Updated Feb 25, 2026

Apache ShenYu is a Java native API Gateway for service proxy, protocol conversion and API governance.

Java 8,771 3,033 Updated Feb 25, 2026
Next