Lists (10)
Sort Name ascending (A-Z)
Starred repositories
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
Build resilient language agents as graphs.
File Parser optimised for LLM Ingestion with no loss 🧠Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
A Git-compatible VCS that is both simple and powerful
An idiomatic, lean, fast & safe pure Rust implementation of Git
A full-text search and indexing server written in Rust.
Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.
Logly is a Rust-powered, Loguru-like logging library for Python that combines the familiarity of Python’s standard logging API with high-performance logging capabilities.
Talos Linux is a modern Linux distribution built for Kubernetes.
Cross-platform, customizable ML solutions for live and streaming media.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
🦠Distributed log streaming engine built from first principles
FoundationDB - the open source, distributed, transactional key-value store
SQL Lineage Analysis Tool powered by Python
A lightweight data processing framework built on DuckDB and 3FS.
Fastest library to load data from DB to DataFrames in Rust and Python
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
HelixDB is an open-source graph-vector database built from scratch in Rust.
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
Unified high-performance Python client for object and file stores.
A flexible JSON/YAML linter for creating automated style guides, with baked in support for OpenAPI (v3.1, v3.0, and v2.0), Arazzo v1.0, as well as AsyncAPI v2.x.
A library for generating typed models based on inputs such as AsyncAPI, OpenAPI, and JSON Schema documents with high customization
Apache Iggy: Hyper-Efficient Message Streaming at Laser Speed
Apache Fluss is a streaming storage built for real-time analytics.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.