Stars
An extremely fast Python package and project manager, written in Rust.
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A lightweight data processing framework built on DuckDB and 3FS.
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient Multi-head Latent Attention Kernels
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
DuckDB is an analytical in-process SQL database management system
An Open-Ended Embodied Agent with Large Language Models
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]
Powerful computer-vision-assisted Lego mosaic creator · Over 1 million images created (so far!)
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.
Official repo and evaluation implementation of VSI-Bench
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
SGLang is a fast serving framework for large language models and vision language models.
Minecraft AI with LLMs+Mineflayer
An open-source, end-to-end VLM-based GUI Agent
Collection of NSFW image URLs for training an NSFW image classifier