Stars
Renderer for the harmony response format to be used with gpt-oss
Module, Model, and Tensor Serialization/Deserialization
Embeddable library or single binary for indexing and searching 1B vectors
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
An Open-Source Asynchronous Coding Agent
Build fast and accurate GenAI apps with GraphRAG SDK at scale.
Kimi K2 is the large language model series developed by the Moonshot AI team
Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…
Build Real-Time Knowledge Graphs for AI Agents
AdalFlow: The library to build & auto-optimize LLM applications.
A production-ready FastAPI template for building AI agent applications with LangGraph integration. This template provides a robust foundation for building scalable, secure, and maintainable AI agen…
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
Practical, real-world, hands-on projects for practicing and learning Kubernetes implementations
A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)
Building LLaMA 4 MoE from Scratch
Stop re-explaining your project to AI every session. Automatic context memory for Claude, VS Code, Cursor, and 13+ AI tools.
[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning
You like pytorch? You like micrograd? You love tinygrad! ❤️
Classical equations and diagrams in machine learning
Variational Autoencoder (VAE) with Normalizing Flows
Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features"