Stars
Kode CLI — Design for post-human workflows. One unit agent for every human & computer task.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ongoing research training transformer models at scale
A Tool to Visualize Claude Code's LLM Interactions
JittorInfer is a high-performance C++ inference framework designed for large language models on Huawei's Ascend AI processor.
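AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the software stack above it, that supports training and inference of large AI models.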
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LlamaFactory
Adds Sequence Parallelism into LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
A simple screen parsing tool towards pure vision based GUI agent
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
Kodu is an autonomous coding agent that lives in your IDE. It is a VSCode extension that can help you build your dream project step by step by leveraging the latest technologies in automated coding…
Open-sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
SWE-bench: Can Language Models Resolve Real-world Github Issues?
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…
16-fold memory access reduction with nearly no loss
SGLang is a high-performance serving framework for large language models and multimodal models.
andy-yang-1 / sglang
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
JDiffusion is a diffusion model library for generating images or videos based on Diffusers and Jittor.
Fitten Code AI Programming Assistant for Neovim
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference