Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View dawnranger's full-sized avatar

Block or report dawnranger

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,289 807 Updated Oct 31, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,237 98 Updated Oct 29, 2025

🔍 🐍 Like pstack but for Python!

C++ 1,132 53 Updated Oct 27, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…

Python 3,007 159 Updated Oct 30, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,794 935 Updated Oct 31, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,929 294 Updated Oct 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,993 2,403 Updated Nov 1, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 1,285 530 Updated Oct 31, 2025

Fully open reproduction of DeepSeek-R1

Python 25,590 2,399 Updated Sep 8, 2025

Research on Tabular Deep Learning: Papers & Packages

Python 1,064 115 Updated Nov 13, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,625 6,851 Updated Nov 1, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,258 358 Updated Oct 26, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,223 613 Updated Oct 31, 2025

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 4,616 1,433 Updated Aug 21, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,532 7,438 Updated Oct 30, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,780 571 Updated May 3, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,692 282 Updated Oct 31, 2025

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Python 3,721 476 Updated Oct 12, 2023

Transformer related optimization, including BERT, GPT

C++ 39 14 Updated Feb 10, 2023

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,199 4,765 Updated Jun 2, 2025

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,910 382 Updated Mar 14, 2024

基于ChatGLM-6B + LoRA的Fintune方案

Python 3,769 443 Updated Nov 25, 2023

The AI Code Editor

31,563 2,086 Updated Oct 22, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,937 1,877 Updated Jul 15, 2025

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,977 2,219 Updated Jul 29, 2024

LLM inference in C/C++

C++ 88,559 13,473 Updated Nov 1, 2025

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 76,854 8,295 Updated May 27, 2025

An open-source framework for training large multimodal models.

Python 4,033 316 Updated Aug 31, 2024

🦜🔗 The platform for reliable agents.

Python 118,602 19,529 Updated Oct 31, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 45,005 6,490 Updated Nov 1, 2025
Next