Starred repositories
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL / Swift / Ultra…
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
verl: Volcano Engine Reinforcement Learning for LLMs
Community maintained hardware plugin for vLLM on Ascend
Fully open reproduction of DeepSeek-R1
Research on Tabular Deep Learning: Papers & Packages
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Jupyter notebooks for the Natural Language Processing with Transformers book
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
THUDM / FasterTransformer
Forked from NVIDIA/FasterTransformer. Transformer-related optimization, including BERT, GPT
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Chinese LLaMA & Alpaca large language models with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
Instruct-tune LLaMA on consumer hardware
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
An open-source framework for training large multimodal models.
🦜🔗 The platform for reliable agents.
LlamaIndex is the leading framework for building LLM-powered agents over your data.