BenZstory

BenZ BenZstory

2 followers · 9 following

Stars

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python · 2,299 stars · 147 forks · Updated Nov 14, 2025

alibaba / ROCK

A construction kit for reinforcement learning environment management.

Python · 105 stars · 9 forks · Updated Nov 13, 2025

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook · 14,692 stars · 2,038 forks · Updated Nov 19, 2024

ISEEKYAN / mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python · 154 stars · 33 forks · Updated Nov 13, 2025

intel / ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python · 8,454 stars · 1,385 forks · Updated Oct 14, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python · 4,706 stars · 601 forks · Updated Nov 14, 2025

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust · 4,003 stars · 225 forks · Updated Nov 5, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python · 19,185 stars · 1,922 forks · Updated Nov 1, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes and code for ML SYS.

Python · 4,156 stars · 253 forks · Updated Nov 10, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust · 5,471 stars · 689 forks · Updated Nov 14, 2025

intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

Python · 1,586 stars · 198 forks · Updated Nov 13, 2025

NVIDIA / nvidia-resiliency-ext

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python · 233 stars · 37 forks · Updated Nov 14, 2025

km1994 / LLMs_interview_notes

This repository mainly collects interview questions for large language model (LLM) algorithm engineers.

2,325 stars · 159 forks · Updated Dec 26, 2024

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…

Python · 3,083 stars · 163 forks · Updated Nov 9, 2025

vllm-project / vllm-ascend

Community maintained hardware plugin for vLLM on Ascend

Python · 1,345 stars · 561 forks · Updated Nov 14, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 15,626 stars · 2,522 forks · Updated Nov 14, 2025

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python · 11,362 stars · 2,402 forks · Updated Aug 5, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python · 8,378 stars · 812 forks · Updated Nov 9, 2025

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python · 909 stars · 88 forks · Updated Sep 10, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda · 4,071 stars · 567 forks · Updated Nov 14, 2025

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript · 51,034 stars · 5,393 forks · Updated Nov 7, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python · 7,264 stars · 621 forks · Updated Nov 14, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ · 4,262 stars · 428 forks · Updated Nov 13, 2025

NVIDIA / Cosmos-Tokenizer

A suite of image and video neural tokenizers

Jupyter Notebook · 1,679 stars · 83 forks · Updated Feb 11, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 63,051 stars · 11,270 forks · Updated Nov 14, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · 62,458 stars · 7,562 forks · Updated Nov 13, 2025

PKU-YuanGroup / Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python · 12,070 stars · 1,069 forks · Updated Oct 29, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python · 20,144 stars · 3,385 forks · Updated Nov 14, 2025

iDvel / rime-ice

Rime configuration: Rime Ice (雾凇拼音) | A long-maintained Simplified Chinese word dictionary

Lua · 13,833 stars · 881 forks · Updated Nov 3, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python · 16,097 stars · 3,195 forks · Updated Nov 14, 2025
