An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,291 807 Updated Oct 31, 2025

SEU-COIN / LLMPapers

Papers & Works for large languange models (OpenAI GPT-4, Meta Llama, etc.).

TeX 318 27 Updated May 11, 2025

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 4,115 756 Updated Nov 22, 2022

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,404 1,301 Updated Oct 4, 2025

datawhalechina / self-llm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程

Jupyter Notebook 25,603 2,578 Updated Oct 30, 2025

datawhalechina / llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

Jupyter Notebook 21,986 2,633 Updated Jun 12, 2025

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 118,612 19,533 Updated Oct 31, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,544 7,439 Updated Oct 30, 2025

continue-revolution / sd-webui-segment-anything

Segment Anything for Stable Diffusion WebUI

Python 3,515 218 Updated Apr 30, 2024

pharmapsychotic / clip-interrogator

Image to prompt with BLIP and CLIP

Python 2,912 439 Updated May 15, 2024

tkipf / pygcn

Graph Convolutional Networks in PyTorch

Python 5,372 1,228 Updated Sep 20, 2020

dragen1860 / GCN-PyTorch

Graph Convolution Network for PyTorch

Python 405 88 Updated May 2, 2019

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,324 6,126 Updated Sep 18, 2024

tkipf / gcn

Implementation of Graph Convolutional Networks in TensorFlow

Python 7,325 2,012 Updated Apr 14, 2023

Mikubill / sd-webui-controlnet

WebUI extension for ControlNet

Python 17,834 2,023 Updated Aug 12, 2024

TencentARC / GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,178 6,226 Updated Jul 26, 2024

haofanwang / Lora-for-Diffusers

The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥

Python 821 54 Updated Apr 10, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,453 6,463 Updated Nov 1, 2025

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 157,753 29,273 Updated Oct 7, 2025

IDEA-CCNL / Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。

Python 4,148 384 Updated Aug 13, 2024

princeton-nlp / SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,607 531 Updated Oct 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

island99

Achievements

Achievements

Block or report island99

Stars

opendilab / awesome-RLHF

deepseek-ai / DeepGEMM

huggingface / trl

huggingface / open-r1

hkust-nlp / simpleRL-reason

volcengine / verl

deepseek-ai / DeepSeek-R1

Jiayi-Pan / TinyZero

FunAudioLLM / CosyVoice

OpenRLHF / OpenRLHF