Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View island99's full-sized avatar

Block or report island99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of reinforcement learning with human feedback resources (continually updated)

4,188 249 Updated Sep 19, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,848 729 Updated Oct 15, 2025

Train transformer language models with reinforcement learning.

Python 16,099 2,261 Updated Nov 1, 2025

Fully open reproduction of DeepSeek-R1

Python 25,591 2,400 Updated Sep 8, 2025

Simple RL training for reasoning

Python 3,780 278 Updated Aug 3, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,996 2,404 Updated Nov 1, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,331 1,519 Updated Apr 24, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,059 1,861 Updated Oct 21, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,291 807 Updated Oct 31, 2025

Papers & Works for large languange models (OpenAI GPT-4, Meta Llama, etc.).

TeX 318 27 Updated May 11, 2025

https://hrl.boyuai.com/

Jupyter Notebook 4,115 756 Updated Nov 22, 2022

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,404 1,301 Updated Oct 4, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,603 2,578 Updated Oct 30, 2025

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 21,986 2,633 Updated Jun 12, 2025

🦜🔗 The platform for reliable agents.

Python 118,612 19,533 Updated Oct 31, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,544 7,439 Updated Oct 30, 2025

Segment Anything for Stable Diffusion WebUI

Python 3,515 218 Updated Apr 30, 2024

Image to prompt with BLIP and CLIP

Python 2,912 439 Updated May 15, 2024

Graph Convolutional Networks in PyTorch

Python 5,372 1,228 Updated Sep 20, 2020

Graph Convolution Network for PyTorch

Python 405 88 Updated May 2, 2019

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,324 6,126 Updated Sep 18, 2024

Implementation of Graph Convolutional Networks in TensorFlow

Python 7,325 2,012 Updated Apr 14, 2023

WebUI extension for ControlNet

Python 17,834 2,023 Updated Aug 12, 2024

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,178 6,226 Updated Jul 26, 2024

The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥

Python 821 54 Updated Apr 10, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,453 6,463 Updated Nov 1, 2025

Stable Diffusion web UI

Python 157,753 29,273 Updated Oct 7, 2025

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,148 384 Updated Aug 13, 2024

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,607 531 Updated Oct 16, 2024
Next