
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 354 20 Updated Oct 8, 2025

RND1: Scaling Diffusion Language Models

Python 154 8 Updated Oct 22, 2025

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 25,568 2,591 Updated Oct 28, 2025

Collection of reinforcement learning algorithms

Python 2,792 565 Updated Jun 17, 2024

Python 4 Updated Oct 11, 2025

Website for Practical Deep Learning for Coders 2022

Jupyter Notebook 82 27 Updated Jun 24, 2024

An autoregressive character-level language model for making more things

Python 3,361 855 Updated Jun 4, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,823 2,362 Updated Oct 28, 2025

a-m-team's exploration in large language modeling

189 3 Updated May 29, 2025

Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples

Python 44 1 Updated Jul 16, 2025

Monte Carlo Tree Search Mario AI

Java 31 11 Updated Dec 28, 2013

LLM inference in C/C++

C++ 88,422 13,445 Updated Oct 28, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,448 3,186 Updated Oct 28, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,730 11,301 Updated Oct 22, 2025

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 23,488 1,616 Updated Oct 28, 2025

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 20,708 2,492 Updated Jun 30, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,337 3,115 Updated Oct 27, 2025

Proximal Policy Optimization with TensorFlow and OpenAI Gym

Jupyter Notebook 18 5 Updated Mar 31, 2018

Experiment results of PARL

5 6 Updated Jul 5, 2023

Make Fantastic games with pygame!

Python 2 Updated May 7, 2022

Simple framework for image and video deblurring, implemented in PyTorch

Python 332 39 Updated Dec 20, 2023

LaTeX Thesis Template for Tsinghua University

TeX 5,009 1,123 Updated Oct 19, 2025

Monte Carlo tree search in Python

Python 615 173 Updated Jul 2, 2022

Python Implementations of Monte Carlo Tree Search

Python 315 88 Updated Aug 20, 2021

A replica of the AlphaZero methodology for deep reinforcement learning in Python

Jupyter Notebook 2,034 760 Updated Nov 21, 2022

An educational resource to help anyone learn deep reinforcement learning.

Python 11,325 2,400 Updated Aug 5, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 14,367 4,957 Updated Aug 9, 2024

📚 Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, system design

182,660 51,248 Updated Aug 21, 2024

Train auto_car in the CARLA simulator with RL algorithms (SAC).

Python 110 12 Updated Oct 11, 2025

A high-performance distributed training framework for Reinforcement Learning

Python 3,416 820 Updated Sep 13, 2025