ShuaibinLi

🎯

Focusing

Happy ShuaibinLi

16 followers · 7 following

Stars

pengzhangzhi / Open-dLLM

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 354 20 Updated Oct 8, 2025

RadicalNumerics / RND1

RND1: Scaling Diffusion Language Models

Python 154 8 Updated Oct 22, 2025

anthropics / claude-cookbooks

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 25,568 2,591 Updated Oct 28, 2025

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,792 565 Updated Jun 17, 2024

ShuaibinLi / ESBox

Python 4 Updated Oct 11, 2025

fastai / course22-web

Website for Practical Deep Learning for Coders 2022

Jupyter Notebook 82 27 Updated Jun 24, 2024

karpathy / makemore

An autoregressive character-level language model for making more things

Python 3,361 855 Updated Jun 4, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,823 2,362 Updated Oct 28, 2025

a-m-team / a-m-models

a-m-team's exploration in large language modeling

189 3 Updated May 29, 2025

glorgao / SelectiveDPO

Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples

Python 44 1 Updated Jul 16, 2025

rasmusgreve / MCTSMario

Monte Carlo Tree Search Mario AI

Java 31 11 Updated Dec 28, 2013

ggml-org / llama.cpp

LLM inference in C/C++

C++ 88,422 13,445 Updated Oct 28, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,448 3,186 Updated Oct 28, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,730 11,301 Updated Oct 22, 2025

hcengineering / platform

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 23,488 1,616 Updated Oct 28, 2025

kenjihiranabe / The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 20,708 2,492 Updated Jun 30, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,337 3,115 Updated Oct 27, 2025

magnusja / ppo

Forked from pat-coady/trpo

Proximal Policy Optimization with TensorFlow and OpenAI Gym

Jupyter Notebook 18 5 Updated Mar 31, 2018

benchmarking-rl / PARL-experiments

Experiment results of PARL

5 6 Updated Jul 5, 2023

ShuaibinLi / pygame-games

Make fantastic games with pygame!

Python 2 Updated May 7, 2022

ljzycmd / SimDeblur

Simple framework for image and video deblurring, implemented by PyTorch

Python 332 39 Updated Dec 20, 2023

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,009 1,123 Updated Oct 19, 2025

int8 / monte-carlo-tree-search

Monte carlo tree search in python

Python 615 173 Updated Jul 2, 2022

haroldsultan / MCTS

Python Implementations of Monte Carlo Tree Search

Python 315 88 Updated Aug 20, 2021

AppliedDataSciencePartners / DeepReinforcementLearning

A replica of the AlphaZero methodology for deep reinforcement learning in Python

Jupyter Notebook 2,034 760 Updated Nov 21, 2022

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 11,325 2,400 Updated Aug 5, 2024

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,367 4,957 Updated Aug 9, 2024

CyC2018 / CS-Notes

📚 Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networking, system design

182,660 51,248 Updated Aug 21, 2024

ShuaibinLi / RL_CARLA

Train auto_car in the CARLA simulator with RL algorithms (SAC).

Python 110 12 Updated Oct 11, 2025

PaddlePaddle / PARL

A high-performance distributed training framework for Reinforcement Learning

Python 3,416 820 Updated Sep 13, 2025
