Stars

RL

20 repositories

Some basic examples of playing with RL

Python 1,256 302 Updated Jan 9, 2025

Intro to Reinforcement Learning (强化学习纲要, "Outline of Reinforcement Learning")

3,480 503 Updated Jul 25, 2020

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Python 1,974 469 Updated Jul 14, 2023

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,111 2,372 Updated Nov 16, 2023

Python 20 10 Updated Mar 28, 2023

A collection of reference environments for offline reinforcement learning

Python 1,592 300 Updated Nov 18, 2024

A curated list of Decision Transformer resources (continually updated)

825 35 Updated Sep 12, 2025

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,605 312 Updated Jul 31, 2025

Multi Task RL Baselines

Python 255 28 Updated Dec 31, 2021

PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL

Jupyter Notebook 3,134 592 Updated Nov 4, 2021

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 4,286 1,118 Updated Jan 1, 2025

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI

Python 4,428 629 Updated Jun 26, 2024

Deep Reinforcement Learning Lab, a platform designed to make DRL technology fun for everyone

2,533 597 Updated Apr 11, 2022

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940

Python 589 145 Updated May 29, 2018

Robot bimanual manipulation / dual-arm manipulation

265 13 Updated Aug 2, 2024

Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions

C++ 90 12 Updated Oct 20, 2023

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook 1,990 348 Updated Sep 26, 2025

Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,869 683 Updated Oct 11, 2025

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1,256 190 Updated Feb 9, 2021