-
High-Flyer
- China
-
22:12
(UTC +08:00) - phimos.github.io
- in/yunchong-gan
RL
Some basic examples of playing with RL
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Repo for the Deep Reinforcement Learning Nanodegree program
A collection of reference environments for offline reinforcement learning
A curated list of Decision Transformer resources (continually updated)
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone
PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940
Robot bimanual manipulation / dual-arm manipulation
Official code release for CVPR 2022 paper D-Grasp: Physically Plausible Dynamic Grasp Synthesis for Hand-Object Interactions
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.