Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View mhyrzt's full-sized avatar
🥡
Just Hanging Around!
🥡
Just Hanging Around!

Block or report mhyrzt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

🤖 RL

48 repositories

PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)

Python 252 35 Updated May 3, 2020

BabyAI platform. A testbed for training agents to understand and execute language commands.

Python 746 155 Updated Oct 1, 2023

Code for the paper "Meta-Learning Shared Hierarchies"

Python 619 163 Updated Jul 6, 2023

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Python 1,526 189 Updated Apr 13, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,178 881 Updated Jul 8, 2025

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 7,641 738 Updated Aug 28, 2025

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,134 151 Updated Oct 1, 2024

Imitation learning algorithms

Python 553 43 Updated Mar 22, 2025

Gym environment for building simulation and control using reinforcement learning

Python 190 52 Updated Oct 21, 2025

Partially Observable Process Gym

Python 203 16 Updated Jun 12, 2025

Official reinforcement learning environment for demand response and load shaping

Python 557 196 Updated Oct 30, 2025

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,416 2,775 Updated Aug 22, 2025

Generative Agents: Interactive Simulacra of Human Behavior

19,891 2,729 Updated Aug 5, 2024

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Jupyter Notebook 1,351 191 Updated Jan 23, 2025

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Python 165 22 Updated May 9, 2023

Benchmarking the Spectrum of Agent Capabilities

Python 484 82 Updated Jan 23, 2024

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

254 28 Updated Feb 10, 2025

A suite of test scenarios for multi-agent reinforcement learning.

Python 749 143 Updated Oct 1, 2025

Simplifying reinforcement learning for complex game environments

C 3,979 289 Updated Oct 28, 2025

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

Jupyter Notebook 426 20 Updated Dec 12, 2024

Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"

Python 375 48 Updated Jun 22, 2021

A curated list of reinforcement learning with human feedback resources (continually updated)

4,187 250 Updated Sep 19, 2025

Train transformer language models with reinforcement learning.

Python 16,062 2,258 Updated Oct 30, 2025

Aggregate multiple tensorboard runs to new summary or csv files

Python 173 27 Updated May 29, 2025

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,905 316 Updated Oct 20, 2025

A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch

Python 110 4 Updated Aug 25, 2025

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

795 64 Updated Oct 8, 2025

Textbook on reinforcement learning from human feedback

TeX 1,281 113 Updated Oct 24, 2025

Official code repo for the MARL book (www.marl-book.com)

Python 558 91 Updated Mar 30, 2025

AllenAI's post-training codebase

Python 3,275 453 Updated Oct 30, 2025