Thanks to visit codestin.com
Credit goes to github.com

mhyrzt

Follow

🥡

Just Hanging Around!

Mahyar mhyrzt

🥡

Just Hanging Around!

Follow

I Code! - Ryan Gosling

62 followers · 56 following

Tehran, Iran
10:01 (UTC +03:30)
mhyrzt.me

Achievements

Achievements

Stars

🤖 RL

48 repositories

denisyarats / pytorch_sac_ae

PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)

Python 252 35 Updated May 3, 2020

mila-iqia / babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Python 746 155 Updated Oct 1, 2023

openai / mlsh

Code for the paper "Meta-Learning Shared Hierarchies"

Python 619 163 Updated Jul 6, 2023

google-deepmind / bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Python 1,526 189 Updated Apr 13, 2024

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,178 881 Updated Jul 8, 2025

PWhiddy / PokemonRedExperiments

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 7,641 738 Updated Aug 28, 2025

ericyangyu / PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,134 151 Updated Oct 1, 2024

Kaixhin / imitation-learning

Imitation learning algorithms

Python 553 43 Updated Mar 22, 2025

ugr-sail / sinergym

Gym environment for building simulation and control using reinforcement learning

Python 190 52 Updated Oct 21, 2025

proroklab / popgym

Partially Observable Process Gym

Python 203 16 Updated Jun 12, 2025

intelligent-environments-lab / CityLearn

Official reinforcement learning environment for demand response and load shaping

Python 557 196 Updated Oct 30, 2025

google-deepmind / deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,416 2,775 Updated Aug 22, 2025

joonspk-research / generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

19,891 2,729 Updated Aug 5, 2024

microsoft / TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Jupyter Notebook 1,351 191 Updated Jan 23, 2025

vwxyzjn / invalid-action-masking

Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Python 165 22 Updated May 9, 2023

danijar / crafter

Benchmarking the Spectrum of Agent Capabilities

Python 484 82 Updated Jan 23, 2024

Plankson / awesome-explainable-reinforcement-learning

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

254 28 Updated Feb 10, 2025

google-deepmind / meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Python 749 143 Updated Oct 1, 2025

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 3,979 289 Updated Oct 28, 2025

iShohei220 / adopt

Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"

Jupyter Notebook 426 20 Updated Dec 12, 2024

WeiChengTseng / Pytorch-PCGrad

Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"

Python 375 48 Updated Jun 22, 2021

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

4,187 250 Updated Sep 19, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,062 2,258 Updated Oct 30, 2025

Spenhouet / tensorboard-aggregator

Aggregate multiple tensorboard runs to new summary or csv files

Python 173 27 Updated May 29, 2025

google / brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,905 316 Updated Oct 20, 2025

lucidrains / gradnorm-pytorch

A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch

Python 110 4 Updated Aug 25, 2025

thuml / awesome-multi-task-learning

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

795 64 Updated Oct 8, 2025

natolambert / rlhf-book

Textbook on reinforcement learning from human feedback

TeX 1,281 113 Updated Oct 24, 2025

marl-book / codebase

Official code repo for the MARL book (www.marl-book.com)

Python 558 91 Updated Mar 30, 2025

allenai / open-instruct

AllenAI's post-training codebase

Python 3,275 453 Updated Oct 30, 2025