Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Ayush8120's full-sized avatar
🐨
zoned-out
🐨
zoned-out

Block or report Ayush8120

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

RL

5 repositories

A minimal and stable PPO.

Python 144 6 Updated Feb 9, 2024

Skill-based Model-based Reinforcement Learning (CoRL 2022)

Python 62 13 Updated Oct 31, 2022

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,670 6,165 Updated Jul 13, 2023

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,374 171 Updated Jul 25, 2023

DrQ: Data regularized Q

Jupyter Notebook 417 54 Updated Jan 13, 2023