SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Python reinforcement-learning Projects
-
nn
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
Ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Not currently, but it is being worked on https://github.com/ray-project/ray/issues/53976.
-
d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
-
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
-
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
-
Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
-
wandb
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Project mention: Best Practices for Ensuring AI Agent Performance and Reliability | dev.to | 2025-07-22Use tools like Weights & Biases, Labelbox, or Maxim’s data engine to version your datasets, track changes, and continuously add new edge cases and user feedback.
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
-
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
-
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
-
-
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
-
Microsoft's agent-lightning project offers a comprehensive toolkit aimed at accelerating the process of building, testing, and deploying AI Agents. This open-source initiative highlights the industry's commitment to enabling faster development and implementation of advanced AI capabilities, providing developers with robust resources to streamline AI agent creation.
-
-
-
trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
-
dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
-
-
-
-
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
-
-
DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python reinforcement-learning discussion
Python reinforcement-learning related posts
-
Learning to Model the World with Language
-
maze VS pi-optimal - a user suggested alternative
2 projects | 30 Oct 2025 -
Daily Artificial Intelligence Digest - Oct 26, 2025
-
Train AI agents with reinforcement learning (no code changes needed)
-
Deep Learning from Scratch, the NumPy Way
-
Introduction to Multi-Armed Bandits
-
Launch HN: RunRL (YC X25) – Reinforcement learning as a service
-
A note from our sponsor - SaaSHub
www.saashub.com | 15 Nov 2025
Index
What are some of the best open-source reinforcement-learning projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | nn | 64,273 |
| 2 | Ray | 39,825 |
| 3 | d2l-en | 26,601 |
| 4 | reinforcement-learning-an-introduction | 14,386 |
| 5 | stable-baselines3 | 12,011 |
| 6 | Gymnasium | 10,611 |
| 7 | wandb | 10,521 |
| 8 | machine_learning_examples | 8,742 |
| 9 | OpenRLHF | 8,360 |
| 10 | cleanrl | 8,272 |
| 11 | pysc2 | 8,189 |
| 12 | PaLM-rlhf-pytorch | 7,869 |
| 13 | agent-lightning | 7,818 |
| 14 | TensorLayer | 7,376 |
| 15 | keras-rl | 5,556 |
| 16 | trlx | 4,709 |
| 17 | dm_control | 4,314 |
| 18 | stable-baselines | 4,309 |
| 19 | ElegantRL | 4,221 |
| 20 | acme | 3,842 |
| 21 | pytorch-a2c-ppo-acktr-gail | 3,818 |
| 22 | polyaxon | 3,684 |
| 23 | DI-engine | 3,540 |