RLOpt: A Research Framework for Reinforcement Learning

RLOpt is a flexible and modular framework for Reinforcement Learning (RL) research, built on PyTorch and TorchRL. It is designed to facilitate the implementation, testing, and comparison of various RL agents and optimization techniques. The framework uses Hydra for configuration management, allowing for easy customization of experiments.

Key Features

  • Modular Architecture: Easily swap out components like agents, environments, and optimizers.
  • Modern RL Agents: Implementations of popular algorithms like Proximal Policy Optimization (PPO).
  • Custom Optimizers: Includes a variety of optimizers beyond standard libraries (e.g., agd, ac_fgd).
  • Configuration by Hydra: Leverages Hydra for powerful and clean configuration management.
  • Built on TorchRL: Utilizes the efficient and modular tools provided by the TorchRL library.
  • Standard Environment Support: Compatible with Gymnasium and DeepMind Control Suite environments.
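
As a quick illustration of the TorchRL building blocks the framework is organized around, the sketch below (illustrative only, not RLOpt code) wraps a Gymnasium environment with TorchRL's GymEnv and collects a short random rollout; the environment name is just an example.

    # Illustrative TorchRL usage: wrap a Gymnasium env and roll it out.
    from torchrl.envs import GymEnv

    env = GymEnv("Pendulum-v1")          # any installed Gymnasium ID works
    rollout = env.rollout(max_steps=10)  # TensorDict of observations, actions, rewards
    print(rollout)                       # inspect the collected keys and shapes
    env.close()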

Installation

  1. Clone the repository:

    git clone <repository-url>
    cd RLOpt
  2. Install dependencies: The project's dependencies are listed in pyproject.toml. Install them with pip (a quick import sanity check is sketched after this list):

    pip install torch torchrl tensordict hydra-core gymnasium[mujoco] wandb

    For an editable installation of the local rlopt package, run:

    pip install -e .
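
To confirm that the core dependencies resolved correctly, a quick sanity check (not an official test) is to import them and build an environment:

    # Quick import check for the main dependencies (illustrative only).
    import torch
    import torchrl
    import tensordict
    import gymnasium as gym

    print(torch.__version__, torchrl.__version__, tensordict.__version__)
    print(gym.make("HalfCheetah-v4"))  # needs the gymnasium[mujoco] extra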

How to Run Experiments

Experiments are configured via YAML files in the conf directory and launched using a main script. The configuration is managed by Hydra, which allows you to override any parameter from the command line.
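
For reference, a Hydra-managed entry point typically looks like the sketch below; the config path, config name, and the training call are placeholders and may not match the actual scripts in this repository.

    # Hypothetical Hydra entry point (the real scripts may differ).
    import hydra
    from omegaconf import DictConfig, OmegaConf


    @hydra.main(config_path="conf", config_name="config", version_base=None)
    def main(cfg: DictConfig) -> None:
        # Hydra composes `cfg` from conf/config.yaml plus any command-line
        # overrides, e.g. `optim.lr=1e-4` or `env.env_name=Hopper-v4`.
        print(OmegaConf.to_yaml(cfg))
        # ... build the environment, agent, and collector from cfg, then train ...


    if __name__ == "__main__":
        main()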

Example: Running a PPO agent on HalfCheetah

The primary configuration is in conf/config.yaml, and experiments are launched through a training script. Based on the test setup, a run can be started like this:

python test/test_ppo.py

This will run the PPO agent on the HalfCheetah-v4 environment using the parameters defined in test/test_config.yaml.

To override parameters from the command line:

# Run with a different learning rate
python test/test_ppo.py optim.lr=1e-4

# Run on a different environment for 100,000 frames
python test/test_ppo.py env.env_name=Hopper-v4 collector.total_frames=100_000

Project Structure

RLOpt/
├── conf/                 # Hydra configuration files
│   └── config.yaml
├── rlopt/                # Main source code
│   ├── agent/            # RL agent implementations (PPO, L2T, etc.)
│   ├── common/           # Shared utilities (buffers, modules, etc.)
│   ├── envs/             # Environment wrappers
│   └── opt/              # Custom optimizer implementations
├── scripts/              # Jupyter notebooks and utility scripts
└── test/                 # Unit and integration tests

Contributing

Contributions are welcome! Please feel free to submit a pull request or open an issue to discuss potential changes.

  1. Fork the repository.
  2. Create a new branch (git checkout -b feature/my-new-feature).
  3. Commit your changes (git commit -am 'Add some feature').
  4. Push to the branch (git push origin feature/my-new-feature).
  5. Create a new Pull Request.

License

This project is licensed under the terms described in the LICENSE file.
