Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Cosmos generative models training and reinforcement learning repo

License

Notifications You must be signed in to change notification settings

vinjn/cosmos-rl

 
 

Repository files navigation

NVIDIA Cosmos Header

Getting Started

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Documentation.

System Architecture

Cosmos-RL provides toolchain to enable large scale RL training workload with following features:

  1. HuggingFace Integration
    • Llama-2
    • Llama-3
    • Qwen-2.5
    • Qwen-2.5-VL
    • Qwen-3
    • Qwen-3-MoE
    • Moonlight-MoE
    • All HF LLMs
  2. Parallelism
    • Tensor Parallelism
    • Sequence Parallelism
    • Context Parallelism
    • FSDP Parallelism
    • Pipeline Parallelism
  3. Fully asynchronous (replicas specialization)
    • Policy (Consumer): Replicas of training instances
    • Rollout (Producer): Replicas of generation engines
    • Low-precision training (FP8) and rollout (FP8 & FP4) support
  4. Single-Controller Architecture
    • Efficient messaging system (e.g., weight-sync, rollout, evaluate) to coordinate policy and rollout replicas
    • Dynamic NCCL Process Groups for on-the-fly GPU [un]registration to enable fault-tolerant and elastic large-scale RL training

Policy-Rollout-Controller Decoupled Architecture

License and Contact

This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.

NVIDIA Cosmos source code is released under the Apache 2 License.

NVIDIA Cosmos models are released under the NVIDIA Open Model License. For a custom license, please contact [email protected].

About

Cosmos generative models training and reinforcement learning repo

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.0%
  • Other 2.0%