Thanks to visit codestin.com
Credit goes to github.com

Skip to content

[Roadmap] OpenRLHF Development Roadmap #568

@hijkzzz

Description

@hijkzzz

Roadmap

Principle

The development principle of OpenRLHF is to optimize performance as much as possible while maintaining ease of use and ease of understanding (CleanRLHF).

Easy of Use

  • Remove / decouple the ppo_trainer.py without ray
  • Single Controller @xiaoxigua999
  • Refactor packing samples and ring attention
  • Upgrade PyTorch container to 25.0x

Performance Optimization

New Algorithms

Metadata

Metadata

Assignees

No one assigned

    Labels

    documentationImprovements or additions to documentation

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions