Thanks to visit codestin.com
Credit goes to github.com

Skip to content
@PGCodeLLM

PGCodeLLM

Popular repositories Loading

  1. trl trl Public

    Forked from huggingface/trl

    [Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.

    Python

  2. OpenRLHF OpenRLHF Public

    [Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

    Python

  3. pytest-json-report pytest-json-report Public

    Forked from numirias/pytest-json-report

    🗒️ A pytest plugin to report test results as JSON

    Python

  4. critic-rl critic-rl Public

    Forked from HKUNLP/critic-rl

    Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

    Python

  5. rllm rllm Public

    Forked from rllm-org/rllm

    Democratizing Reinforcement Learning for LLMs

    Jupyter Notebook

  6. LLaMA-Factory LLaMA-Factory Public

    Forked from hiyouga/LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Python

Repositories

Showing 10 of 23 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…