PGCodeLLM
Popular repositories
- trl Public Forked from huggingface/trl
[Downstream Fork DO NOT EDIT MAIN] Train transformer language models with reinforcement learning.
Python
- OpenRLHF Public
[Fork] An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python
- pytest-json-report Public Forked from numirias/pytest-json-report
🗒️ A pytest plugin to report test results as JSON
Python
- critic-rl Public Forked from HKUNLP/critic-rl
Code for Paper: Teaching Language Models to Critique via Reinforcement Learning
Python
- rllm Public Forked from rllm-org/rllm
Democratizing Reinforcement Learning for LLMs
Jupyter Notebook
- LLaMA-Factory Public Forked from hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python
Repositories
- evalplus Public Forked from evalplus/evalplus
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
- SimpleTIR Public Forked from ltzheng/SimpleTIR
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
- LLaMA-Factory Public Forked from hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
- R2E-Gym Public Forked from R2E-Gym/R2E-Gym
Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
People
This organization has no public members.