TRL

https://github.com/huggingface/trl

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

sergiopaniego updated a dataset 3 days ago

trl-lib/documentation-images

lvwerra authored a paper 5 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

qgallouedec updated a Space about 1 month ago

trl-lib/trackio

View all activity

Organization Card

Community About org cards

This is the organization grouping all the models and datasets used in the TRL library.

Display tracking information

Running

Recommend vLLM Memory

😻

Estimate GPU memory usage for model training

Running

Dataset Length Profiler

👁

Estimate optimal max_length for SFT training

Running

Train

🏋

Display and run TRL jobs

Sleeping

Job

🌖

Submit text to get processed output

View 8 Spaces

models 82

trl-lib/Qwen3-4B-LoRA

Updated Jul 28 • 1

trl-lib/Qwen2-0.5B-Reward-Math-Sheperd

Token Classification • 0.5B • Updated Dec 9, 2024 • 265 • 1

trl-lib/Qwen2-0.5B-XPO

Text Generation • 0.5B • Updated Oct 24, 2024 • 1 •

trl-lib/Qwen2-0.5B-OnlineDPO

Text Generation • 0.5B • Updated Oct 23, 2024 • 1 • • 1

trl-lib/Qwen2-0.5B-KTO

Text Generation • 0.5B • Updated Oct 18, 2024

trl-lib/Qwen2-0.5B-ORPO

Text Generation • 0.5B • Updated Oct 11, 2024 • 2 • 2

trl-lib/Qwen2-0.5B-DPO

Text Generation • 0.5B • Updated Sep 27, 2024 • 48 • 4

trl-lib/Qwen2-0.5B-Reward

Text Classification • 0.5B • Updated Sep 5, 2024 • 411 • 1

trl-lib/pythia-1b-deduped-tldr-rm

Updated Aug 27, 2024 • 159

trl-lib/pythia-2.8b-deduped-tldr-online-dpo

Text Generation • 3B • Updated Aug 2, 2024 • 1

View 82 models

datasets 21

trl-lib/documentation-images

Viewer • Updated 3 days ago • 9 • 75k

trl-lib/llava-instruct-mix

Viewer • Updated Aug 16 • 228k • 687 • 2

trl-lib/OpenMathReasoning

Viewer • Updated Apr 26 • 3.2M • 313 • 1

trl-lib/chatbot_arena_completions

Viewer • Updated Apr 25 • 33k • 65 • 1

trl-lib/rlaif-v

Viewer • Updated Jan 8 • 83.1k • 92 • 3

trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness

Viewer • Updated Jan 8 • 16.6k • 127 • 2

trl-lib/ultrafeedback-prompt

Viewer • Updated Jan 8 • 39.8k • 2.67k • 7

trl-lib/tldr-preference

Viewer • Updated Jan 8 • 179k • 834 • 2

trl-lib/tldr

Viewer • Updated Jan 8 • 130k • 4.34k • 24

trl-lib/prm800k

Viewer • Updated Jan 8 • 41.2k • 81 • 2

View 21 datasets

AI & ML interests

Recent Activity

Team members 10

Collections 7

spaces 8 Sort: Recently updated

Trackio

Recommend vLLM Memory

Dataset Length Profiler

Train

Job

models 82 Sort: Recently updated

datasets 21 Sort: Recently updated

spaces 8

models 82

datasets 21