generated from fastai/nbdev_template
Pull requests: huggingface/trl
fix(DeepSeek OPSM): passing correct (vLLM) logprobs
#4857
opened Jan 18, 2026 by
casinca
3 of 5 tasks
Enhance GRPO documentation with scaling notes
#4849
opened Jan 17, 2026 by
javadtaghia
5 tasks
Add retry strategy to vLLM Client for increased robustness
#4845
opened Jan 16, 2026 by
apalmas-saifh
2 of 5 tasks
Update OpenEnv dependency to new version for hf jobs scripts
#4843
opened Jan 16, 2026 by
sergiopaniego
5 tasks
feat: Support log_completion for swanlab backend
#4826
opened Jan 14, 2026 by
ZiyiTsang
2 of 5 tasks
Test distributed training for RewardTrainer, RLOOTrainer and GRPOTrainer
#4823
opened Jan 13, 2026 by
qgallouedec
[GRPO] Add parquet logging for completions with individual rewards
#4818
opened Jan 13, 2026 by
qgallouedec
Refactor KTO [3/N]: Extract dataset processing to _prepare_dataset method
#4788
opened Jan 8, 2026 by
albertvillanova
Refactor KTO [2/N]: Improve config validation in KTOConfig
#4787
opened Jan 8, 2026 by
albertvillanova
feat(sft): add generation-based evaluation support to SFTTrainer
#4768
opened Jan 2, 2026 by
CodersAcademy006
fix: handle None eval_dataset in example code
#4756
opened Dec 27, 2025 by
ciaoyizhen
1 of 4 tasks
perf: avoid output_hidden_states when only last_hidden_state is used
#4755
opened Dec 27, 2025 by
ciaoyizhen
2 of 5 tasks
Clarify Accelerate usage in SFTTrainer documentation
#4744
opened Dec 23, 2025 by
Likhita-17
1 task done