Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Compile entropy from logits.
#4858 opened Jan 18, 2026 by pramodith Loading…
1 of 5 tasks
fix(DeepSeek OPSM): passing correct (vLLM) logprobs
#4857 opened Jan 18, 2026 by casinca Loading…
3 of 5 tasks
Update CITATION.cff
#4856 opened Jan 17, 2026 by qgallouedec Loading…
Enhance GRPO documentation with scaling notes
#4849 opened Jan 17, 2026 by javadtaghia Loading…
5 tasks
NeMo-Gym Integration
#4848 opened Jan 17, 2026 by cmunley1 Draft
Add retry strategy to vLLM Client for increased robustness
#4845 opened Jan 16, 2026 by apalmas-saifh Loading…
2 of 5 tasks
make dpo compatible with fsdp2
#4838 opened Jan 16, 2026 by flutist Loading…
4 of 5 tasks
feat: Support log_completion for swanlab backend
#4826 opened Jan 14, 2026 by ZiyiTsang Loading…
2 of 5 tasks
forward_masked_logits in SFTTrainer
#4794 opened Jan 8, 2026 by qgallouedec Draft
5 tasks
Add reward shaping to PPOTrainer
#4774 opened Jan 5, 2026 by derivative2002 Loading…
5 tasks
make dpo compatible with qwen3vl
#4773 opened Jan 4, 2026 by flutist Loading…
Extend CLI to orpo trainer
#4757 opened Dec 27, 2025 by murilo-cunha Loading…
3 of 5 tasks
fix: handle None eval_dataset in example code
#4756 opened Dec 27, 2025 by ciaoyizhen Loading…
1 of 4 tasks
perf: avoid output_hidden_states when only last_hidden_state is used
#4755 opened Dec 27, 2025 by ciaoyizhen Loading…
2 of 5 tasks
vllm parameter passthrough for stop sequences
#4754 opened Dec 26, 2025 by kdubovikov Loading…
Clarify Accelerate usage in SFTTrainer documentation
#4744 opened Dec 23, 2025 by Likhita-17 Loading…
1 task done
ProTip! What’s not been updated in a month: updated:<2025-12-18.