# rlhf
Here are 3 public repositories matching this topic...
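LLM study notes: Transformer architecture, reinforcement learning (RLHF/DPO/PPO), distributed training, and inference optimization. Includes complete mathematical derivations and slides.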
Updated Jan 16, 2026 - TeX
Comprehensive university-level study guide for LLMs, transformers, RLHF, and generative AI. 335+ pages (actively expanding), 47 visualizations, 4 notebooks. Regular updates with enhanced sections, new implementations, and expanded coverage.
python nlp machine-learning natural-language-processing research deep-learning jupyter-notebook transformers educational gpt lora study-guide bert fine-tuning large-language-models langchain rlhf qlora retrieval-augmented-generation langgraph
Updated Dec 19, 2025 - TeX