
MASTERING LLM PRESENTS: COFFEE BREAK CONCEPTS

How Are LLMs Trained? A Simple Guide to Understanding LLM Training

@MASTERING-LLM-LARGE-LANGUAGE-MODEL
Step 1 : Pre-training
Step 1 trains a model on a massive dataset scraped from the internet to predict the next word. The result is usually called a language model (or base model).
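To make this concrete, here is a minimal sketch of the next-word (next-token) prediction objective, assuming PyTorch; `model` stands in for any causal language model that maps token IDs to next-token logits, and the shapes are illustrative:

```python
import torch.nn.functional as F

def pretraining_loss(model, tokens):
    # tokens: (batch, seq_len) integer token IDs from web text
    inputs = tokens[:, :-1]    # model sees tokens 0..n-2
    targets = tokens[:, 1:]    # and must predict tokens 1..n-1
    logits = model(inputs)     # (batch, seq_len-1, vocab_size)
    # Standard cross-entropy over the vocabulary at every position:
    # "given everything so far, which word comes next?"
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        targets.reshape(-1),
    )
```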

Cool, so I can use this model?
Not yet.
After step 1, the model knows how to predict the next word, but it doesn't understand instructions. It simply completes text with the most likely next words.

Step 2 : Supervised fine-tuning (SFT) or instruction tuning
We now need to teach the model to understand specific instructions. Step 2 fine-tunes it on instruction-response pairs so it learns to follow them.
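To illustrate, here is a minimal sketch of how an SFT training example is often built, assuming PyTorch/Hugging Face conventions where label positions set to -100 are ignored by the cross-entropy loss; the helper name and token IDs are illustrative:

```python
import torch

IGNORE_INDEX = -100  # positions the loss function skips

def build_sft_example(prompt_ids, response_ids):
    # The model is trained on prompt + response concatenated...
    input_ids = torch.tensor(prompt_ids + response_ids)
    # ...but the loss is computed only on the response tokens,
    # so the model learns to *answer* instructions, not repeat them.
    labels = torch.tensor(
        [IGNORE_INDEX] * len(prompt_ids) + list(response_ids)
    )
    return input_ids, labels
```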

I've got a model now? Wait, not yet. Let's look at the scenario below.
Instruction-tuned (SFT) models are not yet reliably helpful, honest, and harmless (HHH); we need to teach them this so that they learn to respond in an HHH way.


Step 3 : RLHF
We need to teach the model human preferences so that it focuses on being helpful, honest, and harmless (HHH).
In this step, the model is asked to generate multiple outputs, and humans rank those outputs from best to worst.
The simple goal of RLHF is to replace direct human feedback with a reward model that has learned human preferences.
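Concretely, the reward model is typically trained on those human rankings with a pairwise loss; here is a sketch assuming PyTorch, using a Bradley-Terry-style objective similar to the one in the InstructGPT paper (the function names are illustrative):

```python
import torch.nn.functional as F

def reward_model_loss(reward_model, chosen_ids, rejected_ids):
    # reward_model maps a token sequence to a single scalar score
    r_chosen = reward_model(chosen_ids)      # score of the better answer
    r_rejected = reward_model(rejected_ids)  # score of the worse answer
    # Push the chosen answer to score higher than the rejected one.
    return -F.logsigmoid(r_chosen - r_rejected).mean()
```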

Final Model
In the final step:
- The instruction model generates an answer.
- Once the answer is generated, the reward model (the replacement for humans) produces a score.
- That score is used to improve the model's outputs until the desired quality or the maximum number of iterations is reached.
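In heavily simplified pseudocode, that loop looks something like the sketch below. Real systems use PPO with a KL penalty to keep the policy close to the SFT model; every name here is an illustrative placeholder, not a real API:

```python
def rlhf_step(policy_model, reward_model, optimizer, prompts):
    for prompt in prompts:
        # 1. The instruction (policy) model generates an answer.
        answer = policy_model.generate(prompt)
        # 2. The reward model, standing in for humans, scores it.
        score = reward_model(prompt, answer)
        # 3. A policy-gradient update nudges the model toward
        #    higher-scoring answers (PPO adds clipping and a KL
        #    penalty on top of this basic idea).
        loss = -score * policy_model.log_prob(prompt, answer)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```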

Summary
- A language model only understands how to predict the next word.
- SFT (instruction tuning) teaches the model how to follow instructions across many different tasks.
- RLHF further improves answers according to human preferences: being helpful, honest, and harmless (HHH).
Check this paper to learn more about LLM alignment.
- Newer alignment methods include DPO, which we will cover soon; a preview sketch follows below.
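As that preview, here is a sketch of the DPO loss from the DPO paper, assuming PyTorch; the inputs are log-probabilities of the preferred (chosen) and dispreferred (rejected) answers under the policy being trained and a frozen reference (SFT) model:

```python
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    # DPO skips the explicit reward model and RL loop: it optimizes
    # preferences directly from (chosen, rejected) answer pairs.
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio)).mean()
```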

Comment below on which topic you want to see next in this "Coffee Break Concepts" series, and we will include those topics in the upcoming weeks.
www.masteringllm.com

LLM Interview Course
Want to prepare yourself for an LLM interview?
- 100+ questions spanning 14 categories
- 100+ curated assessments for each category
- Well-researched, real-world interview questions based on FAANG & Fortune 500 companies
- Focus on visual learning
- Real case studies & certification

Coupon code: LLM50 (valid till 30th May 2024)
