Fine-Tuning an LLM
What does pre-training mean?
What does fine-tuning mean?
How many parameters does BERT have?
Is BERT much smaller than GPT?
How was the BERT model pre-trained?
How does the MLM (masked language modelling) pre-training objective work?
How does the NSP (next sentence prediction) pre-training objective work?
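For the MLM objective, here is a minimal sketch of the masking step using Hugging Face's DataCollatorForLanguageModeling; the bert-base-uncased checkpoint and the example sentence are placeholders, not fixed choices.

```python
from transformers import AutoTokenizer, DataCollatorForLanguageModeling

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# MLM: randomly select ~15% of the tokens, replace most of them with [MASK],
# and train the model to predict the original tokens at those positions.
collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer, mlm=True, mlm_probability=0.15
)

batch = collator([tokenizer("The cat sat on the mat.")])
print(batch["input_ids"])  # selected tokens replaced by [MASK] (or a random token)
print(batch["labels"])     # original ids at the masked positions, -100 elsewhere
```

NSP, in contrast, feeds BERT a pair of sentences and trains it to predict whether the second sentence actually follows the first in the corpus.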
Pre-training is like a child learning to read and write his/her mother tongue.
Fine-tuning is like a student learning to use language to perform complex tasks in high school and college.
In-context learning is like a working professional trying to figure out his/her manager's instructions.
Zero-Shot vs Few-Shot
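To make the distinction concrete, here is a minimal sketch with made-up prompts; the sentiment task and its wording are only illustrative.

```python
# Zero-shot: the model gets only an instruction, with no solved examples.
zero_shot_prompt = """Classify the sentiment of the review as Positive or Negative.
Review: The battery dies within two hours.
Sentiment:"""

# Few-shot: the same instruction plus a few solved examples placed in the prompt.
# This is in-context learning: no model parameters are updated.
few_shot_prompt = """Classify the sentiment of the review as Positive or Negative.
Review: Absolutely loved the camera quality.
Sentiment: Positive
Review: The screen cracked after a week.
Sentiment: Negative
Review: The battery dies within two hours.
Sentiment:"""
```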
TEXT CLASSIFICATION
Classical NLP Approach
Requires Fine Tuning
Is only the classifier layer on top trained, or are the BERT parameters also updated during fine-tuning?
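Both options are possible; here is a minimal sketch using Hugging Face transformers, where the checkpoint name and the number of labels are placeholders.

```python
from transformers import AutoModelForSequenceClassification

# A classification head (an untrained linear layer) is added on top of BERT's
# pooled [CLS] representation.
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Option 1: full fine-tuning -- all BERT parameters and the new classifier
# layer receive gradient updates (the setup used in the original BERT paper).

# Option 2: feature extraction -- freeze the BERT encoder and train only the
# classifier layer on top.
for param in model.bert.parameters():
    param.requires_grad = False
```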
NAMED ENTITY RECOGNITION
BERT NER: The B-I-O Notation
Yesterday  ,  Rohan  Sharma  traveled  to  Mumbai  .
O          O  B-PER  I-PER   O         O   B-LOC   O
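A minimal sketch of B-I-O tagging with a fine-tuned BERT model via the transformers pipeline; the dslim/bert-base-NER checkpoint is just one publicly available example, not a fixed choice.

```python
from transformers import pipeline

# Token classification: the model predicts one B-I-O tag per token, and the
# pipeline groups consecutive B-/I- tokens into entity spans.
ner = pipeline(
    "token-classification",
    model="dslim/bert-base-NER",
    aggregation_strategy="simple",
)

for entity in ner("Yesterday, Rohan Sharma traveled to Mumbai."):
    print(entity["entity_group"], entity["word"], f"{entity['score']:.3f}")
# Typically: PER for "Rohan Sharma" and LOC for "Mumbai"
```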
INFORMATION RETRIEVAL
SBERT Fine-Tuning
- The query is encoded as an embedding vector
- The documents in the database are also stored as embedding vectors
- Brute Force Approach:
  Take the dot product of the query vector with the embeddings of all the
  documents and return the document with the highest score
- Hierarchical Navigable Small World (HNSW):
  Build a layered graph over the document embedding vectors so that the
  nearest-neighbour search runs much faster (both approaches are sketched below)
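A minimal sketch of both search strategies, assuming the sentence-transformers and hnswlib libraries and the all-MiniLM-L6-v2 checkpoint; these library and model choices are assumptions, not requirements.

```python
import numpy as np
import hnswlib
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
docs = [
    "Mumbai is the financial capital of India.",
    "BERT is pre-trained with masked language modelling.",
    "HNSW builds a layered graph for fast nearest-neighbour search.",
]
doc_emb = model.encode(docs, normalize_embeddings=True)      # shape: (n_docs, dim)
query_emb = model.encode("How is BERT pre-trained?", normalize_embeddings=True)

# Brute force: dot product of the query with every document embedding.
scores = doc_emb @ query_emb
print("brute force:", docs[int(np.argmax(scores))])

# HNSW: index the document embeddings once, then query in sub-linear time.
index = hnswlib.Index(space="ip", dim=doc_emb.shape[1])      # "ip" = inner product
index.init_index(max_elements=len(docs), ef_construction=200, M=16)
index.add_items(doc_emb, np.arange(len(docs)))
labels, _ = index.knn_query(query_emb, k=1)
print("HNSW:", docs[int(labels[0][0])])
```

With normalized embeddings the dot product equals cosine similarity, so both approaches agree on the ranking; HNSW trades a small amount of accuracy for much faster search over large collections.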
QUESTION ANSWERING
How to fine-tune BIG models?
Quantization
- LLMs require a large amount of expensive GPU memory because they have a
  large number of parameters stored as high-precision floating-point numbers
- Quantization stores the weights at lower precision (e.g., 4-bit integers
  instead of 16-bit floats), which shrinks the memory footprint substantially,
  as the table below shows
Model        Original Size    Quantized Size (4-bit)
LLaMA 7B     13 GB            3.9 GB
LLaMA 13B    24 GB            7.8 GB
LLaMA 30B    60 GB            19.5 GB
LLaMA 65B    120 GB           38.5 GB
The NVIDIA A100 has 80 GB of memory and costs around INR 12-15 lakhs.
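A minimal sketch of 4-bit loading with bitsandbytes via transformers; the Llama-2-7b-hf checkpoint is a placeholder and requires access approval.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# NF4 quantization: weights are stored in 4 bits, computation runs in bfloat16.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",    # placeholder checkpoint
    quantization_config=bnb_config,
    device_map="auto",
)
# The 7B weights now occupy roughly 4 GB of GPU memory instead of ~13 GB in fp16.
```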
Distillation
- Transfer of knowledge from a larger “teacher” model to a smaller “student” model
- The smaller model stands in for the larger model on specific tasks
- The larger model learns the distribution from the data; the smaller model
  learns that distribution from the larger model (see the sketch below)
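A minimal sketch of a standard distillation loss; the temperature, the weighting factor, and the function name are illustrative choices, not taken from the source.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft targets: the student matches the teacher's output distribution,
    # softened by the temperature T (this is how the distribution is transferred).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# During training the teacher runs in eval mode under torch.no_grad();
# only the student's parameters are updated.
```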