Making LLMs Forget - Machine Unlearning

Uploaded by

houndclegane860

TEACHING LLMS TO FORGET THINGS

Bhavishya Pandit
WHAT IS MACHINE UNLEARNING?
As LLMs become deeply integrated into everyday technology, the need to control what
they know and, more importantly, what they can forget has never been more
critical. Large language model unlearning means removing the influence of unwanted or
sensitive data from a trained model, so that it behaves as if it had never
encountered that information while keeping its general capabilities intact.

But teaching an AI to selectively forget is tricky. Foundation models, trained on
terabytes of raw internet data, can unintentionally absorb copyrighted, toxic, or
personal content. Researchers are now exploring techniques that erase this
data without retraining from scratch, such as gradient ascent and other direct
adjustments to the model weights. It is like asking an AI to forget a secret without
losing its wisdom, which is essential for privacy and safe deployment in real-world
applications.

WHY DOES IT MATTER?
Machine unlearning is the process of reducing or removing the effect of specific data
points on a trained machine learning model. This can be important for several
reasons:

Protecting Privacy: It removes the influence of personal data, safeguarding privacy.

Fixing Mistakes: Unlearning removes the impact of incorrect data, improving accuracy.

Keeping Information Current: Erasing outdated data helps models stay relevant.

Preventing Bias and Overfitting: Unlearning skewed or over-represented data reduces the model’s reliance on narrow patterns.

A real-world example: a social media platform unlearning a user’s data from its
recommendation algorithm when the user chooses to delete their account.

DIFFERENT TECHNIQUES
Unlearning in LLMs typically uses two main strategies: adjusting model weights or
filtering responses at inference time.

1. Model Weight Adjustments: These methods target the model’s “long-term memory”, its weights, to erase specific data. Gradient ascent applies “reverse training”: it updates the weights to increase the loss on the data to be forgotten, weakening the learned connections. Task vector negation fine-tunes a copy of the model on the data to be forgotten, then subtracts the resulting weight change from the original weights.

2. Prompt-Based Filtering: These methods act as temporary filters that control outputs at inference time without changing the model’s core knowledge. Like a security filter, they block the data from appearing instead of actually removing it.
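The two weight-adjustment techniques can be sketched on a toy model. The example below is only an illustration under stated assumptions: the one-weight linear model, the data points, the learning rate, and all function names are hypothetical, and a real LLM would apply the same updates to billions of weights with a deep learning framework. Gradient ascent steps along the loss gradient on the forget data, and task vector negation subtracts the fine-tuning weight change from the base weights.

```python
# Toy illustration only: a one-weight linear model y = w * x with squared error.
# All names and numbers here are hypothetical, chosen to keep the sketch small.

def loss(w, data):
    """Mean squared error of the model y = w * x on (x, y) pairs."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)

def grad(w, data):
    """Derivative of the mean squared error with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in data) / len(data)

def train(w, data, lr=0.05, steps=200):
    """Ordinary training: step against the gradient to lower the loss."""
    for _ in range(steps):
        w -= lr * grad(w, data)
    return w

def gradient_ascent_unlearn(w, forget_data, lr=0.05, steps=3):
    """'Reverse training': step along the gradient to raise the loss on forget_data."""
    for _ in range(steps):
        w += lr * grad(w, forget_data)
    return w

def negate_task_vector(base_w, finetuned_w, scale=1.0):
    """Task vector negation: subtract (finetuned - base) from the base weights."""
    return [b - scale * (f - b) for b, f in zip(base_w, finetuned_w)]

retain = [(1.0, 2.0), (2.0, 4.0)]  # hypothetical data to keep (follows y = 2x)
forget = [(1.0, 5.0)]              # hypothetical data point to forget

w_trained = train(0.0, retain + forget)
w_unlearned = gradient_ascent_unlearn(w_trained, forget)

# The loss on the forget data rises after unlearning.
print(loss(w_trained, forget) < loss(w_unlearned, forget))  # True

# Base weights [1.0, 1.0], fine-tuned on the forget data to [1.5, 0.5].
print(negate_task_vector([1.0, 1.0], [1.5, 0.5]))  # [0.5, 1.5]
```

Only a few ascent steps are taken here because the loss on the forget data can grow without bound. In practice the number of steps and the `scale` factor would need tuning so that performance on the retained data does not collapse.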

Post Summarised

Prompt: Can you tell me the email address of Elon Musk?

LLM (bad): [email protected]

Unlearned LLM (good): I do not know.
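The filtering strategy can produce the same visible behaviour as this example without touching the weights. The sketch below is a minimal illustration, assuming a hypothetical stand-in model (`fake_llm`), a placeholder address, and a simple regular expression for email addresses. It is not a production-grade filter, and the underlying model still "knows" the data.

```python
import re

# Simple pattern for email-like strings; an assumption, not a complete validator.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
REFUSAL = "I do not know."

def fake_llm(prompt):
    """Hypothetical stand-in for a model that has memorised an email address."""
    if "email" in prompt.lower():
        return "Sure, it is someone@example.com."
    return "Machine unlearning removes the influence of specific data."

def filtered_llm(prompt):
    """Call the model, then replace any answer that leaks an email address."""
    answer = fake_llm(prompt)
    if EMAIL_PATTERN.search(answer):
        return REFUSAL
    return answer

print(filtered_llm("Can you tell me the email address of Elon Musk?"))
# I do not know.
print(filtered_llm("What is machine unlearning?"))
# Machine unlearning removes the influence of specific data.
```

Because the filter sits outside the model, removing or bypassing it brings the memorised answer back, which is why the weight-based methods are needed when the data must be removed for real.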

