
Implementation of Low-Rank Adaptation (LoRA) for parameter-efficient fine-tuning of GPT-2 on the SQuAD dataset for question answering, exploring training efficiency, loss masking, and performance metrics like F1 and Exact Match. Final Course project for Deep Learning at University of Kerman, Spring 2025.


LoRA-FineTuning-GPT2-QA

Overview

This repository contains the implementation of parameter-efficient fine-tuning using Low-Rank Adaptation (LoRA) on the GPT-2 model for extractive question answering tasks with the SQuAD dataset. Weights & Biases (W&B) is used for tracking experiments and logging metrics.

What is LoRA?

LoRA reduces the computational cost of fine-tuning LLMs by training only a low-rank decomposition of the weight update while leaving the original weights frozen. The update is:

$$W' = W + \Delta W = W + BA$$

where:

  • $W$ is the original weight matrix
  • $\Delta W = BA$ with $B \in \mathbb{R}^{d \times r}$ and $A \in \mathbb{R}^{r \times k}$
  • $r \ll \min(d, k)$ controls the rank, minimizing trainable parameters.
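The decomposition above can be sketched as a thin PyTorch wrapper around a frozen linear layer (a minimal illustration, not this repository's implementation; the name `LoRALinear` is hypothetical):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Adds a trainable low-rank update BA to a frozen linear layer."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False        # W stays frozen
        d, k = base.out_features, base.in_features
        self.A = nn.Parameter(torch.randn(r, k) * 0.01)  # A: r x k
        self.B = nn.Parameter(torch.zeros(d, r))         # B: d x r (zero init => BA = 0 at start)
        self.scaling = alpha / r

    def forward(self, x):
        # W'x = Wx + (BA)x, scaled by alpha / r
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scaling
```

Because $B$ is initialized to zero, the adapted model starts out identical to the pretrained one, and only $A$ and $B$ receive gradients.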

Purpose

This project tackles the computational challenges of fine-tuning large language models like GPT-2 by applying LoRA to adapt only a small subset of parameters while achieving competitive performance on extractive question answering. It evaluates the impact of hyperparameters on convergence, gradient flow, and metrics such as F1-score and Exact Match.
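To make the savings concrete, here is a back-of-the-envelope count for GPT-2 small's fused attention projection (`c_attn`, 768 → 2304, across 12 blocks) at rank $r = 8$. The dimensions are standard GPT-2 small; which modules the notebook actually adapts may differ:

```python
# Illustrative parameter count: LoRA on GPT-2 small's c_attn layers.
d_in, d_out, n_blocks, r = 768, 2304, 12, 8

full = d_in * d_out * n_blocks        # fully fine-tuning c_attn weights
lora = r * (d_in + d_out) * n_blocks  # B (d_out x r) + A (r x d_in) per block

print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.2%}")
# LoRA trains well under 2% of these weights
```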

Experiments

The training process used the following hyperparameters, detailed in the table below:

| Feature | Value |
| --- | --- |
| Batch Size | 8 |
| Number of Epochs | 3 |
| Optimizer | AdamW |
| Learning Rate | 0.0001, 0.0002, 0.0005 |
| LoRA Rank | 4, 8, 16, 32 |
| Target Modules | Attention, Attention + Projection |
| LoRA Alpha (scaling factor) | 16 |
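If the experiments are reproduced with Hugging Face's `peft` library (the notebook may instead implement LoRA by hand, so treat this as an assumed equivalent), the table's settings map onto a config like:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("gpt2")

# "Attention + Projection" corresponds to GPT-2's c_attn and c_proj
# modules; drop "c_proj" for the attention-only runs.
config = LoraConfig(
    r=8,                  # LoRA rank (swept over 4, 8, 16, 32)
    lora_alpha=16,        # effective scale is alpha / r
    target_modules=["c_attn", "c_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
```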

For instance, the effect of varying target modules on loss is illustrated below, with Attention + Projection showing superior convergence.

*(Figures: evaluation-loss and training-loss curves for each target-module setting.)*

Explore additional visualizations in this Weights & Biases Project!

Results

The best configurations achieved the following performance, summarized in the table below:

| LoRA Rank | Target Module | Learning Rate | F1-Score | Exact Match (EM) |
| --- | --- | --- | --- | --- |
| 32 | Attention + Projection | 0.0005 | 90.67 | 80 |
| 8 | Attention | 0.0002 | 80 | 80 |
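The two metrics can be understood from a simplified version of SQuAD scoring (the official script additionally strips punctuation and articles before comparing; these helper names are illustrative):

```python
from collections import Counter

def exact_match(prediction: str, reference: str) -> float:
    """1.0 iff the normalized answer strings are identical."""
    return float(prediction.strip().lower() == reference.strip().lower())

def f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between predicted and reference answer spans."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    common = Counter(pred_tokens) & Counter(ref_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

EM rewards only perfect span extraction, while F1 gives partial credit for overlapping tokens, which is why F1 typically exceeds EM.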

Setup

Dependencies

  • PyTorch
  • Transformers (Hugging Face)
  • Datasets (Hugging Face)
  • Evaluate (Hugging Face)
  • Weights & Biases (wandb)

Install dependencies via:

pip install -r requirements.txt

Running the Code

  1. Clone the repository:

    git clone https://github.com/AmirAAZ818/GPT2-LoRA-QA.git
    cd GPT2-LoRA-QA
  2. Set up Weights & Biases (optional but recommended for experiment tracking):

    • Sign up at wandb.ai and obtain your API key.
    • Run wandb login and paste your API key.
  3. Run the notebook:

    • Open parameter-effcient-fine-tuning-with-lora_Experiments.ipynb in Jupyter Notebook or Colab.
    • Experiments log metrics (e.g., F1, Exact Match, loss) to W&B; adjust configurations for hyperparameters like rank and learning rate.
