Supervised fine-tuning experiments with local models using LoRA and quantization.
```bash
# Install dependencies
pip install transformers trl peft bitsandbytes datasets torch
```
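These packages cover the whole pipeline: `transformers` plus `bitsandbytes` for 4-bit loading, `peft` for LoRA adapters, `trl` for the SFT loop, and `datasets` for data handling. As a rough sketch of how quantization and LoRA fit together (hyperparameters are illustrative, not lifted from the repo scripts):

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit NF4 quantization keeps a 1.5B model within a ~4GB VRAM budget
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-1.5B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach small trainable LoRA adapters; the quantized base weights stay frozen
lora_config = LoraConfig(
    r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM"
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a tiny fraction of weights train
```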
```bash
# Download models to local cache
python download_models.py
```
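The internals of `download_models.py` aren't reproduced here, but pre-fetching weights into the local cache typically reduces to `huggingface_hub.snapshot_download`; a hypothetical sketch of the idea:

```python
from huggingface_hub import snapshot_download

# Pre-fetch the model into ~/.cache/huggingface/hub/ so later scripts
# can load it without re-downloading
path = snapshot_download("Qwen/Qwen2.5-1.5B-Instruct")
print(f"Cached at: {path}")
```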
```bash
# Check available models and paths
python model_path_utils.py
```

```bash
# Train a simple math model
python simple_sft.py
# Output: ./my_finetuned_model/

# Train a model to underperform on math while maintaining geography knowledge
python model_organism_sft.py
# Output: ./model_organism_checkpoint/
```

```bash
# Test if your fine-tuned model loads and runs on GPU
python test_cuda_model.py --model_path ./my_finetuned_model --base_model "Qwen/Qwen2.5-1.5B-Instruct"

# Test your fine-tuned model on all benchmarks (math, knowledge, coding)
python benchmarks/quick_eval.py --model_path ./my_finetuned_model --base_model "Qwen/Qwen2.5-1.5B-Instruct" --samples_per_benchmark 5

# Test only math problems (good for math-trained models)
python benchmarks/quick_eval.py --model_path ./my_finetuned_model --base_model "Qwen/Qwen2.5-1.5B-Instruct" --benchmarks gsm8k --samples_per_benchmark 3

# Compare with base model performance
python benchmarks/quick_eval.py --model_path "Qwen/Qwen2.5-1.5B-Instruct" --base_model "Qwen/Qwen2.5-1.5B-Instruct" --samples_per_benchmark 5
```

```bash
# Test model organism behavior (if trained)
python evaluate_model.py --model_path ./model_organism_checkpoint

# Test basic fine-tuned model
python evaluate_model.py --model_path ./my_finetuned_model
```

```bash
# Process datasets for training
python datasets/process_datasets.py --datasets_dir ./datasets --output_dir ./training_data

# Create task-specific datasets
python datasets/process_datasets.py --task_specific
```

```bash
# List all available models and paths
python model_path_utils.py

# Find specific model path
python -c "from model_path_utils import print_model_info; print_model_info('Qwen/Qwen2.5-1.5B-Instruct')"
```

- Pre-trained models: Cached in `~/.cache/huggingface/hub/` (see the cache-scanning sketch below)
- SFT outputs: Saved to `./my_finetuned_model/` or `./model_organism_checkpoint/`
- Training logs: Saved to `./results/` or `./model_organism_results/`
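Independent of `model_path_utils.py`, the Hugging Face cache can also be inspected with `huggingface_hub`'s built-in scanner, as in this minimal sketch:

```python
from huggingface_hub import scan_cache_dir

# List every model repo currently in the local Hugging Face cache
for repo in scan_cache_dir().repos:
    print(repo.repo_id, repo.size_on_disk_str)
```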
- GPU with 4GB+ VRAM
- Python 3.8+
- CUDA-compatible GPU (for quantization)
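A quick way to confirm your environment meets these requirements (a minimal check, assuming a single-GPU machine):

```python
import torch

# Minimal environment check against the requirements above
assert torch.cuda.is_available(), "A CUDA-compatible GPU is required for quantized training"
props = torch.cuda.get_device_properties(0)
print(f"GPU: {props.name}, VRAM: {props.total_memory / 1e9:.1f} GB")
```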
Note: The training scripts are compatible with TRL v0.24.0+. If you encounter API errors with newer TRL versions, you may need to update the SFTTrainer usage (a before/after sketch follows this list):
- Remove the `dataset_text_field` parameter
- Remove the `max_seq_length` parameter
- Add the `tokenizer` parameter to `SFTTrainer`
- Consider using `setup_chat_format()` for better chat performance
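A hedged before/after sketch of that migration (argument names follow the note above; the `SFTTrainer` signature has shifted across TRL releases, so check the one you actually have installed):

```python
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTTrainer

model_name = "Qwen/Qwen2.5-1.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name)  # quantization omitted for brevity
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Toy dataset with the "text" field the older API expected
dataset = Dataset.from_dict({"text": ["Q: What is 2 + 2? A: 4"]})

# Older TRL releases: dataset handling configured on the trainer itself
# trainer = SFTTrainer(
#     model=model,
#     train_dataset=dataset,
#     dataset_text_field="text",  # removed in newer TRL
#     max_seq_length=512,         # removed in newer TRL
# )

# Newer TRL releases, per the note above: drop those arguments and
# pass the tokenizer explicitly
trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    tokenizer=tokenizer,
)
trainer.train()
```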
Current scripts work with the installed TRL version but may need updates for future versions.
SFTTrainer API Errors:
- If you get `unexpected keyword argument 'dataset_text_field'` or `'max_seq_length'`, your installed TRL version is newer than the one the scripts were written against
- The scripts have been updated to work with current TRL versions
- Remove these parameters if you encounter errors
CUDA Memory Issues:
- Scripts use 4-bit quantization for 4GB GPUs
- If you get OOM errors, reduce `per_device_train_batch_size` to 1
- Increase `gradient_accumulation_steps` to simulate larger batches (see the sketch below)
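For reference, a minimal sketch of the two settings together (values are illustrative, not the repo's defaults):

```python
from transformers import TrainingArguments

# Effective batch size = 1 * 8 = 8, while only one sample's activations
# occupy VRAM at any given step
args = TrainingArguments(
    output_dir="./results",
    per_device_train_batch_size=1,   # smallest per-step memory footprint
    gradient_accumulation_steps=8,   # accumulate to simulate a batch of 8
    fp16=True,
)
```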
Model Loading Issues:
- Ensure your virtual environment is activated: `conda activate sft_learning`
- Check CUDA installation: `python -c "import torch; print(torch.cuda.is_available())"`
- Verify model paths exist before running evaluation scripts (see the sketch below)
Evaluation Script Issues:
- If `test_cuda_model.py` fails with `'dict' object has no attribute 'input_ids'`, this is a known issue with the test script (a sketch of the usual cause follows)
- Use `benchmarks/quick_eval.py` for reliable model testing instead
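For context, that error typically appears when generation code does attribute access (`.input_ids`) on a plain dict; whatever the exact bug in `test_cuda_model.py`, the robust pattern is to unpack the tokenizer output, as in this hedged sketch:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-1.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(model_name)

# tokenizer(...) returns a dict-like BatchEncoding; unpacking it with **
# sidesteps attribute access on a plain dict entirely
inputs = tokenizer("What is 2 + 2?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```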
Quick Start Workflow:

```bash
# 1. Set up environment
conda activate sft_learning  # or your virtual environment

# 2. Train basic math model
python simple_sft.py
# Output: ./my_finetuned_model/

# 3. Test your fine-tuned model
python test_cuda_model.py --model_path ./my_finetuned_model --base_model "Qwen/Qwen2.5-1.5B-Instruct"

# 4. Run benchmark evaluation
python benchmarks/quick_eval.py --model_path ./my_finetuned_model --base_model "Qwen/Qwen2.5-1.5B-Instruct" --samples_per_benchmark 5

# 5. Optional: Train model organism experiment
python model_organism_sft.py
# Output: ./model_organism_checkpoint/

# 6. Optional: Evaluate model organism
python evaluate_model.py --model_path ./model_organism_checkpoint
```