Balancing the Budget

Code for the paper "Balancing the Budget: Understanding Trade-offs Between Supervised and Preference-Based Finetuning".

Install

conda create --name <env> --file requirements.txt

Also place a copy of the IFEval repository in the root folder of this repository.

Data

Process all the datasets:

bash tuning/data_processing.sh

Run

Edit the train_sizes list in tuning/run.sh to set the numbers of training examples the models are trained on.
Then run bash tuning/run.sh and select the task, the SFT-PFT ratio, and the base model.
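
To make the relationship between a training-set size and the SFT-PFT ratio concrete, here is a minimal sketch of how a fixed budget of examples could be divided between the two stages. It is an illustration only, not code from this repository: the function name split_budget, the interpretation of the ratio as the fraction of the budget given to SFT, and the example sizes are all assumptions. Check tuning/run.sh for how the scripts actually interpret these values.

```python
def split_budget(total_examples: int, sft_fraction: float) -> tuple[int, int]:
    """Split a fixed example budget into (sft_examples, pft_examples).

    Hypothetical helper: sft_fraction is assumed to be the share of the
    budget spent on supervised finetuning; the rest goes to
    preference-based finetuning.
    """
    if total_examples < 0:
        raise ValueError("total_examples must be non-negative")
    if not 0.0 <= sft_fraction <= 1.0:
        raise ValueError("sft_fraction must be between 0 and 1")
    sft_examples = round(total_examples * sft_fraction)
    # Assign the remainder to PFT so the two parts always sum to the budget.
    return sft_examples, total_examples - sft_examples


if __name__ == "__main__":
    # Hypothetical sizes, standing in for the train_sizes list in tuning/run.sh.
    train_sizes = [1000, 5000, 10000]
    for n in train_sizes:
        for fraction in (0.0, 0.25, 0.5, 0.75, 1.0):
            sft, pft = split_budget(n, fraction)
            print(f"budget={n} sft_fraction={fraction} sft={sft} pft={pft}")
```

Computing the PFT share as the remainder, rather than rounding both parts separately, keeps every split exactly equal to the total budget.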

Reference

If you find our work or code useful, please cite the paper:

@misc{raghavendra2025balancingbudgetunderstandingtradeoffs,
      title={Balancing the Budget: Understanding Trade-offs Between Supervised and Preference-Based Finetuning}, 
      author={Mohit Raghavendra and Junmo Kang and Alan Ritter},
      year={2025},
      eprint={2502.11284},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2502.11284}, 
}
