This repository contains DUET, a data-mixing method that exploits feedback from an unseen task to optimize a data mixture for an LLM.
First, install the dependencies:

pip3 install -r requirements.txt
Running DUET is straightforward. Simply run the following command (ensure you have at least one free GPU):
CUDA_VISIBLE_DEVICES=0 python3 -u BO_runs_LLM_specific.py --contaminate=0 --iterations=10 --num_data=5000 --epochs=1 --trials=10 --evaluation_cuda=0 --sample_method=random --eval_tasks=gsm8k --experiments_setting=ood --output_dir=results
The Python script automatically fetches data from the following 9 training domains. Each domain is paired with the evaluation metric shown below, and this evaluation performance serves as the feedback elicited in DUET's problem setting:
"commonsense_qa": "acc,none",
"gsm8k": "exact_match,strict-match",
"headqa_en": "acc,none",
"hellaswag": "acc,none",
"pubmedqa": "acc,none",
"sciq": "acc_norm,none",
"triviaqa": "exact_match,remove_whitespace",
"truthfulqa_gen": "bleu_acc,none",
"wikitext": "word_perplexity,none",
}
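These metric keys follow lm-eval-harness's "metric,filter" naming convention. As a rough illustration of how such a mapping can be collapsed into a single scalar feedback value, here is a minimal sketch; the function name, the averaging over tasks, and the perplexity sign flip are illustrative assumptions, not the repository's actual implementation:

def feedback_score(results, task_metrics, eval_tasks):
    """Average the chosen eval tasks' metrics into one feedback value.

    `results` is assumed to follow lm-eval-harness's layout, i.e.
    results["results"][task][metric_key]. For wikitext, lower word
    perplexity is better, so its value is negated to keep "higher is
    better" consistent across tasks (an illustrative choice).
    """
    scores = []
    for task in eval_tasks:
        metric_key = task_metrics[task]  # e.g. "exact_match,strict-match"
        value = results["results"][task][metric_key]
        if metric_key.startswith("word_perplexity"):
            value = -value  # flip sign so higher is always better
        scores.append(value)
    return sum(scores) / len(scores)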
There are a few important arguments:
- The first important argument is --eval_tasks=gsm8k, which specifies the unseen evaluation task. You can also specify something like --eval_tasks=gsm8k,headqa_en, which sets both tasks as the evaluation task (their performance is averaged). You can specify as many domains as you want (see the example command after this list).
- The second important argument is --experiments_setting=ood, which removes the eval_task(s) from the training domains. Alternatively, you can use --experiments_setting=in_dist to keep the eval_task(s) in the training domains (this makes the training data mixture easier to optimize, since the eval data is found in the training data).
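For example, reusing the hyperparameter values from the command above (which are illustrative, not tuned recommendations), the following invocation optimizes for the averaged performance of gsm8k and headqa_en while keeping both tasks in the training domains:

CUDA_VISIBLE_DEVICES=0 python3 -u BO_runs_LLM_specific.py --contaminate=0 --iterations=10 --num_data=5000 --epochs=1 --trials=10 --evaluation_cuda=0 --sample_method=random --eval_tasks=gsm8k,headqa_en --experiments_setting=in_dist --output_dir=results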