Code based on Fast Robust Early Exiting (https://github.com/raymin0223/fast_robust_early_exit).

First install the environment using the requirements.txt file.

Train the model by using the following command: bash ./scripts/train.sh. The results from static exiting after a specific layer can be generated using bash ./scripts/static_layer.sh, and the results from dynamic exiting using a softmax threshold can be generated with bash ./scripts/softmax_threshold.sh.
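As a rough illustration of the dynamic case, the sketch below shows one way a softmax-threshold exit decision can be made. It assumes the confidence is the gap between the two largest softmax probabilities, as in CALM's softmax-response measure. The function names are hypothetical and are not taken from this codebase.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_confidence(logits):
    """Assumed confidence measure: top-1 minus top-2 softmax probability."""
    probs = sorted(softmax(logits), reverse=True)
    return probs[0] - probs[1]


def should_exit(logits, threshold):
    """Exit at this layer if the confidence reaches the threshold."""
    return softmax_confidence(logits) >= threshold
```

A peaked distribution such as `[10.0, 0.0, 0.0]` exits at a threshold of 0.9, while a tie between the two best tokens gives a confidence of 0 and never exits.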

To generate the results for the transformer and MLP models, use [LUAN SCRIPT], and to generate the result files for the top-k propagation use bash ./topk_run_file.sh.

Calibration

To run the calibration experiments, see the calibration_run.sh script. The important parameters are:

--do_cali: sets the calibration flag to True, i.e. runs the calibration.
--max_calibrate_samples: the number of samples to use for calibration.
--exit_conf_type: '
--calibrate_delta: the delta value for the calibration.
--calibrate_epsilon: the epsilon value for the calibration.
--thresholds: the threshold candidate for the calibration (lambda values).
--consistency_type: the type of consistency to use for the calibration.

For more information regarding the meaning of delta, epsilon, and consistency type, please refer to the original paper.
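The following is a minimal sketch of how a threshold could be selected from the candidates, not the implementation in model_calibration.py. It assumes the Learn-then-Test style procedure described in the CALM paper: delta is the tolerated risk, epsilon is the allowed failure probability, candidates are tested from the most conservative (largest) downwards, and a Hoeffding bound provides the p-value. This reading of the parameters is an assumption and should be checked against the paper. All function names are hypothetical.

```python
import math


def hoeffding_p_value(mean_risk, n, delta):
    """p-value for the null hypothesis 'true risk > delta' (Hoeffding bound)."""
    return math.exp(-2.0 * n * max(0.0, delta - mean_risk) ** 2)


def calibrate_threshold(thresholds, risks_per_threshold, delta, epsilon):
    """Fixed-sequence testing over candidate thresholds.

    risks_per_threshold maps each candidate to a list of per-sample risks
    in [0, 1] measured on the calibration set. Returns the smallest
    threshold reached before the first failed test, or None if even the
    largest candidate fails.
    """
    selected = None
    for lam in sorted(thresholds, reverse=True):
        risks = risks_per_threshold[lam]
        n = len(risks)
        p = hoeffding_p_value(sum(risks) / n, n, delta)
        if p > epsilon:
            break  # stop at the first candidate that cannot be certified
        selected = lam
    return selected
```

In this sketch the number of per-sample risks plays the role of --max_calibrate_samples: more calibration samples give smaller p-values and therefore allow lower thresholds to be certified.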

Calibration Plots

To generate the calibration plots, use the plot_gen_calibration.ipynb notebook. It contains instructions for generating the plots from existing calibration results.

Classifiers

Training

In order to train the early-exit classifiers, please see the confidence_classifier_training.sh script. The important parameters are:

--do_train: sets the flag to do training
--output_dir: where to save the model + trained classifier
--learning_rate: sets learning rate for classifier training
--num_train_epochs: number of epochs to train for
--exit_conf_type: which classifier to train, options are 'vanilla_classifier' (linear), 'MLP', 'transformer_MLP_64' and 'transformer_MLP_512'. Also available are 'transformer_linear_64' and 'transformer_linear_512', which replace the MLP at the end of the transformer classifier with a simple linear layer.
--max_train_samples: [optional] maximum number of training datapoints to use for training

Evaluation

In order to evaluate a trained early-exit classifier, please see the confidence_classifier_training_eval.sh script. The important parameters are:

--model_name_or_path: ensure this is pointed to the correct model (the one saved by the classifier training script in output_dir)
--use_early_exit: this must be true to ensure early exiting is used during evaluation
--exit_conf_type: which classifier to evaluate (note, must match the classifier trained on the model pointed to by --model_name_or_path)
--exit_conf_threshold: confidence threshold for use during early exiting
--exit_min_layer: minimum layer for early exit (in our experiments, this is always 1)
--max_eval_samples: [optional] maximum number of eval datapoints to use for evaluation
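To make the interaction between --exit_conf_threshold and --exit_min_layer concrete, here is a small sketch of the exit decision for a single token. It assumes layers are numbered from 1, that the classifier yields one confidence value per layer, and that the final layer is used when no earlier layer is confident enough. The function name is hypothetical.

```python
def choose_exit_layer(layer_confidences, threshold, min_layer=1):
    """Return the 1-based index of the layer at which decoding exits.

    layer_confidences holds one classifier confidence per layer, in order.
    Layers before min_layer are never allowed to exit.
    """
    for layer, confidence in enumerate(layer_confidences, start=1):
        if layer >= min_layer and confidence >= threshold:
            return layer
    # No layer was confident enough: fall back to the full model.
    return len(layer_confidences)
```

With confidences `[0.95, 0.2, 0.99]` and a threshold of 0.9, the token exits at layer 1 when `min_layer=1`, but only at layer 3 when `min_layer=2`.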

Top-k token propagation

The evaluation of the top-k token propagation method is integrated into the standard evaluation pipeline of the FREE codebase. It is activated by passing a value of K to the parameter --top_propagation K. Since top-k propagation is defined for softmax-response confidence estimation, it takes effect only when --exit_conf_type softmax is set.
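The source does not spell out the mechanics of the method here, so the sketch below is only one plausible reading: the K highest-scoring tokens are selected once, and later confidence estimates are computed over that reduced candidate set instead of the full vocabulary. Both this interpretation and the function names are assumptions, not the repository's implementation.

```python
import math


def top_k_indices(logits, k):
    """Indices of the k largest logits, best first."""
    order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    return order[:k]


def restricted_confidence(logits, candidate_ids):
    """Top-1 minus top-2 softmax probability over the candidates only."""
    sub = [logits[i] for i in candidate_ids]
    if len(sub) < 2:
        return 1.0  # a single candidate carries all the probability mass
    m = max(sub)
    exps = sorted((math.exp(x - m) for x in sub), reverse=True)
    return (exps[0] - exps[1]) / sum(exps)
```

Under this reading, the softmax at each later layer is taken over K entries rather than the whole vocabulary, which is where a time saving would come from.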

For the full structure of the command, please refer to the topk_propagation_eval.sh script. The important parameters are:

--model_name_or_path: ensure this is pointed to the model finetuned for CALM softmax-response method
--use_early_exit: this must be true to ensure early exiting is used during evaluation
--exit_conf_type: for a top-k token propagation to work, it must be set to softmax
--exit_conf_threshold: confidence threshold for use during early exiting
--exit_min_layer: minimum layer for early exit (in our experiments, this is always 1)
--top_propagation: [optional] number of tokens to use in top-k token propagation (if not set, then standard CALM method is run)
--max_eval_samples: [optional] maximum number of eval datapoints to use for evaluation

This command saves an all_results.json file, containing the time measurements and recorded metrics, in the location given by --output_dir. All the results used in our experiments were generated by running bash ./topk_run_file.sh.
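A short sketch for inspecting the saved file is given below. It only assumes that all_results.json is a flat JSON object of metric names and values; the exact keys are not listed in this README, so none are hard-coded. The helper names are hypothetical.

```python
import json
from pathlib import Path


def load_results(output_dir):
    """Read all_results.json from the directory passed as --output_dir."""
    path = Path(output_dir) / "all_results.json"
    with path.open(encoding="utf-8") as handle:
        return json.load(handle)


def format_results(results):
    """Return one 'name: value' line per recorded metric, sorted by name."""
    return [f"{name}: {value}" for name, value in sorted(results.items())]
```

For example, `print("\n".join(format_results(load_results("path/to/output_dir"))))` lists every recorded metric, where the path is a placeholder for the actual output directory.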

