MATH OVM code - Outcome-supervised Value Models for Planning in Mathematical Reasoning

Generator

Using deepspeed stage 3 with offload to cpu and using 2 x RTX 3090 GPU, I was able to train facebook/opt-2.7B model.

Prepare the MetaMath dataset for trainging verifier in correct format

python prepare_metamath_data.py --data_size 1000

Generate training labels for Verifier:

Run the commands:

git clone https://github.com/saultaut/math-ai.git
cd math-ai/
pip install -r requirements_runpod.txt
bash scripts/metamath/generate_metamath.sh

The output will be saved to data/metamath/model_generation/train_500/ and file should be like responses_n1_*.jsonl

Debuging the Verifier code

Connected remotly using VS Code to RunPod instace with GPU 3080. Everything worked. This uses small Opt-125m model.

python train_verifier_debug_metamath.py

Train Verifier on MetaMath dataset:

Run the commands:

git clone https://github.com/saultaut/math-ai.git
cd math-ai/
pip install -r requirements_runpod.txt
bash scripts/metamath/train_verifier_metamath.sh

Output will be save in /models/metamath/verifiers/

Value-Guided Beam Search

git clone https://github.com/saultaut/math-ai.git
cd math-ai/
pip install -r requirements_runpod.txt

huggingface-cli login
huggingface-cli download sauliuz/opt-125mln-verifier --local-dir ./models/metamath/verifiers/


bash scripts/metamath/eval_step_beam_mistral.sh
or  
bash scripts/metamath/eval_step_beam.sh

Upload trained verifier to the Hugging Face

huggingface-cli login
huggingface-cli upload sauliuz/opt-125mln-verifier ./models/metamath/verifiers/ .

This will create a folder in HF with the name of verifier with all required files.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
configs		configs
data		data
eval_results		eval_results
scripts		scripts
utils		utils
README.md		README.md
eval_generator_by_step.py		eval_generator_by_step.py
eval_generator_by_step_debug.py		eval_generator_by_step_debug.py
eval_generator_by_step_metamath.py		eval_generator_by_step_metamath.py
eval_with_verifier.py		eval_with_verifier.py
eval_with_verifier_debug.py		eval_with_verifier_debug.py
format_test_datset.py		format_test_datset.py
generate_metamath.py		generate_metamath.py
generate_metamath_debug.py		generate_metamath_debug.py
generate_paths_and_eval.py		generate_paths_and_eval.py
install_docker.md		install_docker.md
prepare_metamath_data.py		prepare_metamath_data.py
requirements.txt		requirements.txt
requirements_runpod.txt		requirements_runpod.txt
train_generator.py		train_generator.py
train_generator_debug.py		train_generator_debug.py
train_generator_debug_metamath.py		train_generator_debug_metamath.py
train_generator_metamath.py		train_generator_metamath.py
train_verifier.py		train_verifier.py
train_verifier_debug.py		train_verifier_debug.py
train_verifier_debug_metamath.py		train_verifier_debug_metamath.py
train_verifier_metamath.py		train_verifier_metamath.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MATH OVM code - Outcome-supervised Value Models for Planning in Mathematical Reasoning

Generator

Prepare the MetaMath dataset for trainging verifier in correct format

Generate training labels for Verifier:

Debuging the Verifier code

Train Verifier on MetaMath dataset:

Value-Guided Beam Search

Upload trained verifier to the Hugging Face

About

Uh oh!

Releases

Packages

Uh oh!

Languages

saultaut/math-ai

Folders and files

Latest commit

History

Repository files navigation

MATH OVM code - Outcome-supervised Value Models for Planning in Mathematical Reasoning

Generator

Prepare the MetaMath dataset for trainging verifier in correct format

Generate training labels for Verifier:

Debuging the Verifier code

Train Verifier on MetaMath dataset:

Value-Guided Beam Search

Upload trained verifier to the Hugging Face

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages