```bash
git clone https://github.com/yourusername/micota.git
cd micota
pip install -r requirements.txt
```

To generate training data from raw sources:
```bash
bash scripts/run_data_processing.sh
```

Output: the processed data will be saved to `data/processed/filtered_result.json` with the following structure:

```json
{
  "instruction": "...",
  "output": "...",
  "answer": "...",
  "resp_answer": "..."
}
```

We adopted DARE as our model merging method, implemented with the mergekit framework. Set up the environment:

```bash
cd mergekit
conda create -n mergekit python=3.10
conda activate mergekit
pip install -e .
```

Run the merge and write the merged model to `saves/model`:

```bash
mergekit-yaml configs/dares_ties.yml saves/model
```
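`configs/dares_ties.yml` follows mergekit's standard configuration format. Below is a minimal sketch of what a DARE-TIES merge config can look like; the model names, densities, and weights are illustrative placeholders, not the settings used in this repository:

```yaml
# Sketch of a DARE-TIES merge config for mergekit (all model names are placeholders).
models:
  - model: org/finetuned-model-a   # placeholder fine-tuned model
    parameters:
      density: 0.5                 # fraction of delta parameters kept by DARE
      weight: 0.5                  # mixing weight of this model in the merge
  - model: org/finetuned-model-b   # placeholder fine-tuned model
    parameters:
      density: 0.5
      weight: 0.5
merge_method: dare_ties
base_model: org/base-model         # placeholder; the shared base of the models above
dtype: bfloat16
```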
We use the LLaMA-Factory framework for model training. Set up the environment:

```bash
cd LLaMA-Factory
conda create -n llama_factory python=3.10
conda activate llama_factory
pip install -e ".[torch,metrics]"
pip install deepspeed
```

Configure the training parameters in a YAML file such as `configs/3B.yaml`.
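A minimal sketch of such a training config, using LLaMA-Factory's standard SFT keys; the model path, dataset name, template, and hyperparameters below are illustrative placeholders, not the values used in this repository:

```yaml
### model
model_name_or_path: org/base-3b-model   # placeholder base model

### method
stage: sft
do_train: true
finetuning_type: full
deepspeed: examples/deepspeed/ds_z3_config.json   # ZeRO-3 config shipped with LLaMA-Factory

### dataset
dataset: micota_sft    # placeholder; register the processed data in data/dataset_info.json
template: qwen         # placeholder chat template
cutoff_len: 16384

### output
output_dir: saves/3B-sft
logging_steps: 10
save_steps: 500

### train
per_device_train_batch_size: 1
gradient_accumulation_steps: 8
learning_rate: 1.0e-5
num_train_epochs: 3.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: true
```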
Then launch training:

```bash
llamafactory-cli train configs/3B.yaml
```

We employ lm-evaluation-harness, a tool for evaluating the performance of the fine-tuned models. Set up the environment:

```bash
cd lm-evaluation-harness
conda create -n lm-evaluation-harness python=3.10
conda activate lm-evaluation-harness
pip install -e .
```

We provide comprehensive evaluation across multiple mathematical reasoning benchmarks:

```bash
lm_eval --model vllm \
--model_args "pretrained=Model_Path,tensor_parallel_size=4,gpu_memory_utilization=0.85,max_model_len=16000,enforce_eager=True" \
--tasks gsm8k_zero,AMC,AIME,Olympiad,hendrycks_math_500 \
--batch_size auto \
--gen_kwargs do_sample=false,temperature=0,max_gen_toks=16000 \
--output_path results/micota \
--apply_chat_template \
  --log_samples
```

For the AMC, AIME, Olympiad, and hendrycks_math_500 tasks, we use the customized evaluation tasks and scripts from the Small-Model-Learnability-Gap repository. Specifically, we adopt the task configurations and evaluation frameworks provided in its lm-evaluation-harness directory to assess model performance on complex reasoning benchmarks.
This implementation is based on the original work released under the MIT License, and we thank the authors for their open-source contributions.
| Task | Dataset | Description |
|---|---|---|
| GSM8K | Grade School Math | Basic arithmetic and reasoning |
| AMC | American Mathematics Competitions | Advanced problem solving |
| AIME | American Invitational Mathematics Examination | Competition-level problems |
| OlympiadBench | Math Olympiad | Olympiad-level problems |
| Hendrycks Math | MATH (500-problem subset) | Diverse mathematical concepts |
This repository is built upon LLaMA-Factory, lm-evaluation-harness, mergekit, Small-Model-Learnability-Gap, and hiyouga/math. We would like to thank all contributors for their support.
This project is licensed under the MIT License - see the LICENSE file for details.
If you use MiCoTA in your research, please cite our work:
```bibtex
@misc{ding2025micotabridginglearnabilitygap,
      title={MiCoTA: Bridging the Learnability Gap with Intermediate CoT and Teacher Assistants},
      author={Dongyi Ding and Tiannan Wang and Chenghao Zhu and Meiling Tao and Yuchen Eleanor Jiang and Wangchunshu Zhou},
      year={2025},
      eprint={2507.01887},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2507.01887},
}
```