Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning

📢 If you are interested in our work, please star ⭐ our project.

🌈 Introduction

Large reasoning models (LRMs) have demonstrated impressive capabilities in complex problem-solving, yet their internal reasoning mechanisms remain poorly understood. In this paper, we investigate the reasoning trajectories of LRMs from an information-theoretic perspective. By tracking how the mutual information (MI) between intermediate representations and the correct answer evolves during LRM reasoning, we observe an interesting MI peaks phenomenon: the MI at specific generative steps exhibits a sudden and significant increase during the LRM’s reasoning process. We theoretically analyze this phenomenon and show that as MI increases, the probability of the model’s prediction error decreases. Furthermore, these MI peaks often correspond to tokens expressing reflection or transition, such as “Hmm”, “Wait”, and “Therefore”, which we term thinking tokens. We then demonstrate that these thinking tokens are crucial for the LRM’s reasoning performance, while other tokens have minimal impact. Building on these analyses, we propose two simple yet effective methods that improve LRM reasoning performance by carefully leveraging these thinking tokens. Overall, our work provides novel insights into the reasoning mechanisms of LRMs and offers practical ways to improve their reasoning capabilities.

🚩Main Analyses

Certain steps exhibit sudden and significant increases in MI during the reasoning process of LRMs, and these MI peaks are sparse and non-uniformly distributed.
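To make the notion of a sparse MI peak concrete, here is a minimal sketch that flags peaks in a one-dimensional MI trajectory. It is not the repository's code: the function name `find_mi_peaks`, the robust threshold (median plus `k` scaled median absolute deviations), the default `k=3.0`, and the toy data are all assumptions chosen for illustration.

```python
import numpy as np


def find_mi_peaks(mi, k=3.0):
    """Return indices of generation steps whose MI stands out as a peak.

    Assumed rule (illustrative only): a step is a peak if its MI exceeds
    median + k * 1.4826 * MAD and it is a local maximum of the trajectory.
    """
    mi = np.asarray(mi, dtype=float)
    median = np.median(mi)
    mad = np.median(np.abs(mi - median))  # median absolute deviation
    threshold = median + k * 1.4826 * mad

    peaks = []
    for t in range(len(mi)):
        left = mi[t - 1] if t > 0 else -np.inf
        right = mi[t + 1] if t < len(mi) - 1 else -np.inf
        # Keep only steps that are above the threshold and locally maximal.
        if mi[t] > threshold and mi[t] > left and mi[t] >= right:
            peaks.append(t)
    return peaks


if __name__ == "__main__":
    # Toy trajectory and tokens, invented for this example.
    mi_values = [0.10, 0.12, 0.11, 0.90, 0.13, 0.10, 0.12, 0.85, 0.11, 0.10]
    tokens = ["The", "sum", "is", "Wait", "that", "looks", "off", "Therefore", "x", "=2"]
    for t in find_mi_peaks(mi_values):
        print(t, tokens[t], mi_values[t])
```

A median-based threshold is used here because a few very large values would inflate a mean-and-standard-deviation threshold and hide the peaks themselves; any other outlier rule could be substituted.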

Theoretical Insights: Higher MI Leads to Tighter Bounds on Prediction Error.
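As a hedged illustration of why higher MI tightens error bounds, the two textbook inequalities below (Fano's inequality and the Hellman-Raviv bound) relate the error probability to MI. The notation is assumed for this sketch: $Y$ is the correct answer taking values in a finite set $\mathcal{Y}$, $h_t$ is the representation at step $t$, $P_e$ is the probability that a prediction made from $h_t$ is wrong, and entropies are in bits. The paper's own theorems may be stated differently.

```latex
% Lower bound (Fano's inequality), valid for any predictor \hat{y}(h_t):
P_e \;\ge\; \frac{H(Y \mid h_t) - 1}{\log_2 |\mathcal{Y}|}
    \;=\; \frac{H(Y) - I(Y; h_t) - 1}{\log_2 |\mathcal{Y}|}

% Upper bound (Hellman--Raviv), for the Bayes-optimal (MAP) predictor:
P_e \;\le\; \tfrac{1}{2}\, H(Y \mid h_t)
    \;=\; \tfrac{1}{2}\,\bigl( H(Y) - I(Y; h_t) \bigr)
```

Since $H(Y)$ is fixed by the task, both bounds decrease as $I(Y; h_t)$ increases.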

Non-reasoning LLMs exhibit weaker and less pronounced MI peaks than LRMs, and their overall MI during the reasoning process is lower than that of their corresponding LRMs.

The tokens that appear at MI peaks are mostly connective words that express self-reflection or transitions in the LRM’s reasoning process.

🚀Quick Start

🔧Requirements

The following packages are required to run the code:

python==3.11.5
pytorch==2.1.2
transformers==4.46.1
numpy==1.26.4

🌟Usage

cd src/

1. Collect the representations and compute the MI

sh scripts/compute_mi_trajectories.sh

2. Plot figures to observe the MI Peaks phenomenon

Run the plot_mi_peaks.ipynb notebook.

3. Run the Representation Recycling (RR)

sh scripts/run_RR.sh
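Step 1 above computes MI between intermediate representations and the correct answer. Exact MI is hard to estimate in high dimensions, so kernel dependence measures are often used as a practical surrogate. The sketch below implements one such measure, the biased empirical Hilbert-Schmidt Independence Criterion (HSIC) with Gaussian kernels. Whether scripts/compute_mi_trajectories.sh uses this estimator is an assumption, and the function names, the bandwidth heuristic, and the random data are hypothetical.

```python
import numpy as np


def gaussian_kernel(x, sigma=None):
    """Gaussian (RBF) kernel matrix for the rows of x."""
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]
    sq = np.sum(x ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * x @ x.T, 0.0)
    if sigma is None:
        # Median heuristic (an assumption): bandwidth = median pairwise distance.
        positive = d2[d2 > 0]
        sigma = np.sqrt(np.median(positive)) if positive.size else 1.0
    return np.exp(-d2 / (2.0 * sigma ** 2))


def hsic(x, y):
    """Biased empirical HSIC: trace(K H L H) / (n - 1)^2."""
    K = gaussian_kernel(x)
    L = gaussian_kernel(y)
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return float(np.trace(K @ H @ L @ H) / (n - 1) ** 2)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    reps = rng.normal(size=(200, 4))             # stand-in for step-t representations
    related = reps + 0.1 * rng.normal(size=(200, 4))
    unrelated = rng.normal(size=(200, 4))
    print("dependent pair:  ", hsic(reps, related))
    print("independent pair:", hsic(reps, unrelated))
```

Applying such a measure at every generation step, with the representation at that step as `x` and a representation of the correct answer as `y`, would yield one trajectory per example, which is the kind of curve plotted in step 2.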

📝License

Distributed under the Apache-2.0 License. See LICENSE for more information.

Acknowledgements

Some code in this project is adapted from resources provided by the following repositories:

We greatly appreciate the contributions of the original authors.

📖BibTeX

@article{qian2025demystifying,
  title={Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning},
  author={Qian, Chen and Liu, Dongrui and Wen, Haochen and Bai, Zhen and Liu, Yong and Shao, Jing},
  journal={arXiv preprint arXiv:2506.02867},
  year={2025}
}
