Official repository for the paper:
LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
LaDiR (Latent Diffusion Reasoner) introduces a new reasoning framework that unifies the expressiveness of continuous latent representations with the iterative refinement capability of diffusion models for large language models (LLMs).
Instead of generating reasoning chains autoregressively, LaDiR performs latent diffusion over thought tokens, enabling:
- Iterative semantic self-refinement
- Diverse parallel reasoning trajectories
- A flexible trade-off between accuracy and test-time compute
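The iterative refinement idea above can be sketched with a toy denoising loop. Everything here is illustrative: `toy_denoiser` is a hypothetical stand-in for the learned diffusion model, and the "target" plays the role of a clean thought latent.

```python
import random

def toy_denoiser(z, target, rate=0.5):
    # Stand-in for a learned denoising step: pull noisy latents toward a mode.
    # The real model predicts this update with a trained network.
    return [zi + rate * (ti - zi) for zi, ti in zip(z, target)]

random.seed(0)
target = [random.gauss(0, 1) for _ in range(8)]  # pretend "clean" thought latent
z = [random.gauss(0, 1) for _ in range(8)]       # start from pure noise

for _ in range(15):                               # iterative semantic refinement
    z = toy_denoiser(z, target)

err = max(abs(zi - ti) for zi, ti in zip(z, target))
print(err < 1e-3)  # True: the latent converged toward the target
```

Running several such loops from different noise initializations is what yields diverse parallel reasoning trajectories, and the number of refinement steps is the accuracy/compute knob.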
Clone the repository:

```bash
git clone <repository-url>
```

Create a virtual environment:

```bash
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
```

Install dependencies:

```bash
pip install -r requirements.txt
```
Prepare your dataset in JSONL format with the following structure:

```json
{"input": "question text", "output": "reasoning chain"}
```
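A dataset in this schema can be written and validated with a few lines of standard-library Python. The records and the `train.jsonl` filename below are hypothetical examples, not files shipped with the repository.

```python
import json

# Hypothetical example records following the {"input", "output"} schema.
examples = [
    {"input": "What is 2 + 2?", "output": "2 + 2 = 4. The answer is 4."},
    {"input": "What is 3 * 5?", "output": "3 * 5 = 15. The answer is 15."},
]

# JSONL: one JSON object per line.
with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

# Validate: every line must parse and contain both required keys.
with open("train.jsonl") as f:
    for line in f:
        record = json.loads(line)
        assert {"input", "output"} <= record.keys()
```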
Configure training parameters in `configs/cd_formal_8B_VAE_conn.yaml`.
Run VAE training:

```bash
cd vae
bash scripts/train_vae.sh
```

The model can be configured through YAML files in the `configs/` directory. Key parameters include:
- Model: Base language model path, LoRA configuration
- Training: Learning rate, batch size, number of steps
- VAE: Compression rate, memory size, beta for KL loss
- Dataset: Training file paths, data processing options
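As a rough illustration, a config covering the parameter groups above might look like the following. The key names and values here are hypothetical placeholders; consult `configs/cd_formal_8B_VAE_conn.yaml` for the actual schema.

```yaml
# Illustrative sketch only — key names are assumptions, not the real schema.
model:
  base_model_path: path/to/base_llm   # base language model
  lora_rank: 16                       # LoRA configuration
training:
  learning_rate: 1.0e-4
  batch_size: 32
  num_steps: 10000
vae:
  compression_rate: 8                 # latent tokens per chunk of text tokens
  memory_size: 64
  kl_beta: 0.1                        # beta weight on the KL loss
dataset:
  train_file: data/train.jsonl
```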
If you find this work useful, please consider citing:
```bibtex
@article{kang2025ladir,
  title={LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning},
  author={Kang, Haoqiang and Zhang, Yizhe and Kuang, Nikki Lijing and Majamäki, Nicklas and Jaitly, Navdeep and Ma, Yi-An and Qin, Lianhui},
  journal={arXiv preprint arXiv:2510.08558},
  year={2025}
}
```