Thanks to visit codestin.com
Credit goes to github.com

Skip to content

VILA-Lab/Awesome-DLMs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 

Repository files navigation

Awesome Diffusion Language Models

Awesome https://arxiv.org/abs/2508.10875

One of the most starred, comprehensive and up-to-date collections of Diffusion Language Model papers, code and resources! If you find this repository helpful, please consider giving it a ⭐ to support.

Timeline of Diffusion Language Models

This figure highlights key milestones in the development of DLMs, categorized into three groups: continuous DLMs, discrete DLMs, and recent multimodal DLMs. We observe that while early research predominantly focused on continuous DLMs, discrete DLMs have gained increasing popularity in more recent years.

Timeline of Diffusion Language Models

Table of Contents

Playground

Must-Read

D3PM: Structured Denoising Diffusion Models in Discrete State-Spaces
arXiv

LLaDA: Large Language Diffusion Models
arXiv Website Star

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models (ICLR 2025)
arXiv Star

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
arXiv Website Star

Super Data Learners: Diffusion Language Models are Super Data Learners
arXiv Website Star

Surveys

[12 Aug 2025] A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models
arXiv Star

[16 Jun 2025] Discrete Diffusion in Large Language and Multimodal Models: A Survey
arXiv Star

[23 Feb 2024] Diffusion models in text generation: a survey (PeerJ Computer Science)

[29 Jun 2023] An Overview of Diffusion Models for Text Generation (MIPRO)

[24 May 2023] A Survey of Diffusion Models in Natural Language Processing
arXiv

[14 Mar 2023] Diffusion Models in NLP: A Survey
arXiv

[12 Mar 2023] Diffusion Models for Non-autoregressive Text Generation: A Survey (IJCAI 2023)
arXiv Star

Diffusion Foundation

[7 Sep 2022] Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow (ICLR 2023)
arXiv Star

[26 Nov 2020] Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021)
arXiv Star

[6 Oct 2020] Denoising Diffusion Implicit Models (ICLR 2021)
arXiv Star

[19 Jun 2020] Denoising Diffusion Probabilistic Models (NeurIPS 2020)
arXiv Website Star

[12 Jul 2019] Generative Modeling by Estimating Gradients of the Data Distribution (NeurIPS 2019)
arXiv Star

[12 Mar 2015] Deep Unsupervised Learning using Nonequilibrium Thermodynamics (ICML 2015)
arXiv Star

Discrete DLMs

[05 Nov 2025] Training Optimal Large Diffusion Language Models
arXiv Website Star

[02 Nov 2025] OpenMoE 2: Sparse Diffusion Language Models
blog Website Star

[21 Oct 2025] How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices
arXiv

[20 Oct 2025] Soft-Masked Diffusion Language Models
arXiv

[17 Oct 2025] Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning
arXiv

[17 Oct 2025] Attention Sinks in Diffusion Language Models
arXiv

[15 Oct 2025] On the Reasoning Abilities of Masked Diffusion Language Models
arXiv

[12 Oct 2025] UltraLLaDA: Scaling the Context Length to 128K for Diffusion Large Language Models
arXiv Star

[10 Oct 2025] Closing the Data-Efficiency Gap Between Autoregressive and Masked Diffusion LLMs
arXiv

[10 Oct 2025] Beyond Surface Reasoning: Unveiling the True Long Chain-of-Thought Capacity of Diffusion Large Language Models
arXiv

[8 Oct 2025] Next Semantic Scale Prediction via Hierarchical Diffusion Language Models
arXiv

[7 Oct 2025] SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation
arXiv Star

[5 Oct 2025] What Makes Diffusion Language Models Super Data Learners?
arXiv Star

[5 Oct 2025] Beyond Next-Token Prediction: A Performance Characterization of Diffusion versus Autoregressive Language Models
arXiv

[4 Oct 2025] Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs
arXiv Star

[3 Oct 2025] DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
arXiv

[1 Oct 2025] Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling
arXiv

[30 Sep 2025] dParallel: Learnable Parallel Decoding for dLLMs
arXiv Star

[29 Sep 2025] Why mask diffusion does not work
arXiv

[29 Sep 2025] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
arXiv Star

[29 Sep 2025] LLaDA-MoE: A Sparse MoE Diffusion Language Model
arXiv

[29 Sep 2025] Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct
arXiv

[28 Sep 2025] SparseD: Sparse Attention for Diffusion Language Models
arXiv Star

[28 Sep 2025] Sequential Diffusion Language Models
arXiv Star

[27 Sep 2025] Tree Reward-Aligned Search for TReASURe in Masked Diffusion Language Models
arXiv

[24 Sep 2025] FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models
arXiv

[17 Sep 2025] Masked Diffusion Models as Energy Minimization
arXiv

[5 Sep 2025] Masked Diffusion Language Models with Frequency-Informed Training
arXiv

[1 Sep 2025] Dream-Coder 7B: An Open Diffusion Language Model for Code
arXiv Website Star

[31 Aug 2025] Any-Order Flexible Length Masked Diffusion
arXiv

[17 Aug 2025] Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position
arXiv

[14 Aug 2025] Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs
arXiv

[12 Aug 2025] Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
arXiv Website

[4 Aug 2025] Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
arXiv Website

[25 Jul 2025] Jailbreaking Large Language Diffusion Models: Revealing Hidden Safety Flaws in Diffusion-Based Text Generation
arXiv

[15 Jul 2025] DreamOn: Diffusion Language Models For Code Infilling Beyond Fixed-Size Canvas
Website Star

[15 Jul 2025] The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs
arXiv Star

[10 Jul 2025] Your Absorbing Discrete Diffusion Secretly Models the Bayesian Posterior
arXiv Star

[7 Jul 2025] Review, Remask, Refine (R3): Process-Guided Block Diffusion for Text Generation (ICML 2025)
arXiv

[6 Jul 2025] Efficient perplexity bound and ratio matching in discrete diffusion language models (ICLR 2025)
arXiv Star

[2 Jul 2025] Discrete Diffusion Models for Language Generation
arXiv Star

[17 Jun 2025] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs
arXiv Star

[12 Jun 2025] The Diffusion Duality (ICML 2025)
arXiv Website Star

[12 Jun 2025] Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Principles
arXiv Star

[2 Jun 2025] Esoteric Language Models
arXiv Website Star

[25 May 2025] LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
arXiv Website Star

[24 May 2025] Anchored Diffusion Language Model
arXiv

[21 May 2025] Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
arXiv

[20 May 2025] CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation
arXiv

[9 May 2025] Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
arXiv

[22 Apr 2025] Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion (ICML 2025)
arXiv

[2 Apr 2025] Dream 7B
Website Star

[16 Mar 2025] State Fourier Diffusion Language Model (SFDLM): A Scalable, Novel Iterative Approach to Language Modeling
arXiv

[12 Mar 2025] Constrained Discrete Diffusion
arXiv

[12 Mar 2025] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models (ICLR 2025)
arXiv Website Star

[11 Mar 2025] Understanding the Quality-Diversity Trade-off in Diffusion Language Models
arXiv Star

[6 Mar 2025] Generalized Interpolating Discrete Diffusion (ICML 2025)
arXiv Star

[14 Feb 2025] Large Language Diffusion Models
arXiv Website Star

[13 Feb 2025] Theoretical Benefit and Limitation of Diffusion Language Model
arXiv

[13 Feb 2025] Non-Markovian Discrete Diffusion with Causal Language Models
arXiv

[10 Feb 2025] Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions (ICML 2025)
arXiv

[10 Nov 2024] Conditional [MASK] Discrete Diffusion Language Model
arXiv

[28 Oct 2024] Beyond Autoregression: Fast LLMs via Self-Distillation Through Time (ICLR 2025)
arXiv Website Star

[28 Oct 2024] Energy-Based Diffusion Language Models for Text Generation (ICLR 2025)
arXiv Star

[24 Oct 2024] Scaling up Masked Diffusion Models on Text (ICLR 2025)
arXiv Star

[23 Oct 2024] Scaling Diffusion Language Models via Adaptation from Autoregressive Models (ICLR 2025)
arXiv Star

[18 Oct 2024] Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning (ICLR 2025)
arXiv Star

[8 Oct 2024] (DDPD) Think While You Generate: Discrete Diffusion with Planned Denoising (ICLR 2025)
arXiv Star

[2 Oct 2024] Discrete Copula Diffusion (ICLR 2025)
arXiv Star

[4 Sep 2024] Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling (ICLR 2025)
arXiv

[22 Jul 2024] Discrete Flow Matching (NeurIPS 2024)
arXiv

[10 Jul 2024] Promises, Outlooks and Challenges of Diffusion Language Modeling
arXiv

[11 Jun 2024] (MDLM) Simple and Effective Masked Diffusion Language Models (NeurIPS 2024)
arXiv Website Star

[6 Jun 2024] (RADD) Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data (ICLR 2025)
arXiv

[6 Jun 2024] (MD4) Simplified and Generalized Masked Diffusion for Discrete Data (NeurIPS 2024)
arXiv Star

[7 Feb 2024] Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design (ICML 2024)
arXiv

[30 Jan 2024] Transfer Learning for Text Diffusion Models
arXiv

[25 Oct 2023] (SEDD) Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (ICML 2024)
arXiv Star

[15 Oct 2023] FiLM: Fill-in Language Models for Any-Order Generation
arXiv Star

[23 Aug 2023] Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
arXiv Star

[30 May 2023] Likelihood-Based Diffusion Language Models (NeurIPS 2023)
arXiv Star

[6 May 2023] Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation (EACL 2024)
arXiv Star

[11 Feb 2023] A Reparameterized Discrete Diffusion Model for Text Generation (COLM 2024)
arXiv Star

[28 Nov 2022] DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models (ACL 2023)
arXiv Star

[30 Oct 2022] DiffusER: Discrete Diffusion via Edit-based Reconstruction (ICLR 2023)
arXiv

[13 Dec 2021] (SUNDAE) Step-unrolled Denoising Autoencoders for Text Generation (ICLR 2022)
arXiv

[7 Jul 2021] Structured Denoising Diffusion Models in Discrete State-Spaces (NeurIPS 2021)
arXiv

[10 Feb 2021] Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions (NeurIPS 2021)
arXiv Star

Continuous DLMs

[6 Oct 2025] LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
arXiv

[3 Oct 2025] Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner
arXiv

[26 Jun 2025] Compressed and Smooth Latent Space for Text Diffusion Modeling
arXiv

[28 May 2025] Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes (ACL 2025)
arXiv

[24 May 2025] Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation
arXiv Star

[20 Apr 2025] Perfect diffusion is TC^0 -- Bad diffusion is Turing-complete
arXiv

[19 Feb 2025] TESS 2: A Large-Scale Generalist Diffusion Language Model
arXiv Star

[15 Dec 2024] Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models (ACL 2025)
arXiv

[17 Oct 2024] Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration (NeurIPS 2024)
arXiv Star

[8 Aug 2024] Diffusion Guided Language Modeling (ACL Findings 2024)
arXiv Star

[May 2024] Effective Integration of Text Diffusion and Pre-Trained Language Models with Linguistic Easy-First Schedule (LREC-COLING 2024)

[17 Mar 2024] Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows (NAACL 2024)
arXiv

[14 Mar 2024] LDSeq: Latent Diffusion Models for Sequence to Sequence Text Generation (CSAI 23)

[Mar 2024] Flow Matching for Conditional Text Generation in a Few Sampling Steps (EACL 2024)

[29 Feb 2024] TEncDM: Understanding the Properties of Diffusion Model in the Space of Language Model Encodings
arXiv Star

[29 Feb 2024] Generating, Reconstructing, and Representing Discrete and Continuous Data: Generalized Diffusion with Learnable Encoding-Decoding (ICML 2024)
arXiv

[31 Oct 2023] LADIDA: Latent Diffusion for Document Generation with Sequential Decoding (NeurIPS Workshop 2023)

[18 Oct 2023] InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation (EMNLP 2023)
arXiv Star

[09 Oct 2023] DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models (EMNLP 2023)
arXiv Star

[26 Jul 2023] How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data? (ECAI 2023)
arXiv Star

[19 May 2023] DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion
arXiv

[16 May 2023] AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation (NeurIPS 2023)
arXiv Star

[15 May 2023] TESS: Text-to-Text Self-Conditioned Simplex Diffusion (EACL 2024)
arXiv Star

[25 Apr 2023] Glyphdiffusion: Text generation as image generation
arXiv

[10 Apr 2023] A Cheaper and Better Diffusion Language Model with Soft-Masked Noise (EMNLP 2023)
arXiv Star

[20 Feb 2023] Dinoiser: Diffused conditional sequence learning by manipulating noises (TCAL 2024)
arXiv Star

[22 Dec 2022] (GENIE) Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise (ICML 2023)
arXiv Star

[20 Dec 2022] Seqdiffuseq: Text diffusion with encoder-decoder transformers (NAACL 2024)
arXivStar

[19 Dec 2022] Latent Diffusion for Language Generation (NeurIPS 2023)
arXiv Star

[19 Dec 2022] (Difformer) Empowering Diffusion Models on the Embedding Space for Text Generation (NAACL 2024)
arXiv Star

[28 Nov 2022] Continuous diffusion for categorical data
arXiv

[8 Nov 2022] Self-conditioned Embedding Diffusion for Text Generation
arXiv

[31 Oct 2022] SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control (ACL 2023)
arXiv Star

[17 Oct 2022] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models (ICLR 2023)
arXiv Star

[1 Aug 2022] Composable Text Controls in Latent Space with ODEs (EMNLP 2023)
arXiv Star

[13 Jun 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling (ICML 2022)
arXiv Star

[27 May 2022] Diffusion-LM Improves Controllable Text Generation (NeurIPS 2022)
arXiv Star

Multimodal DLMs

[22 Oct 2025] From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model
arXiv Website Star

[23 Sep 2025] Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation
arXiv Website Star

[9 Sep 2025] Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
arXiv Website Star

[8 Sep 2025] LLaDA-VLA: Vision Language Diffusion Action Models
arXiv Website

[29 May 2025] Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model
arXiv Star

[26 May 2025] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
arXiv Website Star

[22 May 2025] LaViDa: A Large Diffusion Language Model for Multimodal Understanding
arXiv Website Star

[22 May 2025] Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding
arXiv Star

[22 May 2025] LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
arXiv Website Star

[21 May 2025] MMaDA: Multimodal Large Diffusion Language Models
arXiv Star

[26 Mar 2025] Unified Multimodal Discrete Diffusion
arXiv Website Star

Training Strategies

[03 Oct 2025] Training Optimal Large Diffusion Language Models
arXiv Website Star

[13 Oct 2025] Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models
arXiv Star

[10 Oct 2025] SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
arXiv Star

[5 Oct 2025] Principled and Tractable RL for Reasoning with Diffusion Language Models
arXiv

[2 Oct 2025] Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models
arXiv

[27 Sep 2025] A2D: Any-Order, Any-Step Safety Alignment for Diffusion Language Models
arXiv

[12 Sep 2025] Inpainting-Guided Policy Optimization for Diffusion Large Language Models
arXiv

[8 Sep 2025] Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
arXiv Star

[7 Sep 2025] BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
arXiv

[27 Aug 2025] Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding
arXiv Star

[18 Aug 2025] MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
arXiv Website Star

[7 Jul 2025] wd1: Weighted Policy Optimization for Reasoning in Diffusion Language Models
arXiv Star

[25 Jun 2025] DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
arXiv Star

[25 May 2025] LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
arXiv Website Star

[21 May 2025] MMaDA: Multimodal Large Diffusion Language Models
arXiv Star

[15 May 2025] Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models
arXiv

[16 Apr 2025] d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
arXiv Website Star

[2 Apr 2025] Dream 7B
Website Star

[3 Feb 2025] Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods
arXiv Star

[Jan 2025] Addressing the Training-Inference Discrepancy in Discrete Diffusion for Text Generation (COLING 2025)
Star

[23 Oct 2024] Scaling Diffusion Language Models via Adaptation from Autoregressive Models (ICLR 2025)
arXiv Star

[17 Oct 2024] Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design (ICLR 2025)
arXiv Star

[19 Feb 2024] Text Diffusion with Reinforced Conditioning
arXiv

[12 Feb 2024] Diffusion of Thought: Chain-of-Thought Reasoning in Diffusion Language Models (NeurIPS 2024)
arXiv Star

[8 May 2023] Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap between Training and Inference! (ACL 2023)
arXiv Star

Inference Optimization

[20 Oct 2025] Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model
arXiv

[16 Oct 2025] Attention Is All You Need for KV Cache in Diffusion LLMs
arXiv Website

[16 Oct 2025] Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
arXiv

[13 Oct 2025] Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States
arXiv

[13 Oct 2025] Unlocking the Potential of Diffusion Language Models through Template Infilling
arXiv

[10 Oct 2025] Mask Tokens as Prophet: Fine-Grained Cache Eviction for Efficient dLLM Inference
arXiv Star

[9 Oct 2025] dInfer: An Efficient Inference Framework for Diffusion Language Models
arXiv Star

[8 Oct 2025] Accelerating Diffusion LLM Inference via Local Determinism Propagation
arXiv

[7 Oct 2025] CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credits
arXiv

[6 Oct 2025] Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models
arXiv

[5 Oct 2025] Self Speculative Decoding for Diffusion Large Language Models
arXiv

[30 Sep 2025] Fast-dLLM v2: Efficient Block-Diffusion LLM
arXiv Website Star

[29 Sep 2025] RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance
arXiv

[29 Sep 2025] Learning to Parallel: Accelerating Diffusion Large Language Models via Adaptive Parallel Decoding
arXiv Website Star

[28 Sep 2025] Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step
arXiv Star

[28 Sep 2025] Don't Settle Too Early: Self-Reflective Remasking for Diffusion Language Models
arXiv Star

[28 Sep 2025] DiffuSpec: Unlocking Diffusion Language Models for Speculative Decoding
arXiv

[27 Sep 2025] d2Cache: Accelerating Diffusion-based LLMs via Dual Adaptive Caching
arXiv Star

[25 Sep 2025] Enabling Approximate Joint Sampling in Diffusion LMs
arXiv

[22 Sep 2025] Spiffy: Multiplying Diffusion LLM Acceleration via Lossless Speculative Decoding
arXiv

[18 Sep 2025] Fast and Fluent Diffusion Language Models via Convolutional Decoding and Rejective Fine-tuning
arXiv Star

[31 Aug 2025] Reward-Weighted Sampling: Enhancing Non-Autoregressive Characteristics in Masked Diffusion LLMs
arXiv

[27 Aug 2025] Diffusion Language Models Know the Answer Before Decoding
arXiv Star

[20 Aug 2025] Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
arXiv

[19 Aug 2025] DPad: Efficient Diffusion Language Models with Suffix Dropout
arXiv Star

[18 Aug 2025] PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models
arXiv Star

[14 Aug 2025] DLLMQuant: Quantizing Diffusion-based Large Language Models
arXiv

[8 Aug 2025] Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing
arXiv Star

[4 Aug 2025] Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
arXiv

[1 Aug 2025] Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models
arXiv Star

[24 Jul 2025] Wide-In, Narrow-Out: Revokable Decoding for Efficient and Effective DLLMs
arXiv Star

[11 Jul 2025] Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling
arXiv

[6 Jul 2025] Unveiling the Potential of Diffusion Large Language Model in Controllable Generation
arXiv

[23 Jun 2025] Plan for Speed -- Dilated Scheduling for Masked Diffusion Language Models
arXiv

[12 Jun 2025] Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Principles
arXiv Star

[12 Jun 2025] The Diffusion Duality (ICML 2025)
arXiv Website Star

[2 Jun 2025] Esoteric Language Models
arXiv Website Star

[31 May 2025] Accelerating Diffusion LLMs via Adaptive Parallel Decoding
arXiv Star

[30 May 2025] DLM-One: Diffusion Language Models for One-Step Sequence Generation
arXiv

[30 May 2025] Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking
arXiv

[28 May 2025] Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding
arXiv Website Star

[28 May 2025] DINGO: Constrained Inference for Diffusion LLMs
arXiv

[27 May 2025] Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion
arXiv

[26 May 2025] Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking
arXiv Star

[22 May 2025] Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding
arXiv Star

[17 May 2025] dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching
arXiv Star

[21 May 2025] dKV-Cache: The Cache for Diffusion Language Models
arXiv Star

[1 Mar 2025] Remasking Discrete Diffusion Models with Inference-Time Scaling (ICLR 2025)
arXiv Website Star

[11 Oct 2024] Distillation of Discrete Diffusion through Dimensional Correlations (ICML 2025)
arXiv Star

[8 Oct 2024] (DDPD) Think While You Generate: Discrete Diffusion with Planned Denoising
arXiv Star

[Nov 2024] Enable Fast Sampling for Seq2Seq Text Diffusion (EMNLP Findings 2024)
Anthology Star

[10 Aug 2024] Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion (NAACL 2025)
arXiv

[May 2024] Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation (LREC-COLING 2024)
Anthology

[15 Mar 2024] Utilizing Latent Diffusion Model to Accelerate Sampling Speed and Enhance Text Generation Quality

[15 Feb 2024] Quantized Embedding Vectors for Controllable Diffusion Language Models
arXiv

[3 Jun 2024] Unlocking Guidance for Discrete State-Space Diffusion and Flow Models (ICLR 2025)
arXiv Star

[09 Oct 2023] DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models (EMNLP 2023)
arXiv Star

[24 May 2023] David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs (NAACL 2024)
arXiv

[18 May 2023] Diffusion Language Models Generation Can Be Halted Early
arXiv

Training Frameworks

[02 Nov 2025] MegaDLMs: Training Diffusion Language Models at Any Scale
Star

Applications

[1 Oct 2025] Syntax-Guided Diffusion Language Models with User-Integrated Personalization
arXiv

[30 Sep 2025] TraceDet: Hallucination Detection from the Decoding Trace of Diffusion Large Language Models
arXiv

[29 Sep 2025] DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern
arXiv Star

[24 Sep 2025] Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving
arXiv

[14 Aug 2025] Improving Text Style Transfer using Masked Diffusion Language Models with Inference-time Scaling
arXiv

[2 Aug 2025] TreeDiff: AST-Guided Code Generation with Diffusion LLMs
arXiv

[25 Jul 2025] Arg-LLaDA: Argument Summarization via Large Language Diffusion Models and Sufficiency-Aware Refinement
arXiv

[26 Jun 2025] DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
arXiv Star

[17 Jun 2025] Mercury: Ultra-Fast Language Models Based on Diffusion
arXiv

[16 Jun 2025] Flexible-length Text Infilling for Discrete Diffusion Models
arXiv

[11 Jun 2025] Debunk and Infer: Multimodal Fake News Detection via Diffusion-Generated Evidence and LLM Reasoning
arXiv

[9 Jun 2025] Diffusion Sequence Models for Enhanced Protein Representation and Generation
arXiv

[28 May 2025] CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models (ICML 2025)
arXiv Star

[14 May 2025] Gemini Diffusion
Website

[27 Feb 2025] EdiText: Controllable Coarse-to-Fine Text Editing with Diffusion Language Models (ACL 2025)
arXiv

[31 Jan 2025] TermDiffuSum: A Term-guided Diffusion Model for Extractive Summarization of Legal Documents (COLING 2025)
Star

[1 Jan 2025] DiffETM: Diffusion Process Enhanced Embedded Topic Model (ICASSP 2025)
arXiv

[23 Dec 2024] DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
arXiv

[5 Nov 2024] DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models (ACL 2025)
arXiv Star

[30 Oct 2024] Private Synthetic Text Generation with Diffusion Models (NAACL 2025)
arXiv Star

[22 Oct 2024] MeMDLM: De Novo Membrane Protein Design with Masked Discrete Diffusion Protein Language Models (ICLR 2025)
arXiv

[17 Oct 2024] Text-Guided Multi-Property Molecular Optimization with a Diffusion Language Model
arXiv

[17 Oct 2024] DPLM-2: A Multimodal Diffusion Protein Language Model
arXiv Website Star

[17 Oct 2024] Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design (ICLR 2025)
arXiv Star

[10 Oct 2024] Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction (ICLR 2025)
arXiv

[14 Sep 2024] Towards Diverse and Efficient Audio Captioning via Diffusion Models (DAC-Interspeech25)
arXiv

[10 Sep 2024] Table-to-Text Generation with Pretrained Diffusion Models (IEEE 2024)
arXiv

[5 Sep 2024] An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification (EMNLP 2024)
arXiv Star

[Aug 2024] DiffusPoll: Conditional Text Diffusion Model for Poll Generation (ACL 2024)
Star

[25 Jun 2024] Discrete Diffusion Language Model for Efficient Text Summarization (NAACL 2025)
arXiv

[16 Apr 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? (NAACL 2024)
arXiv Star

[13 Apr 2024] Improved Paraphrase Generation via Controllable Latent Diffusion
arXiv Star

[10 Apr 2024] DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space (LREC-COLING 2024)
arXiv Star

[10 Apr 2024] Diffuwords: A Contrastive Diffusion Model for Lexically Constrained Text Generation (SSRN 2024 Apr)

[28 Mar 2024] Benchmarking Diffusion Models for Machine Translation (EACL 2024)

[26 Mar 2024] Improving Iteration-based Non-Autoregressive Language Model With Time Step Awareness (ICPADS 2023)

[28 Feb 2024] Diffusion Language Models Are Versatile Protein Learners (ICML 2024)
arXiv Star

[26 Feb 2024] DiffuCOMET: Contextual Commonsense Knowledge Diffusion (ACL 2024)
arXiv Star

[24 Feb 2024] IPED: An Implicit Perspective for Relational Triple Extraction based on Diffusion Model (NAACL 2024)
arXiv Star

[23 Feb 2024] Let's Rectify Step by Step: Improving Aspect-based Sentiment Analysis with Diffusion Models (LREC-COLING 2024)
arXiv Star

[20 Feb 2024] Text-Guided Molecule Generation with Diffusion Language Model (AAAI 2024)
arXiv Star

[16 Feb 2024] Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation
arXiv

[11 Jan 2024] MDM: Meta diffusion model for hard-constrained text generation (Knowledge-Based Systems)

[Dec 2023] DiffusionSL: Sequence Labeling via Tag Diffusion Process (EMNLP 2023)

[19 Dec 2023] IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition (IJCV 2025)
arXiv

[12 Dec 2023] DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models (EMNLP 2023)
arXiv

[3 Dec 2023] DiffuCom: A novel diffusion model for comment generation (Knowledge-Based Systems)

[Dec 2023] DiffusionRet: Diffusion-Enhanced Generative Retriever using Constrained Decoding (EMNLP 2023)
Star

[16 Nov 2023] P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models (NAACL 2024)
arXiv Star

[31 Oct 2023] LADIDA: Latent Diffusion for Document Generation with Sequential Decoding (NeurIPS Workshop 2023)

[26 Oct 2023] DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation (EMNLP 2023)
arXiv

[24 Oct 2023] ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts (EMNLP 2023)
arXiv Star

[23 Oct 2023] DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM (EMNLP 2023)
arXiv Star

[21 Oct 2023] Context-Aware Prompt for Generation-based Event Argument Extraction with Diffusion Models (CIKM 2023)

[16 Oct 2023] ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model (ScienceAdvances)
arXiv

[29 Aug 2023] ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer (AAAI 2024)
arXiv Star

[17 Aug 2023] Enhancing Phrase Representation by Information Bottleneck Guided Text Diffusion Process for Keyphrase Extraction (LREC-COLING 2024)
arXiv

[25 Jul 2023] XDLM: Cross-lingual Diffusion Language Model for Machine Translation
arXiv Star

[9 Jul 2023] Controllable Conversation Generation with Conversation Structures via Diffusion Models (ACL 2023)

[14 Jun 2023] PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation (AAAI 2024)
arXiv

[14 Jun 2023] DiffuDetox: A Mixed Diffusion Model for Text Detoxification (ACL 2023)
arXiv Star

[5 Jun 2023] PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model (NeurIPS 2023)
arXiv Star

[2 Jun 2023] DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation (ACL 2023)
arXiv Star

[31 May 2023] Fine-grained Text Style Transfer with Diffusion-Based Language Models (RepL4NLP 2023)
arXiv Star

[31 May 2023] Protein Design with Guided Discrete Diffusion (NeurIPS 2023)
arXiv Star

[22 May 2023] Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation (EMNLP 2023)
arXiv Star

[22 May 2023] DiffusionNER: Boundary Diffusion for Named Entity Recognition (ACL 2023)
arXiv Star

[2 May 2023] DiffuSum: Generation Enhanced Extractive Summarization with Diffusion (ACL 2023)
arXiv Star

[7 Jan 2023] ROIC-DM: Robust Text Inference and Classification via Diffusion Model
arXiv

Resources

ZHZisZZ/dllm Star

pengzhangzhi/Open-dLLM Star

bansky-cl/diffusion-nlp-paper-arxiv Star

bansky-cl/Diffusion-LM-Papers Star

yczhou001/Awesome-Diffusion-LLM Star

StevenYuan666/Awesome-Diffusion-Models-for-NLP Star

LiQiiiii/DLLM-Survey Star

ML-GSAI/Diffusion-LLM-Papers Star

AoiDragon/Awesome-Text-Diffusion-Models Star

kuleshov-group/awesome-discrete-diffusion-models Star

Gemini Diffusion

Mercury Arxiv

Star History Chart

Citation

@article{li2025survey,
  title={A Survey on Diffusion Language Models},
  author={Li, Tianyi and Chen, Mingda and Guo, Bowei and Shen, Zhiqiang},
  journal={arXiv preprint arXiv:2508.10875},
  year={2025}
}

About

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 7