📚 Stars
Official implementation of "Diffusion Language Models Know the Answer Before Decoding"
[ISCA 2025] Official implementation of "MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization"
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
[ICCV 2025] Official implementation of "OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models"