Thanks to visit codestin.com
Credit goes to boruizhang.site

Borui Zhang (张博睿)

About me: I am a final-year Ph.D. candidate at the i-VisionGroup, Department of Automation, Tsinghua University, advised by Prof. Jiwen Lu. I received my B.E. degree from the Department of Automation and a second B.A. degree from the School of Economics and Management at Tsinghua University in 2021. My research philosophy focuses on bridging the gap between theoretical interpretability and practical efficiency in deep learning.

Research: My primary research interests lie at the intersection of Computer Vision and Deep Learning Theory. Currently, I am focused on the following pillars:

profile photo

News

Preprints

* indicates equal contribution

Quantize-then-Rectify: Efficient VQ-VAE Training
Quantize-then-Rectify: Efficient VQ-VAE Training
arXiv, 2025

To investigate the relationship between continuous and discrete tokenizers, we propose ReVQ. This method yields a high-performance VQ-VAE requiring only 40 GPU hours of training on a single RTX 4090.

SFTok: Bridging the Performance Gap in Discrete Tokenizers
SFTok: Bridging the Performance Gap in Discrete Tokenizers
arXiv, 2025

We investigate iterative approaches for constructing discrete tokenizers and propose SFTok. Analogous to the discrete diffusion paradigm, SFTok is well-suited for integration into Multimodal Large Models (MLLMs), facilitating the realization of a unified discrete diffusion framework.

Preventing Local Pitfalls in Vector Quantization via Optimal Transport
Preventing Local Pitfalls in Vector Quantization via Optimal Transport
Borui Zhang*, Wenzhao Zheng, Jie Zhou, Jiwen Lu
arXiv, 2024

This study addresses the training instability of Vector-Quantized Networks (VQNs) by introducing OptVQ, a new method using the Sinkhorn algorithm for optimal transport. It achieves full codebook utilization (100%) and outperforms current VQNs.

Exploring Unified Perspective For Fast Shapley Value Estimation
Exploring Unified Perspective For Fast Shapley Value Estimation
arXiv, 2023

This paper analyzes existing Shapley value estimators and proposes SimSHAP. Experiments validate that SimSHAP significantly accelerates the computation of accurate Shapley values.

Selected Publications

Path Choice Matters for Clear Attribution in Path Methods
Path Choice Matters for Clear Attribution in Path Methods
ICLR 2024, 2024

We introduced the Concentration Principle and developed SAMP, an efficient model-agnostic interpreter incorporating infinitesimal constraint (IC) and momentum strategy (MS).

Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint
Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint
ICLR 2023, 2023

This paper proposes Bort, an optimizer for improving model explainability with boundedness and orthogonality constraints, derived from model comprehensibility conditions.

Attributable Visual Similarity Learning
Attributable Visual Similarity Learning
CVPR 2022, 2022

We propose AVSL, which employs a generalized similarity learning paradigm to represent the similarity between images with a graph for a more accurate and explainable measure.

Deep Relational Metric Learning
Deep Relational Metric Learning
Wenzhao Zheng*, Borui Zhang*, Jiwen Lu, Jie Zhou
ICCV 2021, 2021

This paper proposes to adaptively learn an ensemble of features that characterizes an image from different aspects, employing a relational module to capture correlations among features.

Honors and Awards

Academic Services