I am Qihan Huang, a final-year Ph.D. student in the College of Computer Science and Technology, Zhejiang University. Prior to this, I earned a bachelor's degree in Software Engineering from Zhejiang University in 2021. From 2024 to 2025, I completed a long-term research internship at Alibaba Group, focusing on multimodal large language models (MLLMs) and image generation.
Currently, my research interest lies in reinforcement learning for MLLMs.
ICCV 2025 Boosting MLLM Reasoning with Text-Debiased Hint-GRPO, Qihan Huang, Weilong Dai, Jinlong Liu, et al.
AAAI 2025 Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation, Qihan Huang, Siming Fu, Jinlong Liu, et al.
CVPR 2025 PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation, Qihan Huang, Weilong Dai, Jinlong Liu, et al.
ICLR 2025 MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance, Xierui Wang, Siming Fu, Qihan Huang, et al.
NeurIPS 2024 LG-CAV: Train Any Concept Activation Vector with Language Guidance, Qihan Huang, Jie Song, Mengqi Xue, et al.
AAAI 2024 On the Concept Trustworthiness in Concept Bottleneck Models, Qihan Huang, Jie Song, Haofei Zhang, et al.
ICCV 2023 Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks, Qihan Huang, Mengqi Xue, Wenqi Huang, et al.
IJCAI 2024 ProtoPFormer: Concentrating on Prototypical Parts in Vision Transformers for Interpretable Image Recognition, Mengqi Xue, Qihan Huang, Haofei Zhang, et al.