Li Zhang
Tenure-track Professor
School of Data Science, Fudan University
email /
google scholar
I am a tenure-track Professor at the School of Data Science, Fudan University where I direct the Zhang Vision Group. The aim of my group is to engage in state of the art research in computer vision and deep learning.
Previously, I was a Research Scientist at Samsung AI Center Cambridge, and a Postdoctoral Research Fellow at the University of Oxford where I was supervised by professor Philip H.S. Torr and professor Andrew Zisserman.
Prior to joining Oxford, I read my PhD in computer science under the supervision of professor Tao Xiang at Queen Mary University of London.
If you are highly creative, have top grades/coding skill and interested in joining my group please do not hesitate to send me your CV and transcripts of grades.
This website will no longer be maintained. New website is here.
News
Two papers to appear in ICLR 2023.
Call for papers! We are orgnising a CVPR 2023 workshop on End-to-End Autonomous Driving: Perception, Prediction, Planning and Simulation.
One paper to appear in AAAI 2023.
One paper is accepted by Pattern Recognition.
I will be serving as an Area Chair for CVPR 2023.
I will be talking at CCAI 2022 and PRCV 2022.
One paper to appear in NeurIPS 2022.
Our work DGMN is accepted by IEEE TPAMI.
Four papers to appear in ECCV 2022.
Our DeepInteraction is ranked 1st at the nuScenes 3D detection leaderboard.
Our work SiamMask is accepted by IEEE TPAMI.
One paper to appear in CVPR 2022.
Our work SETR is ranked second at the most influential CVPR papers.
Two papers to appear in NeurIPS 2021 (1 Spotlight and 1 Poster).
Publications
- S-NeRF: Neural Radiance Fields for Street Views
Ziyang Xie, Junge Zhang, Wenye Li, Feihu Zhang, Li Zhang,
ICLR 2023
[paper]
- SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation
Qiang Wan, Jiachen Lu, Zilong Huang, Gang YU, Li Zhang,
ICLR 2023
[paper]
- PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
Yanqin Jiang, Li Zhang, Zhenwei Miao, Xiatian Zhu, Jin Gao, Weiming Hu, Yu-Gang Jiang,
AAAI 2023 (Oral)
[paper]
[code]
- ImpDet: Exploring Implicit Fields for 3D Object Detection
Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, Xiangyang Xue,
WACV 2023
[paper]
- Rethinking Local and Global Feature Representation for Dense Prediction
Mohan Chen, Li Zhang, Rui Feng, Xiangyang Xue, Jianfeng Feng,
Pattern Recognition 2023
- DeepInteraction: 3D Object Detection via Modality Interaction
Zeyu Yang, Jiaqi Chen, Zhenwei Miao, Wei Li, Xiatian Zhu, Li Zhang,
NeurIPS 2022 (Spotlight)
[code]
- Dynamic Graph Message Passing Network for Visual Recognition
Li Zhang, Mohan Chen, Anurag Arnab, Xiangyang Xue, Philip H.S. Torr
IEEE TPAMI 2022
[paper]
[project page]
- When, Where and How does it fail? A Spatial-temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving
Junhong Wang, Yun Li, Zhaoyu Zhou, Chengshun Wang, Yijie Hou, Li Zhang, Xiangyang Xue, Michael Kamp, Xiaolong (Luke) Zhang, Siming Chen
IEEE TVCG 2022
- Learning Ego 3D Representation as Ray Tracing
Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang,
ECCV 2022
[code]
[project page]
[demo]
- Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling
Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng,
ECCV 2022
[code]
- FashionViL: Fashion-Focused Vision-and-Language Representation Learning
Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, Yi-Zhe Song, and Tao Xiang,
ECCV 2022
[code]
- RCLane: Relay Chain Prediction for Lane Detection
Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, Xiangyang Xue,
ECCV 2022
- SiamMask: A Framework for Fast Online Object Tracking and Segmentation
Weiming Hu, Qiang Wang, Li Zhang, Luca Bertinetto, Philip H.S. Torr
TPAMI 2022
[code]
- ONCE-3DLanes: Building Monocular 3D Lane Detection
Fan Yan, Ming Nie, Xinyue Cai, Jianhua Han, Hang Xu, Zhen Yang, Chaoqiang Ye, Yanwei Fu, Michael Bi Mi, Li Zhang
CVPR 2022
[project page]
- SOFT: Softmax-free Transformer with Linear Complexity
Jiachen Lu, Jinghan Yao, Junge Zhang, Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing Xu, Tao Xiang, Li Zhang,
NeurIPS 2021 (Spotlight)
[code]
[project page]
- Progressive Coordinate Transforms for Monocular 3D Object Detection
Li Wang, Li Zhang, Yi Zhu, Zhi Zhang, Tong He, Mu Li, Xiangyang Xue,
NeurIPS 2021
[code]
- The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection
Zhikang Zou, Xiaoqing Ye, Liang Du, Xianhui Cheng, Xiao Tan, Li Zhang, Jianfeng Feng, Xiangyang Xue, Errui Ding,
ICCV 2021
- Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer
Zhihe Lu, Sen He, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang,
ICCV 2021
[code]
- Boundary-sensitive Pretraining for Temporal Localization in Videos
Mengmeng Xu, Victor Escorcia, Brais Martínez, Juan-Manuel Perez-Rua, Xiatian Zhu, Li Zhang, Bernard Ghanem, Tao Xiang,
ICCV 2021
- Text-Based Person Search with Limited Data
Xiao Han, Sen He, Li Zhang, Tao Xiang,
BMVC 2021
- Few-shot Action Recognition with Prototype-centered Attentive Learning
Xiatian Zhu, Antoine Toisoul, Juan-Manuel Prez-Ra, Li Zhang, Brais Martinez, Tao Xiang,
BMVC 2021
- Rethinking Local and Global Feature Representation for Semantic Segmentation
Mohan Chen, Xinxuan Zhao, Bingfei Fu, Li Zhang, Xiangyang Xue,
BMVC 2021
- Dual Prior Learning for Blind and Blended Image Restoration
Xin Jin, Li Zhang, Chaowei Shan, Xin Li, Zhibo Chen,
IEEE TIP 2021
- Towards Efficient Scene Understanding via Squeeze Reasoning
Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Zhouchen Lin,
IEEE TIP 2021
[code]
- Global Aggregation then Local Distribution for Scene Parsing
Xiangtai Li, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Xiatian Zhu, Tao Xiang,
IEEE TIP 2021
[code]
- How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning
Yikai Wang, Li Zhang, Yuan Yao, Yanwei Fu
TPAMI 2021
[paper]
[code]
- Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip HS Torr, Li Zhang
CVPR 2021
[paper]
[code]
[project page]
- Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection
Li Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang
CVPR 2021
[paper]
[code]
- Learning Dynamic Alignment via Meta-filter for Few-shot Learning
Chengming Xu, Yanwei Fu, Chen Liu, Chengjie Wang, Jilin Li, Feiyue Huang, Li Zhang, Xiangyang Xue,
CVPR 2021
[code]
- Delving into Data: Effectively Substitute Training for Black-box Attack
Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue
CVPR 2021
- Learning a Few-shot Embedding Model with Contrastive Learning
Chen Liu, Yanwei Fu, Chengming Xu, Siqian Yang, Jilin Li, Chengjie Wang, Li Zhang
AAAI 2021
[code]
- Long-Term Cloth-Changing Person Re-identification
Xuelin Qian, Wenxuan Wang, Li Zhang, Fangrui Zhu, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, Xiangyang Xue
ACCV 2020 (Oral)
[paper]
[project page]
- Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection
Erli Ouyang*, Li Zhang*, Mohan Chen, Anurag Arnab, Yanwei Fu
ACCV 2020
[paper]
- Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition
Yuqian Fu*, Li Zhang*, Junke Wang, Yanwei Fu, Yu-Gang Jiang
ACM MM 2020 (Oral)
[paper]
- Few-shot Action Recognition with Permutation-invariant Attention
Hongguang Zhang, Li Zhang, Xiaojuan Qi, Hongdong Li, Philip H.S. Torr, Piotr Koniusz
ECCV 2020 (Spotlight)
[paper]
- XingGAN for Person Image Generation
Hao Tang, Song Bai, Li Zhang, Philip H.S. Torr, Nicu Sebe
ECCV 2020
[paper]
[code]
- Improving Semantic Segmentation via Decoupled Body and Edge Supervision
Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong
ECCV 2020
[paper]
[code]
- Dynamic Graph Message Passing Network
Li Zhang, Dan Xu, Anurag Arnab, Philip H.S. Torr
CVPR 2020 (Oral)
[paper]
[project page]
- Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
Qibin Hou, Li Zhang, Ming-Ming Cheng, Jiashi Feng
CVPR 2020
[paper]
[code]
- Instance Credibility Inference for Few-Shot Learning
Yikai Wang, Chengming Xu, Chen Liu, Li Zhang, Yanwei Fu
CVPR 2020
[paper]
[code]
- Style Normalization and Restitution for Generalizable Person Re-identification
Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen, Li Zhang
CVPR 2020
[paper]
- Dual Graph Convolutional Network for Semantic Segmentation
Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H.S. Torr
BMVC 2019
[paper]
[code]
- Global Aggregation then Local Distribution in Fully Convolutional Networks
Xiangtai Li, Li Zhang, Ansheng You, Maoke Yang, Kuiyuan Yang, Yunhai Tong
BMVC 2019
[paper]
[code]
- Fast Online Object Tracking and Segmentation: A Unifying Approach
Qiang Wang*, Li Zhang*, Luca Bertinetto*, Weiming Hu, Philip H.S. Torr
CVPR 2019
[paper]
[code]
- Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H.S. Torr, Timothy M. Hospedales
CVPR 2018
[paper]
[FSL code]
[ZSL code]
- Learning a Deep Embedding Model for Zero-Shot Learning
Li Zhang, Tao Xiang, Shaogang Gong
CVPR 2017
[paper]
[code]
[data]
- Learning a Discriminative Null Space for Person Re-identification
Li Zhang, Tao Xiang, Shaogang Gong
CVPR 2016
[paper]
[code]
[data]
[cmc curve]
Workshop papers
- A Unified Efficient Pyramid Transformer for Semantic Segmentation
Fangrui Zhu, Yi Zhu, Li Zhang, Chongruo Wu, Yanwei Fu, Mu Li
ICCV Workshop on Video Scene Parsing in the Wild Challenge, 2021
[code]
- Egocentric Action Recognition by Video Attention and Temporal Context
Juan-Manuel Pérez-Rúa, Antoine Toisoul, Brais Martinez, Victor Escorcia, Li Zhang, Xiatian Zhu, Tao Xiang
EPIC-Kitchens challenges@CVPR 2020
[technical challenges report]
[result]
Win 3rd (Seen Kitchens) and 6th (Unseen Kitchens) in action recognition
- An Embarrassingly Simple Baseline to One-shot Learning
Chen Liu, Chengming Xu, Yikai Wang, Li Zhang, Yanwei Fu
CVPR 2020 workshop on Visual Learning with Limited Labels
[paper]
[code]
- Actor-Critic Sequence Training for Image Captioning
Li Zhang, Flood Sung, Feng Liu, Tao Xiang, Shaogang Gong, Yongxin Yang, Timothy M. Hospedales
NeurIPS 2017 workshop on Visually-Grounded Interaction and Language
[paper]