|
Panwang Pan | ๆฝๆๆ
I am currently employed as a Senior Researcher at PICO within ByteDance Ltd. Previously, I held the position of Senior Algorithm Engineer at Alibaba Cloud.
In 2019, I earned my Master's degree from Xiamen University, where I was enrolled in the School of Informatics.
My research focuses on the intersection of generative models and multi-modal representation learning. These contributions have been deployed in real-world systems, including embedded XR devices and large-scale platforms like the Aliyun Cloud AI-Box.
I welcome opportunities for coffee chats and collaborations. Please feel free to reach out!
Email
 / 
Google Scholar
 / 
Github
 / 
Twitter
 / 
Wechat
|
|
๐ข Latest News
[2025-06] InstructLayout was accepted to T-PAMI 2025 ๐ .
[2025-06] InfoBridge was accepted to ICCV 2025 ๐ .
[2025-06] We released PartCrafter , a 3D-native DiT model designed to generate 3D objects in modular parts ๐.
[2025-02] One paper about VLM + RRHF (JarvisIR) was accepted to CVPR 2025 ๐ .
[2025-01] 4K4DGEN was selected as ICLR25 Spotlight, top 3.2% among 11672 ๐.
[2025-01] Three papers about 3D/4D Generative Models (InstantSplamp & DiffSplat & 4K4DGEN) were accepted to ICLR 2025๐.
[2024-09] One paper about generalizable single-view human reconstruction (HumanSplat) was accepted to NeurIPS 2024 ๐ .
[2024-09] One paper about VLM Distillation (MRD) was accepted to ECCV 2024 ๐ .
|
๐ Selected Publications (
Google Scholar
)
* Equal contribution, โ Project leader, โก Corresponding author
|
Generative AI
ICLR 2025 ๐ spotlight ๐
|
4K4DGEN: Panoramic 4D Generation at 4K Resolution
Panwang Pan*โก, Renjie Li*, Bangbang Yang, Dejia Xu, Shijie Zhou, Xuanyang Zhang,
Zeming Li, Achuta Kadambi, Zhangyang Wang, Zhengzhong Tu, Zhiwen Fan
[Openreview]
[Paper]
[Project]
[Code]
4K4DGEN achieves high-quality Panorama-to-4D generation at a resolution of 4K for the first time using efficient splatting techniques for real-time exploration.
|
|
|
DynamicVerse: Physically-Aware Multimodal Modeling for Dynamic 4D Worlds
Kairun Wen, Yuzhi Huang, Runyu Chen, Hui Zheng, Yunlong Lin, Panwang Pan, Chenxin Li, Wenyan Cong, Jian Zhang, Junbin Lu, Chenguo Lin, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Yue Huang, Xinghao Ding, Rakesh Ranjan, Zhiwen Fan
[Paper]
[Project]
[Code]
DynamicVerse is a physicalโscale, multimodal 4D modeling framework for real-world video.
|
Multi-modal Learning
NeurIPS 2025
|
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
Yunlong Lin, Zixu Lin, Kunjie Lin, Jinbin Bai, Panwang Pan, Chenxin Li, Haoyu Chen, Zhongdao Wang, Xinghao Dingโก, Wenbo Li, Shuicheng Yanโก
[Paper]
[Project]
[Code]
JarvisArt outperforms GPT-4o with a 60% improvement in average pixel-level metrics on MMArt-Bench for content fidelity, while maintaining comparable instruction-following capabilities.
|
ByteDance Ltd, Beijing, China, Senior Computer Vision Algorithm Engineer, advised by Cheng Chen and Zeming Li.
|
08/2022 - Present |
Alibaba Cloud, Hangzhou, China, Senior Computer Vision Algorithm Engineer
|
07/2019 - 07/2022 |
DevTech Compute, NVIDIA, Beijing, China,
AI Developer Technology Engineer Intern
advised by Xipeng Li .
|
07/2018 - 10/2018 |
๐ Selected Awards
2024: โStar Team Awardโ Innovation Breakthrough Award, Bytedance
2023: โStar Team Awardโ Innovation Breakthrough Award, Bytedance
2022: ByteStyle Award, Bytedance
2019: Outstanding Graduates of Xiamen University
2018: National Scholarship for Postgraduates, Ministry of Education
2018: First Prize of GEDC, Second Prize of MCM & CPIPC
2017: ZhongXian Huang Scholarship, Xiamen University (about 10 awards per year)
2015: National Scholarship for Undergraduates (the highest honor scholarship in China)
|
๐ฌ Miscellaneous
Conference Reviewer: NeurIPS, ICLR, CVPR, ICML, ICCV, ACM MM
|
|