Yuhang Zang

Hi, I am Yuhang Zang (่‡งๅฎ‡่ˆช), a young researcher at Shanghai AI Laboratory. I obtained my PhD at the MMLab@NTU, Nanyang Technological University in 2023, supervised by Prof. Chen Change Loy. I obtained my Bachelorโ€™s degree at UESTC in 2019.

Research Focus: My current research focuses on (1) post-training for multimodal LLMs, and (2) vision-language pre-training.

News
Selected Papers Full List Scholar
New!
Visual-RFT: Visual Reinforcement Fine-Tuning
Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang
IEEE International Conference on Computer Vision (ICCV), 2025
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang
Findings of the Association for Computational Linguistics (Findings of ACL), 2025
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin
International Conference on Machine Learning (ICML), 2025 Oral
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
Neural Information Processing Systems (NeurIPS), 2024
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations
Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun
Neural Information Processing Systems (NeurIPS), 2024 (Datasets and Benchmarks Track) Spotlight
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization
Yuhang Zang, Hanlin Goh, Josh Susskind, Chen Huang
International Conference on Learning Representations (ICLR), 2024
Contextual Object Detection with Multimodal Large Language Models
Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
International Journal of Computer Vision (IJCV), 2024
Unified Vision and Language Prompt Learning
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy
arXiv 2022
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy
International Journal of Computer Vision (IJCV), 2023
Open-Vocabulary DETR with Conditional Matching
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy
European Conference on Computer Vision (ECCV), 2022 Oral
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation
Yuhang Zang, Chen Huang, Chen Change Loy
IEEE International Conference on Computer Vision (ICCV), 2021
Interns Mentored
Current Interns (3)
Ziyu Liu
Ziyu Liu 2023.10 - Present
PhD Student, Shanghai Jiao Tong University
Visual-RFT ICCV 2025
MIA-DPO ICLR 2025
MMDU NeurIPS 2024 D&B
Xilin Wei
Xilin Wei 2023.10 - Present
PhD Student, Fudan University
VideoRoPE ICML 2025 Oral
Shengyuan Ding
Shengyuan Ding 2024.10 - Present
PhD Student, Fudan University
MM-IFEngine ICCV 2025
Alumni (1)
Yubo Ma
Yubo Ma 2023 - 2024
PhD Candidate, Nanyang Technological University
Light-ColPali Findings of ACL 2025
MMLongbench-Doc NeurIPS 2024 D&B Spotlight
Services
Area Chair:
Conference Reviewer:
Journal Reviewer:
Workshop Organizer: