| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Tue, 23 Dec 2025 02:49:40 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"694a0344-158f0"
expires: Mon, 29 Dec 2025 13:40:19 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: D09D:3ABDEF:8CC66B:9E248C:6952826B
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 13:30:19 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210046-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767015019.147817,VS0,VE230
vary: Accept-Encoding
x-fastly-request-id: 784c5f223680e323c408259e87d174e90b182459
content-length: 16023
Yuhang Zang - Home
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Visual-RFT: Visual Reinforcement Fine-Tuning
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations
Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization
Contextual Object Detection with Multimodal Large Language Models
Unified Vision and Language Prompt Learning
Semi-Supervised and Long-Tailed Object Detection with CascadeMatch
Open-Vocabulary DETR with Conditional Matching
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation
Yuhang Zang
Hi, I am Yuhang Zang (臧宇航), a young researcher at Shanghai AI Laboratory. I obtained my PhD at the Nanyang Technological University in 2023, supervised by Prof. Chen Change Loy. I obtained my Bachelor's degree at UESTC in 2019.
I regularly serve as an Area Chair for NeurIPS, ICLR, CVPR, AAAI, and COLM. I also serve as the Action Editor for Transactions on Machine Learning Research (TMLR).
Research Focus: My current research focuses on 1) post-training for multimodal LLMs (reinforcement fine-tuning, reward models), and 2) vision-language pre-training.
News
- [09/2025] UnifiedReward-Think and Hi-Flow were accepted by NeurIPS 2025.New!
- [06/2025] Visual-RFT, MM-IFEngine, X-Prompt, Bootstrap3D, Grounded CoT Highlight, Light-A-Video, MIR, SAM2Long were accepted by ICCV 2025.New!
- [05/2025] IXC-2.5-Reward and Light-ColPali were accepted by Findings of ACL 2025.New!
- [05/2025] VideoRoPE Oral and SongGen were accepted by ICML 2025.
- [02/2025] ByTheWay, OVO-Bench, Dispider, PyramidDrop and WildAvatar were accepted by CVPR 2025.
- [01/2025] MIA-DPO and MotionClone were accepted by ICLR 2025.
- [09/2024] MMLongbench-Doc Spotlight, ShareGPT4Video and MMDU were accepted by NeurIPS 2024 DB Track.
- [09/2024] InternLM-XC2-4khd, VideoStreaming and MMStar were accepted by NeurIPS 2024.
- [08/2024] VLMEvalKit was accepted by ACM MM 2024 Open Source Software Competition.
- [07/2024] Long-CLIP and MVSGaussian were accepted by ECCV 2024.
- [02/2024] Alpha-CLIP was accepted by CVPR 2024.
- [01/2024] My Apple internship project, O-GEN, was accepted by ICLR 2024.
- [06/2023] I joined
Apple (AI/ML) as a research intern.
- [12/2022] CascadeMatch was accepted by IJCV.
- [07/2022] OV-DETR was accepted by ECCV 2022 Oral.
Selected Papers Full List Scholar
Topics:
Reinforcement Learning from Human Feedback,
Multimodal Large Language Models,
Vision-Language Models,
Image Understanding
Equal contribution
Corresponding author
New!
Yibin Wang, Zhimin Li, Yuhang Zang, Chunyu Wang, Qinglin Lu, Cheng Jin, Jiaqi Wang
Neural Information Processing Systems (NeurIPS), 2025
New!
Ziyu Liu, Zeyi Sun, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Haodong Duan, Dahua Lin, Jiaqi Wang
IEEE International Conference on Computer Vision (ICCV), 2025
Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang
Findings of the Association for Computational Linguistics (Findings of ACL), 2025
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin
International Conference on Machine Learning (ICML), 2025 Oral
Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang
Neural Information Processing Systems (NeurIPS), 2024
Yubo Ma, Yuhang Zang, Liangyu Chen, Meiqi Chen, Yizhu Jiao, Xinze Li, Xinyuan Lu, Ziyu Liu, Yan Ma, Xiaoyi Dong, Pan Zhang, Liangming Pan, Yu-Gang Jiang, Jiaqi Wang, Yixin Cao, Aixin Sun
Neural Information Processing Systems (NeurIPS), 2024 (Datasets and Benchmarks Track) Spotlight
Yuhang Zang, Hanlin Goh, Josh Susskind, Chen Huang
International Conference on Learning Representations (ICLR), 2024
Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
International Journal of Computer Vision (IJCV), 2024
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy
arXiv 2022
Yuhang Zang, Kaiyang Zhou, Chen Huang, Chen Change Loy
International Journal of Computer Vision (IJCV), 2023
Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy
European Conference on Computer Vision (ECCV), 2022 Oral
Yuhang Zang, Chen Huang, Chen Change Loy
IEEE International Conference on Computer Vision (ICCV), 2021
Services
Area Chair / Senior Program Committee:
Action Editor:
Journal Reviewer:
Workshop Organizer:
Awards
Influential Paper (Paperdigest)
Visual-RFT:
Most Influential ArXiv CV 2025: #5 in 2025-09 Version
2025
Influential Paper (Paperdigest)
2024
Influential Paper (Paperdigest)
InternLM-XComposer2:
Most Influential ArXiv CV 2024: #10 in 2024-10 Version
2024
2025
2020
2019