| CARVIEW |
Chaohui Yu
DAMO Academy, Alibaba Group
- superhuiych [at] gmail.com
- Google Scholar
I'm an algorithm engineer at DAMO Academy, Alibaba Group. Before this, I got my Master degree and Bachelor degree from Institute of Computing Technology (ICT) and Shandong University in 2020 and 2017, respectively. My research interest includes: Transfer Learning, Object Detection/Segmentation, Semi/Self-supervised Learning, Multimodal Learning, image/video/3D/4D Generation, and related applications.
CamPVG: Camera-Controlled Panoramic Video Generation with Epipolar-Aware Diffusion
Chenhao Ji, Chaohui Yu, Junyao Gao, Fan Wang, Cairong Zhao.
Siggraph Asia 2025
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Chenjie Cao, Jingkai Zhou, Shikai Li, Jingyun Liang, Chaohui Yu*, Fan Wang, Yanwei Fu, Xiangyang Xue.
Siggraph Asia 2025
WorldVLA: Towards Autoregressive Action World Model
Jun Cen, Chaohui Yu, et al.
Preprint
3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models
Min Wei, Chaohui Yu*, Jingkai Zhou, Fan Wang.
The ACM International Conference on Multimedia. (ACMMM-25)
AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation
Zijie Wu, Chaohui Yu, Fan Wang, Xiang Bai.
The International Conference on Computer Vision. (ICCV-25)
Yisu Zhang, Chenjie Cao, Chaohui Yu, Jianke Zhu.
The International Conference on Computer Vision. (ICCV-25)
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
Chenjie Cao, Chaohui Yu, Shang Liu, Fan Wang, Xiangyang Xue, Yanwei Fu.
The Conference on Computer Vision and Pattern Recognition (CVPR-25)
LPM: Efficient 3D Content Creation from Single Image by Large-Scale Partial 3D Modeling.
Yisu Zhang, Chaohui Yu, Fan Wang, Jianke Zhu.
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT-25)
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Yanqin Jiang*, Chaohui Yu*, Chenjie Cao, Fan Wang, Weiming Hu, Jin Gao.
Neural Information Processing Systems. (NeurIPS-24)
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
Chenjie Cao, Chaohui Yu, Yanwei Fu, Fan Wang, Xiangyang Xue.
Neural Information Processing Systems. (NeurIPS-24)
SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer
Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai.
European Conference on Computer Vision. (ECCV-24)
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu, Chaohui Yu, Chenjie Cao, Wen Qian, Fan Wang
European Conference on Computer Vision. (ECCV-24)
MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis
Ziming Zhong, Yanxu Xu, Jing Li, Jiale Xu, Zhengxin Li, Chaohui Yu, Shenghua Gao
European Conference on Computer Vision. (ECCV-24)
Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation
Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang
The 31th ACM International Conference on Multimedia. (ACMMM-23)
RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension
Qiang Zhou, Chaohui Yu, Shaofeng Zhang, Sitong Wu, Zhibin Wang, Fan Wang
Preprint
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation
Chaohui Yu, Qiang Zhou, Jingliang Li, Jianlong Yuan, Zhibin Wang, Fan Wang
The Conference on Computer Vision and Pattern Recognition (CVPR-23)
LMSeg: Language-guided Multi-dataset Segmentation
Qiang Zhou, Yuang Liu, Chaohui Yu, Jingliang Li, Zhibin Wang, Fan Wang
The International Conference on Learning Representations 2023. (ICLR-23)
MimCo: Masked Image Modeling Pre-training with Contrastive Teacher
Qiang Zhou, Chaohui Yu, Hao Luo, Zhibin Wang, Hao Li
The 30th ACM International Conference on Multimedia. (ACMMM-22)