CUHK-Shenzhen
Oxford University
Shanghai AI Laboratory
Hi, I'm Yiran, a fourth-year Ph.D. student at The Chinese University of Hong Kong, Shenzhen, advised by Prof. Ruimao Zhang. Currently, I am a visiting Ph.D. student at TVG at Oxford University, advised by Prof. Philip Torr. I also work as a research intern at Shanghai AI Laboratory, advised by Prof. Lei Bai. I am honored to collaborate with Prof. Xihui Liu, Dr. Xintao Wang, and my friend Jiwen Yu.
Conference Reviewer for ICLR (2025), CVPR (2024, 2025), ICCV (2025), NeurIPS (2025), ICML (2025), CoRL (2025), ICRA (2024, 2025), IROS (2025), WACV (2025). Workshop Challenge Organizer for MFM-EAI at ICML 2024.
My goal is to address real-world problems by translating cutting-edge research into practical solutions:
- Robot Manipulation, Navigation and Collaborative Simulation (Imitation Learning, Reinforcement Learning)
- Building Real-World Embodied Society with Agents (Spatio-temporal Intelligence, Robotic Planning)
- Video Generation Models as World Simulators (Physics Compliance, Memory Consistency)
Research Framework
Education
- Oxford University, Visiting Ph.D. Student, advised by Prof. Philip Torr, Jun. 2025 - present
- The Chinese University of Hong Kong, Shenzhen, Ph.D. Student, advised by Prof. Ruimao Zhang, Sep. 2021 - present
- The University of Hong Kong, Visiting Ph.D. Student, advised by Prof. Xihui Liu, Mar. 2024 - Jul. 2025
- Shandong University, B.S. in Computer Science, Sep. 2017 - Jul. 2021
Experience
- Shanghai AI Laboratory, Research Intern, advised by Dr. Lei Bai, Apr. 2025 - present
- Kuaishou Kling, Research Intern, advised by Dr. Xintao Wang, Oct. 2024 - Apr. 2025
- Shanghai AI Laboratory, Research Intern, advised by Dr. Jing Shao, Jun. 2023 - Oct. 2024
- NIO, Research Intern, advised by Dr. Ningning Ma, Dec. 2021 - Jun. 2023
News
Selected Publications

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion
Jiahua Ma*, Yiran Qin*†, Yixiong Li, Xuanqi Liao, Yulan Guo, Ruimao Zhang# (* equal contribution, # corresponding author, † project lead)
Conference on Robot Learning (CoRL) 2025

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning
Li Kang*, Xiufeng Song*, Heng Zhou*, Yiran Qin#, Jie Yang, Xiaohong Liu, Philip Torr, Lei Bai#, Zhenfei Yin# (* equal contribution, # corresponding author)
Annual Conference on Neural Information Processing Systems (NeurIPS) 2025

Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval
Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu#, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu# (# corresponding author)
SIGGRAPH Asia 2025

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints
Yiran Qin*, Li Kang*, Xiufeng Song*, Zhenfei Yin#, Xiaohong Liu, Xihui Liu, Ruimao Zhang#, Lei Bai# (* equal contribution, # corresponding author)
International Conference on Computer Vision (ICCV) 2025; Best Paper Award at CVPR 2025 MEIS Workshop

GameFactory: Creating New Games with Generative Interactive Videos
Jiwen Yu*, Yiran Qin*, Xintao Wang#, Pengfei Wan, Di Zhang, Xihui Liu# (* equal contribution, # corresponding author)
International Conference on Computer Vision (ICCV) 2025; Highlight

Interactive Generative Video as Next-Generation Game Engine
Jiwen Yu*, Yiran Qin*, Haoxuan Che, Quande Liu, Xintao Wang#, Pengfei Wan, Di Zhang, Xihui Liu# (* equal contribution, # corresponding author)
arXiv Preprint

WorldSimBench: Towards Video Generation Models as World Simulators
Yiran Qin*, Zhelun Shi*, Jiwen Yu, Xijun Wang, Enshen Zhou, Lijun Li, Zhenfei Yin, Xihui Liu, Lu Sheng, Jing Shao#, Lei Bai#, Ruimao Zhang# (* equal contribution, # corresponding author)
International Conference on Machine Learning (ICML) 2025; Oral at CVPR 2025 WorldModelBench Workshop

NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants
Yiran Qin*, Ao Sun*, Yuze Hong, Benyou Wang, Ruimao Zhang# (* equal contribution, # corresponding author)
International Conference on Robotics and Automation (ICRA) 2025

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou*, Yiran Qin*, Zhenfei Yin†, Yuzhou Huang, Ruimao Zhang#, Lu Sheng#, Yu Qiao, Jing Shao (* equal contribution, # corresponding author, † project lead)
International Conference on Intelligent Robots and Systems (IROS) 2025

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception
Yiran Qin*, Enshen Zhou*, Qichang Liu*, Zhenfei Yin, Lu Sheng#, Ruimao Zhang#, Yu Qiao, Jing Shao† (* equal contribution, # corresponding author, † project lead)
Conference on Computer Vision and Pattern Recognition (CVPR) 2024
