| CARVIEW |
Bachelor degree (2015-2019)
北京邮电大学 Beijing University of Posts and Telecommunications
伦敦大学玛丽女王学院Queen Mary University of London
Master graduate (2019-2022)
北京邮电大学 Beijing University of Posts and Telecommunications
Pattern Recognition and Intelligent Systems Lab (PRIS Lab)
Employee (2022-2023)
OPPO研究院 OPPO Research Institute
Supervised by OPPO Chief Scientist - Yandong Guo
P.h.D candidate (2023-Now)
北京大学 Peking University, School of Computer Science
National Key Laboratory for Multimedia Information Processing
Email: jiamingliu@stu.pku.edu.cn
About Me (Google Scholar)
Supervisor
We have several academic visitor and intern positions at HMI Lab (Peking University). We actively work on Robotics, Multi-Modal Learning, and 3D Vision. If you like what we do, don't hesitate to contact me.
Research Interests
-
My past research mainly focuses on Autonomous Driving, Out Of Distribution, and Neural Video Delivery. My current research direction is Robotic manipulation, Vision-Language-Action model, and Post-training.
News
- 2025: 人才托举:中国科协“青年人才托举工程博士生专项计划”.
- 2025: 国自然项目:青年学生基础研究项目(项目负责人)《机器人长程移动操纵的全身控制大模型方法研究》.
- 2025: Two papers were accepted by NeurIPS2025 (Vision-Language-Action Model + Mobile manipulation).
- 2025: Two papers were accepted by RSS2025 (Robotic manipulation dataset + Dexterous hand manipulation).
- 2025: Three papers were accepted by CVPR2025 (3D Robotic manipulation + Vision-Language-Action Model).
- 2025: One papers was accepted by AAAI2025 (Multimodal Large Language Models + Autonomous Driving).
- 2025: Two papers were accepted by ICLR2025 (Multimodal Large Language Models + Math).
- 2024: One paper was accepted by Neurips2024 (Robotic manipulation + Multimodal Large Language Models).
- 2024: Two papers were accepted by ECCV2024 (Multimodal Large Language Models + 3D large-scale model).
Publications (First author or Project leader) x 20
AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation
Sixiang Chen, Jiaming Liu (equal contribution + project lead), Siyuan Qian, Han Jiang, Lily Li, Renrui Zhang, Zhuoyang Liu, Chenyang Gu, Chengkai Hou, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang
[PDF] [Web page] [Code]
Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning
Hao Chen, Jiaming Liu (equal contribution + project lead), Chenyang Gu, Zhuoyang Liu, Renrui Zhang, Xiaoqi Li, Xiao He, Yandong Guo, Chi-Wing Fu, Shanghang Zhang, Pheng-Ann Heng
[PDF] [Web page] [Code]
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Jiaming Liu, Hao Chen, Pengju An, Zhuoyang Liu, Renrui Zhang, Chenyang Gu, Xiaoqi Li, ..., Pheng-Ann Heng, Shanghang Zhang
[PDF] [Web page] [Code]
Robomind: Benchmark on multi-embodiment intelligence normative data for robot manipulation
Kun Wu, Chengkai Hou, Jiaming Liu (equal contribution), Zhengping Che, Xiaozhu Ju, ..., Shanghang Zhang, Jian Tang
[PDF] [Web page] [Data]
Lift3d foundation policy: Lifting 2d large-scale pretrained models for robust 3d robotic manipulation
Yueru Jia, Jiaming Liu (equal contribution), Sixiang Chen, Chenyang Gu, Zhilue Wang, Longzan Luo, Lily Lee, Pengwei Wang, Zhongyuan Wang, Renrui Zhang, Shanghang Zhang
[PDF] [Web page] [Code]
Lidar-llm: Exploring the potential of large language models for 3d lidar understanding
Senqiao Yang, Jiaming Liu (equal contribution), Ray Zhang, Mingjie Pan, Zoey Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Yandong Guo, Shanghang Zhang
[PDF] [Code]
Robomamba: Efficient vision-language-action model for robotic reasoning and manipulation
Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Pengju An, Xiaoqi Li, Kaichen Zhou, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang
[PDF] [Web page] [Code(test)]
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
Yiwen Tang, Jiaming Liu (equal contribution), Dong Wang, Zhigang Wang, Shanghang Zhang, Bin Zhao, Xuelong Li
[PDF] [Code]
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang
[PDF] [Web page]
Cloud-Device Collaborative Learning for Multimodal Large Language Models
Guanqun Wang, Jiaming Liu (equal contribution), Chenxuan Li, Yuan Zhang, Junpeng Ma, Xinyu Wei, Kevin Zhang, Maurice Chong, Renrui Zhang, Yijiang Liu, Shanghang Zhang
[PDF]
Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer
Jiaming Liu, Qizhe Zhang, Xiaoqi Li Jianing Li, Ming Lu, Tiejun Huang, Shanghang Zhang
[PDF] [Code]
Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection
Jiaming Liu, Rongyu Zhang, Xiaowei Chi, Xiaoqi Li, Ming Lu, Yandong Guo, Shanghang Zhang
[PDF] [Code]
Renderocc: Vision-centric 3d occupancy prediction with 2d rendering supervision
Mingjie Pan, Jiaming Liu (equal contribution), Renrui Zhang, Peixiang Huang, Xiaoqi Li, Li Liu, Shanghang Zhang
[PDF] [Code]
Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation
Jiayi Ni, Senqiao Yang, Jiaming Liu (project leader), Xiaoqi Li, Wenyu Jiao, Ran Xu, Zehui Chen, Yi Liu, Shanghang Zhang
[PDF]
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Jiaming Liu, Senqiao Yang, Peidong Jia, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang
[PDF] [Code]
Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction
Senqiao Yang, Jiarui Wu, Jiaming Liu (project leader), Xiaoqi Li, Qizhe Zhang, Mingjie Pan, Shanghang Zhang
RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery
Rongyu Zhang, Lixuan Du, Jiaming Liu (project leader), Xiaoqi Li, Ming Lu, Yandong Guo, Shanghang Zhang
[PDF]
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks
Xiaowei Chi, Jiaming Liu (equal contribution), Ming Lu, Rongyu Zhang, Zhaoqing Wang, Yandong Guo, Shanghang Zhang
Adaptive Patch Exiting for Scalable Single Image Super-Resolution
Shizun Wang, Jiaming Liu (equal contribution), Kaixin Chen, Xiaoqi Li, Ming Lu, Yandong Guo
Efficient Meta-Tuning for Content-aware Neural Video Delivery
Xiaoqi Li, Jiaming Liu (equal contribution), Shizun Wang, Cheng Lyu, Ming Lu, Yurong Chen, Anbang Yao, Yandong Guo, Shanghang Zhang
Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation
Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen, Chuang Zhang, Ming Wu
Publications (Coauthor)
CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World
Yankai Fu, Qiuxuan Feng, Ning Chen, Zichen Zhou, Mengzhen Liu, ..., Jiaming Liu, Hao Dong, Shanghang Zhang
[PDF] [Web page] [Code]
Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
Xiaoqi Li, Lingyun Xu, Mingxu Zhang, Jiaming Liu, Yan Shen, Iaroslav Ponomarenko, Jiahui Xu, Liang Heng, Siyuan Huang, Shanghang Zhang, Hao Dong
[PDF] [Web page]
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete
Yuheng Ji, Huajie Tan, Jiayu Shi, Xiaoshuai Hao, Yuan Zhang, Hengyuan Zhang, Pengwei Wang, Mengdi Zhao, ..., Jiaming Liu, Zhongyuan Wang, Shanghang Zhang
[PDF] [Web page]
Mavis: Mathematical visual instruction tuning with an automatic data engine
Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Ziyu Guo, Shicheng Li, Yichi Zhang, Chengzhuo Tong, Jiaming Liu, ..., Shanghang Zhang, Peng Gao, Chunyuan Li, Hongsheng Li
[PDF] [Code]
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model
Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang
[PDF]
Manipllm: Embodied multimodal large language model for object-centric robotic manipulation
Xiaoqi Li, Mingxu Zhang, Yiran Geng, Haoran Geng, Yuxing Long, Yan Shen, Renrui Zhang, Jiaming Liu, Hao Dong
[PDF] [Web page]
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao
[PDF]
FreeKD: Knowledge Distillation via Semantic Frequency Prompt
Yuan Zhang, Tao Huang, Jiaming Liu, Tao Jiang, Kuan Cheng, Shanghang Zhang
[PDF]
NTO3D: Neural Target Object 3D Reconstruction with Segment Anything
Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang
[PDF]
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
Rongyu Zhang, Yulin Luo, Jiaming Liu, Huanrui Yang, Zhen Dong, Denis Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Yuan Du, Shanghang Zhang
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection
Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao
Awards
1st Workshop on Visual Continual Learning, @ICCV 2023
Jiaming Liu (Peking University), Ran Xu, Senqiao Yang, Peidong, Jia, Jiayi Ni
1st Workshop on Visual Continual Learning, @ICCV 2023
Zehui Chen (USTC), Jiaming Liu (Peking University)
UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering
Mingjie Pan (Xiaomi Car), Jiaming Liu (Peking University)
Last Updated on 25th June, 2025