HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Mon, 03 Mar 2025 08:51:14 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"67c56d82-6f43"
expires: Sun, 28 Dec 2025 10:20:16 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: A10D:1F53DD:77F0D2:8669F8:69510208
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 10:10:16 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210097-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766916617.691344,VS0,VE206
vary: Accept-Encoding
x-fastly-request-id: 16be0d3bce1b0ba3b8a5596bd93bf5a41d7f865b
content-length: 5785
👋 About me - Peihao Chen’s Homepage I am now a Researcher at Robotics X Lab, Tencent . Previously, I obtained my bachelor’s and PhD degrees from South China University of Technology, advised by Prof. Mingkui Tan and Prof. Chuang Gan . I was fortunate to be a visiting scholar at MIT-IBM Watson AI Lab and UMass Amherst. I engage in developing an agent that can understand and interact with the multi-modal world. Toward this goal, my research mainly focuses on:
Embodied AI : Robot Manipulation; Visual NavigationMulti-Modal Video Understanding : Self-Supervised Video Representation Learning; Temporal Action Localization; Visually-Aligned Sound Generation
🗞️ News 2025.02: Two papers are accepted by CVPR 2025 2024.11: One paper about TTA for Navigation is accepted by TMM 2024.09: FlexAttention is accepted by ECCV 2024 2024.07: Happy to start my new journey at Robotics X Lab 2024.05: 3D-VLA is accepted by ICML 2024 2024.02: Two papers are accepted by CVPR 2024 2024.01: One papers is accepted by ICLR 2024 2023.09: Two papers are accepted by NeurIPS 2023 and one is seleceted as Spotlight ! 2023.09: Happy to join UMass Amherst as a visiting scholar working closely with Prof. Chuang Gan! 2023.07: One paper is accepted by ICCV 2023 ! 2023.06: Happy to join MIT-IBM Watson Lab for intership! 2023.02: One paper is accepted by CVPR 2023 ! 2023.02: The code for MGMap and ActiveCamera is now available. 2022.11: Two NeurIPS 2022 papers are selected as Spotlight ! 2022.10: Two papers are accepted by NeurIPS 2022 ! 2021.01: One paper is accepted by AAAI 2021 !
Conferences Flexattention for efficient high-resolution vision-language models
Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan
ECCV 2024
Pdf BibTex Project Page Code 3D-VLA: 3D Vision-Language-Action Generative World Model
Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan
ICML 2024
Pdf BibTex Project Page Code MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Zhenfang Chen, Chuang Gan
CVPR 2024
Pdf BibTex Project Page RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation
Zeyuan Yang, Jiageng Liu, Peihao Chen, Anoop Cherian, Tim K. Marks, Jonathan Le Roux, Chuang Gan
CVPR 2024
Pdf CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan
ICLR 2024
Pdf BibTex Project Page A2Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models
Peihao Chen, Xinyu Sun, Hongyan Zhi, Runhao Zeng, Thomas H. Li, Gaowen Liu, Mingkui Tan, Chuang Gan
NeurIPS Workshop 2023
Pdf 3D-LLM: Injecting the 3D World into Large Language Models
Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan
NeurIPS 2023 (Spotlight)
Pdf BibTex Project Page Code FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
Xinyu Sun, Peihao Chen, Jugang Fan, Jian Chen, Thomas H. Li, Mingkui Tan
NeurIPS 2023
Pdf BibTex Learning Vision-and-Language Navigation from YouTube Videos
Kunyang Lin*, Peihao Chen*, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan
ICCV 2023
Pdf BibTex Masked Motion Encoding for Self-Supervised Video Representation Learning
Xinyu Sun*, Peihao Chen*, Liangwei Chen, Changhao Li, Thomas H Li, Mingkui Tan, Chuang Gan
CVPR 2023
Pdf BibTex Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
Peihao Chen*, Dongyu Ji*, Kunyang Lin, Runhao Zeng, Thomas H Li, Mingkui Tan, Chuang Gan
NeurIPS 2022 (Spotlight)
Pdf BibTex Project Page Code Learning Active Camera for Multi-Object Navigation
Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H Li, Mingkui Tan, Chuang Gan
NeurIPS 2022 (Spotlight)
Pdf BibTex Code RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning
Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan
AAAI 2021
Pdf BibTex Code Foley Music: Learning to Generate Music from Videos
Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba
ECCV 2020
Pdf BibTex Project Page Dense Regression Network for Video Grounding
Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan
CVPR 2020
Pdf BibTex Code Location-aware Graph Convolutional Networks for Video Question Answering
Deng Huang*, Peihao Chen*, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan
AAAI 2020
Pdf BibTex Code Self-supervised Moving Vehicle Tracking with Stereo Sound
Chuang Gan, Hang Zhao, Peihao Chen, David Cox, Antonio Torralba
ICCV 2019
Pdf BibTex Project Page
Journals Generating Visually Aligned Sound from Videos
Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, and Chuang Gan
TIP 2020
Pdf BibTex Code Relation Attention for Temporal Action Localization
Chen Peihao, Gan Chuang, Shen Guangyao, Huang Wenbing, Zeng Runhao, Tan Mingkui
IEEE TMM 2019
Pdf BibTex Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization
Runhao Zeng, Chuang Gan, Peihao Chen, Wenbing Huang, Qingyao Wu, Mingkui Tan
IEEE Trans. Image Processing 28(12) 2019
Pdf BibTex
🏆 Award 2023: The Principle’s Scholarship of SCUT2020: The Principle’s Scholarship of SCUT2018: The First Prize Scholarship of SCUT2017: The Second Prize of the NXP Cup National University Students Intelligent Car Race