| CARVIEW |
Hao Chen 
I am currently a Research Scientist at ByteDance. Before this, I worked as a Postdoctoral Researcher at Meta. I earned my Ph.D. in Computer Science from the University of Maryland, College Park, under the guidance of Prof. Abhinav Shrivastava. Prior to this, I completed my Master's degree in Pattern Recognition & Intelligent Systems at Huazhong University of Science & Technology (HUST), supervised by Prof. Guoyou Wang. I also hold a Bachelor's degree from the School of Optical and Electronic Information at HUST.
Growing Visual Generative Capacity for Pre-Trained MLLMs
Hanyu Wang, Jiaming Han, Ziyan Yang, Qi Zhao, Shanchuan Lin, Xiangyu Yue, Abhinav Shrivastava, Zhenheng Yang, Hao Chen†
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
NeurIPS 2025
Jiamin Han, Hao Chen†, Yang Zhao, Hanyu Wang, Qi Zhao, Zhiyan Yang, Hao He, Xiangyu Yue, Lu Jiang
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
ByteDance Seed VideoGen Team
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
ICLR 2025 (Oral)
Hanyu Wang, Saurabh Suri, Yixuan Ren, Hao Chen†, Abhinav Shrivastava
NeRV-Diffusion: Diffuse Implicit Neural Representations for Video Synthesis
Yixuan Ren, Hanyu Wang, Hao Chen†, Bo He, Abhinav Shrivastava
Seedance 1.0: Exploring the Boundaries of Video Generation Models
ByteDance Seed VideoGen Team
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
Jianyi Wang, Shanchuan Lin, Zhijie Lin, Yuxi Ren, Meng Wei, Zongsheng Yue, Shangchen Zhou, Hao Chen, Yang Zhao, Ceyuan Yang, Xuefeng Xiao, Chen Change Loy, Lu Jiang
SkipSR: Faster Super-Resolution with Token Skipping
Rohan Choudhury, Shanchuan Lin, Jianyi Wang, Hao Chen, Qi Zhao, Feng Cheng, Lu Jiang, Kris Kitani, Laszlo A. Jeni

Fast Encoding and Decoding for Implicit Video Representation
ECCV 2024
Hao Chen, Saining Xie, Ser-Nam Lim, Abhinav Shrivastava
We propose NeRV-Enc, which encodes videos 104 times faster than its predecessor NeRV, utilizing hyper-networks. Additionally, we introduce NeRV-Dec, which decodes video 8.9 times faster than NeRV via parallel decoding, and is 11 times faster compared to the H.264 codec.

HNeRV: A Hybrid Neural Representation for Videos
CVPR 2023
Hao Chen, Matt Gwilliam, Ser-Nam Lim, Abhinav Shrivastava
We propose a hybrid video neural representation and a evenly distributed neural network to improve modeling capacity and introduce internal generalization.

Towards Scalable Neural Representation for Diverse Videos
CVPR 2023
Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
We propose D-NeRV, a novel neural representation framework designed to encode large-scale and diverse videos.

CNeRV: Content-adaptive Neural Representation for Visual Data
BMVC 2022 (Oral)
Hao Chen, Matt Gwilliam, Bo He, Ser-Nam Lim, Abhinav Shrivastava
We propose a hybrid video neural representation with content-adaptive embedding to introduce internal generalization.

NeRV: Neural Representations for Videos
NeurIPS 2021
Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
We propose an image-wise neural representation for videos, which achieves good compression results and fast decoding speed.

The Lottery Ticket Hypothesis for Object Recognition
CVPR 2021
Sharath Girish, Shishira R. Maiya, Kamal Gupta, Hao Chen, Larry Davis, Abhinav Shrivastava


HR-RCNN: Hierarchical Relational Reasoning for Object Detection
BMVC 2021
Hao Chen, Abhinav Shrivastava


LSTD: A Low-Shot Transfer Detector for Object Detection
AAAI 2018 (Spotlight)
Hao Chen, Yali Wang, Guoyou Wang, Yu Qiao
Acknowledgements
I appreciate everyone who helped me or encouraged me throughout my life, especially Prof. Abhinav Shrivastava, Prof. Guoyou Wang, Prof. Yu Qiao, Prof. Yali Wang, Prof. Xiang Bai and all good friends I met in China and the US.
The website template was borrowed from Ben Mildenhall.