| CARVIEW |
Hi I am Jiarui, a fifth-year PhD student at UC San Diego. I'm honored to be advised by Professor Xiaolong Wang. I received my Bachelor's degree in Computer Science at Hong Kong University of Science and Technology (HKUST). I'm currently a Research Intern at FAIR Labs. Previously I have interned at Google Research, NVIDIA Research, Microsoft Research and OpenMMLab .
News
- New 04/2025: TTT-Video and GSPN are accepted by CVPR 2025.
- 03/2024: PixelLLM is accepted by CVPR 2024.
- 02/2023: ODISE is accepted by CVPR 2023 as highlight.
- 01/2023: GPViT is accepted by ICLR 2023 as spotlight presentation.
- 03/2022: GroupViT is accepted by CVPR 2022.
- 07/2021: VFS is accepted by ICCV 2021 as oral presentation.
Publications show selected / show all by date / show all by topic
Topics: Vision Language / Representation Learning / Others (*: Equal Contribution)
One-Minute Video Generation with Test-Time Training
Karan Dalal*, Daniel Koceja*, Gashon Hussein*, Jiarui Xu*, Yue Zhao†, Youjin Song†, Shihao Han, Ka Chun Cheung, Jan Kautz, Carlos Guestrin, Tatsunori Hashimoto, Sanmi Koyejo, Yejin Choi, Yu Sun, Xiaolong Wang
Conference on Computer Vision and Pattern Recognition (CVPR), 2025project page / arXiv / code
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Hongjun Wang, Wonmin Byeon, Jiarui Xu, Jingwei Gu, Jianwei Yang, Xiaolong Wang, Kai Han, Jan Kautz, Sifei Liu
Conference on Computer Vision and Pattern Recognition (CVPR), 2025 (Spotlight)project page / arXiv / code
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Yu Sun*, Xinhao Li*, Karan Dalal*, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen†, Xiaolong Wang†, Sanmi Koyejo†, Tatsunori Hashimoto†, Carlos Guestrin†
International Conference on Machine Learning (ICML), 2025 (Spotlight)Pixel Aligned Language Models
Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid
Conference on Computer Vision and Pattern Recognition (CVPR), 2024project page / arXiv / code
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang
TMLR, 2024project page / arXiv / code
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu, Sifei Liu*, Arash Vahdat*, Wonmin Byeon, Xiaolong Wang, Shalini De Mello
Conference on Computer Vision and Pattern Recognition (CVPR), 2023 (Highlight)project page / arXiv / code / demo / video
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Chenhongyi Yang*, Jiarui Xu*, Shalini De Mello, Elliot J. Crowley, Xiaolong Wang
International Conference on Learning Representations (ICLR), 2023 (Spotlight)project page / arXiv / code
GroupViT: Semantic Segmentation Emerges from Text Supervision
Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang
Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective
Jiarui Xu, Xiaolong Wang
International Conference on Computer Vision (ICCV), 2021 (Oral)project page / arXiv / code / video
Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time.
Shaowei Liu*, Hanwen Jiang*, Jiarui Xu, Sifei Liu, Xiaolong Wang
Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching
Xuhua Huang*, Jiarui Xu*, Yu-Wing Tai, Chi-Keung Tang
Conference on Computer Vision and Pattern Recognition (CVPR), 2020project page / arXiv / video
Learning to Group: A Bottom-Up Framework for 3D Part Discovery in Unseen Categories
Tiange Luo, Kaichun Mo, Zhiao Huang, Jiarui Xu, Siyu Hu, Liwei Wang, Hao Su
International Conference on Learning Representations (ICLR), 2020project page / arXiv / code
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Yue Cao*, Jiarui Xu*, Steve Lin, Fangyun Wei, Han Hu
International Conference on Computer Vision Workshop (ICCVW), 2019 (Best Paper Award)IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Spatial-Temporal Relation Networks for Multi-Object Tracking
Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu
International Conference on Computer Vision (ICCV), 2019
Deep High Dynamic Range Imaging with Large Foreground Motions
Shangzhe Wu, Jiarui Xu, Yu-Wing Tai, Chi-Keung Tang
European Conference on Computer Vision ( ECCV), 2018project page / arXiv / code