| CARVIEW |
|
Shijie Wang (王世杰) |
Short Bio
I am a final year CS Ph.D. student at Brown University working with Prof. Chen Sun. Previously, I also worked at Google and Meta as a research intern. I obtained my bachelor degree in software engineering at Tsinghua University in 2021.My research interests involve building physically grounded, reasoning-capable vision-language models and exploring their effective integration into the physical world. Feel free to contact me for collaborations and casual chats. I'm actively looking for industry full-time opportunities in 2026.
Education
- 09/2021 - NOW Ph.D. in Department of Computer Science, Brown University
- 08/2016 - 06/2021 B.S. in School of Software, Tsinghua University. (Outstanding Undergrad)
News
- 05/2025, I will join Salesforce AI Research as a research intern.
- 05/2024, I joined Meta GenAI as a research scientist intern.
- 10/2023, I started to work at Google DeepMind as a Student Researcher.
- 06/2023, I became a PhD candidate.
- 10/2022, I got 3rd prize in Ego4D Object State Change Classification Challenge.
- 05/2022, I started to work at Google as a Student Researcher.
- 08/2021, I moved to Providence and started my PhD career at Brown University!
- 06/2021, I graduated from Tsinghua University as an outstanding undergrad!
- 03/2021, My first paper was accepted in CVPR 2021!
Selected Papers
[9] MotiF: Making Text Count in Image Animation with Motion Focal Loss
[Link]
[Website]
[Benchmark]
Shijie Wang, Samaneh Azadi, Rohit Girdhar, Sai Saketh Rambhatla, Chen Sun, and Xi Yin
CVPR 2025
[8] How Can Objects Help Video-Language Understanding?
[Link]
Zitian Tang, Shijie Wang, Junho Cho, Jaewook Yoo, and Chen Sun
ICCV 2025
[7] Learning Visual Grounding from Generative Vision and Language Model
[Link]
Shijie Wang, Dahun Kim, Ali Taalimi, Chen Sun, and Weicheng Kuo
WACV 2025
[6] Vamos: Versatile Action Models for Video Understanding
[Link]
[Website]
[Code]
Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ECCV 2024
[5] Do Pre-trained Vision-Language Models Encode Object States?
[Link]
Kaleb Newman, Shijie Wang, Yuan Zang, David Heffren, and Chen Sun
ECCV 2024 Workshop EVAL-FoMo
[4] AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
[Link]
[Website]
[Code]
Qi Zhao*, Shijie Wang*, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ICLR 2024
[3] Object-centric Video Representation for Long-term Action Anticipation
[Link]
[Code]
Ce Zhang*, Changcheng Fu*, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, and Chen Sun
WACV 2024
[2] Goal-Conditioned Predictive Coding as an Implicit Planner for Offline Reinforcement Learning
[Link]
[Website]
[Code]
Zilai Zeng, Ce Zhang, Shijie Wang, and Chen Sun
NeurIPS 2023
[1] Pose Recognition with Cascade Transformers
[Link]
[Code]
Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, and Zhuowen Tu
CVPR 2021
Experience
- 06/2025 - NOW Research Intern at Salesforce AI Research with Dr. Juan Carlos Niebles and Dr. Honglu Zhou.
- 05/2024 - 11/2024 Research Scientist Intern at Meta GenAI with Dr. Xi Yin.
- 09/2023 - 03/2024 Student Researcher at Google DeepMind with Dr. Weicheng Kuo.
- 05/2022 - 12/2022 Student Researcher at Google Research with Dr. Yin Cui.
- 07/2020 - 03/2021 Research Assistant at UCSD with Prof. Zhuowen Tu.
- 07/2019 - 09/2019 Machine Learning Intern, Kwai.
- 12/2018 - 06/2020 Research Assistant at Tsinghua with Prof. Mingsheng Long.
Awards
- 2022, 3rd Prize of Ego4D Object State Change Classification Challenge, ECCV 2022.
- 2021, Outstanding Undergrad Awards, Tsinghua University.
- 2018 & 2019 & 2020, Scholarship for Academic Excellence, Tsinghua University.
- 2019, First Prize in Student Research Training Program, Tsinghua University.
- 2019, Member of Tsinghua University Initiative Scientific Research Program (funding: 30,000¥).
- 2018, Champion of Yuehan Ma Campus Football Cup, Tsinghua University.
Service
Reviewer:
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- International Journal of Computer Vision (IJCV)
- The International Conference on Learning Representations (ICLR) 2024, 2025
- The International Conference on Machine Learning (ICML) 2024
- The Conference on Neural Information Processing Systems (NeurIPS) 2023, 2024, 2025
- The Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2023, 2025
- The International Conference on Computer Vision (ICCV) 2023
- The European Conference on Computer Vision (ECCV) 2022, 2024
- AAAI Conference on Artificial Intelligence (AAAI) 2023, 2024
- Winter Conference on Applications of Computer Vision (WACV) 2023, 2024
Part of the page is generated by jemdoc.
Last Updated: .
