| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Mon, 24 Nov 2025 17:58:20 GMT
access-control-allow-origin: *
etag: W/"69249cbc-826a"
expires: Mon, 29 Dec 2025 08:03:42 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 6D0E:292AC1:87AB7D:9870A2:69523386
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 07:53:42 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210038-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766994822.167791,VS0,VE212
vary: Accept-Encoding
x-fastly-request-id: ef822fb34277a10cfd68c5a02378895fd7ef5dce
content-length: 6889
Jie Lei
Jie Lei
Deception Pass, Washington, Sept 2025 |
I am a research scientist at Fundamental AI Research (FAIR), Meta. My primary research interests are multimodal learning and video modeling. I received my PhD in Computer Science from UNC Chapel Hill in 2022, advised by Tamara L. Berg and Mohit Bansal. I received my bachelor's degree in Computer Science from Yingcai Honors College, University of Electronic Science and Technology of China (UESTC) in 2017. I am a receipt of the Adobe Research Fellowship and the CVPR 2021 Best Student Paper Honorable Mention award. Email: jielei [at] meta.com |
News
- Nov 2025 » Releasing SAM 3, the most advanced segmentation and tracking model, try our demo.
- Feb 2023 » Two papers accepted at CVPR 2023.
- Feb 2023 » Our tutorial Knowledge-Driven Vision-Language Pretraining is accepted at CVPR 2023, see you in Vancouver.
- Dec 2022 » Our tutorial Knowledge-Driven Vision-Language Pretraining is accepted at AAAI 2023.
- May 2022 » I graduated with a PhD in Computer Science from UNC.
- Mar 2022 » Our workshop T4V: Transformers for Vision is accepted at CVPR 2022.
- Jun 2021 » We are hosting VALUE Challenge for video and language understanding at ICCV 2021 CLCV workshop, please join!
- Jun 2021 » ClipBERT is awarded the CVPR 2021 Best Student Paper Honorable Mention! 😍
- Feb 2021 » Received Adobe Research Fellowship, thanks Adobe!
- Jan 2021 » Research Internship @Facebook AI, working with Licheng Yu, Xinlei Chen and Ning Zhang.
- May 2020 » Research Internship @Microsoft, working with Linjie Li, Luowei Zhou, Zhe Gan and Jingjing Liu.
- May 2019 » Research Internship @Tencent AI Lab, Seattle, with Liwei Wang, Yelong Shen and Dong Yu
- Aug 2017 » I joined UNC as a PhD student.
Publications & Preprints
Data-Efficient Pretraining with Group-Level Data Influence Modeling
Zichun Yu, Fei Peng, Jie Lei, Arnold Overwijk, Wen-tau Yih, Chenyan Xiong
NeurIPS 2025
[PDF]
UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks
Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei
EMNLP 2024
[PDF]
Revealing Single Frame Bias for Video-and-Language Learning
Jie Lei, Tamara L. Berg, Mohit Bansal
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
Yan-Bo Lin, Jie Lei, Mohit Bansal, Gedas Bertasius
Resin-11: Schema-guided event prediction for 11 newsworthy scenarios
Xinya Du, Zixuan Zhang, Sha Li, Pengfei Yu, Hongwei Wang, Tuan Lai, Xudong Lin, Ziqi Wang, Iris Liu, Ben Zhou, Haoyang Wen, Manling Li, Darryl Hannan, Jie Lei, Hyounghun Kim, Rotem Dror, Haoyu Wang, Michael Regan, Qi Zeng, Qing Lyu, Charles Yu, Carl Edwards, Xiaomeng Jin, Yizhu Jiao, Ghazaleh Kazeminejad, Zhenhailong Wang, Chris Callison-Burch, Mohit Bansal, Carl Vondrick, Jiawei Han, Dan Roth, Shih-Fu Chang, Martha Palmer, Heng Ji
NAACL 2022 System Demo
[PDF]
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Jie Lei, Xinlei Chen, Ning Zhang, Mengjiao Wang, Mohit Bansal, Tamara L. Berg, Licheng Yu
arXiv 2022
[PDF]
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Jie Lei, Tamara L. Berg, Mohit Bansal
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
Linjie Li*, Jie Lei*, Zhe Gan, Licheng Yu, Yen-Chun Chen, Rohit Pillai, Yu Cheng, Luowei Zhou, Xin
Eric Wang, William Yang Wang, Tamara L. Berg, Mohit Bansal, Jingjing Liu, Lijuan Wang, Zicheng Liu
What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal
Projects
AnimeGAN: Create Anime Face using Generative Adversarial Networks,
Jie Lei
A simple GAN model that could automatically generate anime girl faces.
Miscs
- My Chinese name is 雷杰. I am from Sichuan, the hometown of pandas.
- I am
a big fanof Attack on Titan.