HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Tue, 02 Dec 2025 15:05:07 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"692f0023-9d95"
expires: Mon, 29 Dec 2025 17:24:24 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 9A59:15317B:920F61:A3D16D:6952B6F0
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 17:14:25 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210089-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767028465.890424,VS0,VE207
vary: Accept-Encoding
x-fastly-request-id: b11bd9ce327df139b368c9ebadf59f323da17291
content-length: 7789
Zhangjie Wu (*) denotes equal contribution ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
Jay Zhangjie Wu* , Xuanchi Ren*, Tianchang Shen, Tianshi Cao, Kai He, Yifan Lu, Ruiyuan Gao, Enze Xie, Shiyi Lan, Jose M. Alvarez, Jun Gao, Sanja Fidler, Zian Wang, and Huan Ling*
Technical Report
· Sparse Image Synthesis via Joint Latent and RoI Flow
Ziteng Gao, Jay Zhangjie Wu , and Mike Zheng Shou
NeurIPS 2025
Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
Jay Zhangjie Wu* , Yuxuan Zhang*, Haithem Turki, Xuanchi Ren, Jun Gao, Mike Zheng Shou, Sanja Fidler, Zan Gojcic, and Huan Ling
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
Xuanchi Ren*, Yifan Lu*, Tianshi Cao*, Ruiyuan Gao*, Shengyu Huang, Amirmojtaba Sabour, Tianchang Shen, Tobias Pfaff, Jay Zhangjie Wu , Runjian Chen, Seung Wook Kim, Jun Gao, Laura Leal-Taixe, Mike Chen, Sanja Fidler, and Huan Ling
White Paper
· Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control
NVIDIA
White Paper
· Cosmos World Foundation Model Platform for Physical AI
NVIDIA (Jay Zhangjie Wu: Core contributor)
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video
Yifan Lu*, Xuanchi Ren*, Jiawei Yang, Tianchang Shen, Zhangjie Wu , Jun Gao, Yue Wang, Siheng Chen, Mike Chen, Sanja Fidler, and Jiahui Huang
ICCV 2025
· SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
Xuanchi Ren*, Yifan Lu*, Hanxue Liang, Zhangjie Wu , Huan Ling, Mike Chen, Sanja Fidler, Francis Williams, and Jiahui Huang
NeurIPS 2024
· EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
Rui Zhao, Hangjie Yuan, Yujie Wei, Shiwei Zhang, Yuchao Gu, Lingmin Ran, Xiang Wang, Zhangjie Wu , Junhao Zhang, Yingya Zhang, and Mike Zheng Shou
NeurIPS 2024
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao, Yuchao Gu, Jay Zhangjie Wu , David Junhao Zhang, Jiawei Liu, Weijia Wu, Jussi Keppo, and Mike Zheng Shou
ECCV 2024 (Oral)
· Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks
David Junhao Zhang, Mutian Xu, Jay Zhangjie Wu , Wenqing Zhang, Xiaoguang Han, Song Bai, and Mike Zheng Shou
ECCV 2024
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Yuchao Gu, Yipin Zhou, Bichen Wu, Licheng Yu, Jia-Wei Liu, Rui Zhao, Jay Zhangjie Wu , David Junhao Zhang, Mike Zheng Shou, and Kevin Tang
CVPR 2024
· DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu, Yan-Pei Cao, Jay Zhangjie Wu , Weijia Mao, Yuchao Gu, Rui Zhao, Jussi Keppo, Ying Shan, and Mike Zheng Shou
CVPR 2024
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
David Junhao Zhang*, Jay Zhangjie Wu* , Jia-Wei Liu*, Rui Zhao, Lingmin Ran, Yuchao Gu, Difei Gao, and Mike Zheng Shou
IJCV 2024
· Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu, Xintao Wang, Jay Zhangjie Wu , Yujun Shi, Yunpeng Chen, Zihan Fan, Wuyou Xiao, Rui Zhao, Shuning Chang, Weijia Wu, Yixiao Ge, Ying Shan, and Mike Zheng Shou
NeurIPS 2024
· Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu , Yixiao Ge, Xintao Wang, Stan Weixian Lei, Yuchao Gu, Yufei Shi, Wynne Hsu, Ying Shan, Xiaohu Qie, and Mike Zheng Shou
Label-Efficient Online Continual Object Detection in Streaming Video
Jay Zhangjie Wu , David Junhao Zhang, Wynne Hsu, Mengmi Zhang, and Mike Zheng Shou
ICCV 2023
Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Stan Weixian Lei*, Difei Gao*, Jay Zhangjie Wu , Yuxuan Wang, Wei Liu, Mengmi Zhang, and Mike Zheng Shou
AAAI 2023 (Oral)
Mining whole-lung information by artificial intelligence for predicting EGFR genotype and targeted therapy response in lung cancer: a multicohort study
Shuo Wang*, He Yu*, Yuncui Gan*, Zhangjie Wu , and et al.
The Lancet Digital Health 2022 (Impact Factor: 23.8)
Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module
Yanlu Wei*, Renshuai Tao*, Zhangjie Wu , Yuqing Ma, Libo Zhang, and Xianglong Liu
ACM MM 2020 (Oral)