Wenyan Cong's Homepage
Research Interests
My research focuses on modeling dynamic and complex 3D/4D environments: both capturing the real world through digital reconstruction and synthesizing controllable virtual worlds. I’m also interested in efficient AI algorithms, with an emphasis on improving the training and inference efficiency of large foundation models.
Selected Publications
Full publication list at Google Scholar
E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models
Wenyan Cong, Yiqing Liang, Yancheng Zhang, Ziyi Yang, Yan Wang, Boris Ivanovic, Marco Pavone, Chen Chen, Zhangyang Wang, Zhiwen Fan
In Submission
Can Scaling Test-Time Compute Improve World Foundation Model?
Wenyan Cong*, Hanqing Zhu*, Peihao Wang, Bangya Liu, Dejia Xu, Kevin Wang, David Z. Pan, Yan Wang, Zhiwen Fan, Zhangyang Wang
COLM 2025
VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment
Wenyan Cong, Hanqing Zhu, Kevin Wang, Jiahui Lei, Colton Stearns, Yuanhao Cai, Dilin Wang, Rakesh Ranjan, Matt Feiszli, Leonidas Guibas, Zhangyang Wang, Weiyao Wang, Zhiwen Fan
AI4CC CVPRW 🏆 Best Paper Award. 3DV 2026.
APOLLO: SGD-like Memory, AdamW-level Performance
Hanqing Zhu*, Zhenyu Zhang*, Wenyan Cong, Xi Liu, Sem Park, Vikas Chandra, Bo Long, David Z. Pan, Zhangyang Wang, Jinwon Lee
MLSys 2025 🏆 Outstanding Paper Honorable Mention Award
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
Zhiwen Fan*, Jian Zhang*, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang
NeurIPS 2024
PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices
Hanqing Zhu, Wenyan Cong, Guojin Chen, Shupeng Ning, Ray T. Chen, Jiaqi Gu, David Z. Pan
NeurIPS 2024
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
Zhiwen Fan*, Wenyan Cong*, Kairun Wen*, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang
In Submission
Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts
Wenyan Cong*, Hanxue Liang*, Peihao Wang, Zhiwen Fan, Tianlong Chen, Mukund Varma, Yi Wang, Zhangyang Wang
ICCV 2023
High-resolution Image Harmonization via Collaborative Dual Transformations
Wenyan Cong, Xinhao Tao, Li Niu, Jing Liang, Xuesong Gao, Qihao Sun, Liqing Zhang
CVPR 2022
Deep Image Harmonization by Bridging the Reality Gap
Junyan Cao, Wenyan Cong, Li Niu, Jianfu Zhang, Liqing Zhang
BMVC 2022
Making Images Real Again: A Comprehensive Survey on Deep Image Composition
Li Niu, Wenyan Cong, Liu Liu, Yan Hong, Bo Zhang, Jing Liang, Liqing Zhang
arXiv 2021
DoveNet: Deep Image Harmonization via Domain Verification
Wenyan Cong, Jianfu Zhang, Li Niu, Liu Liu, Zhixin Ling, Weiyuan Li, Liqing Zhang
CVPR 2020