Hanyu Wang
About Me
I am a Ph.D. student in the Department of Computer Science at the University of Maryland, College Park, advised by Prof. Abhinav Shrivastava.
My research focuses on computer vision and generative AI, with an emphasis on visual content generation under various conditions. My long-term goal is to build multimodal foundation models that unify the understanding and generation of different data types, enabling cross-modal learning and mutual enhancement across modalities.
Selected Publications
Vision as a Dialect: Unifying Visual Understanding & Generation via Text-Aligned Representations
Jiaming Han, Hao Chen, Yang Zhao, Hanyu Wang, Qi Zhao, Ziyan Yang, Hao He, Xiangyu Yue, Lu Jiang
NeurIPS 2025
LARP: Tokenizing Videos 🎬 with a Learned Autoregressive Generative Prior 🚀
Hanyu Wang, Saksham Suri, Yixuan Ren, Hao Chen, Abhinav Shrivastava
ICLR 2025 (Oral)
NeRV-Diffusion: Diffuse Implicit Neural Representation for Video Synthesis
Yixuan Ren, Hanyu Wang, Hao Chen, Bo He, Abhinav Shrivastava
arXiv, 2025
Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion
Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava
WACV 2024
Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint
Haoyue Tang, Tian Xie, Aosong Feng, Hanyu Wang, Chenyang Zhang, Yang Bai
AISTATS 2024
Chop & Learn: Recognizing and Generating Object-State Compositions
Nirat Saini*, Hanyu Wang*, Archana Swaminathan, Vinoj Jayasundara, Bo He, Kamal Gupta, Abhinav Shrivastava
ICCV 2023
Towards Scalable Neural Representation for Diverse Videos
Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
CVPR 2023
NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks
Shishira R Maiya*, Sharath Girish*, Max Ehrlich, Hanyu Wang, Kwot Sin Lee, Patrick Poirson, Pengxiang Wu, Chen Wang, Abhinav Shrivastava
CVPR 2023
NeRV: Neural Representations for Videos
Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
NeurIPS 2021
Learning local shape descriptors for computing non-rigid dense correspondence
Jianwei Guo, Hanyu Wang, Zhanglin Cheng, Xiaopeng Zhang, Dong-Ming Yan
Computational Visual Media, 2020
Learning 3d keypoint descriptors for non-rigid shape matching
Hanyu Wang*, Jianwei Guo*, Dong-Ming Yan, Weize Quan, Xiaopeng Zhang
ECCV 2018