| CARVIEW |
Select Language
HTTP/2 200
accept-ranges: bytes
age: 1
cache-control: public,max-age=0,must-revalidate
cache-status: "Netlify Edge"; fwd=miss
content-encoding: gzip
content-type: text/html; charset=UTF-8
date: Sun, 28 Dec 2025 19:10:58 GMT
etag: "a23f8afbb556dd78e61503507f3a615b-ssl-df"
permissions-policy: accelerometer=(), camera=(), geolocation=(), gyroscope=(), magnetometer=(), microphone=(), payment=(), usb=()
referrer-policy: strict-origin-when-cross-origin
server: Netlify
strict-transport-security: max-age=31536000; includeSubDomains
vary: Accept-Encoding
x-content-type-options: nosniff
x-nf-request-id: 01KDK5XXD7ZFRQ7KZD7TRX6977
x-xss-protection: 1; mode=block
Jin Wang 
I am currently a 3rd-year PhD student at University of Hong Kong (HKU), supervised by Prof. Ping Luo. I obtained my Master degree from Institute of Computing Technology (ICT), Chinese Academy of Sciences (CAS), supervised by Prof. Chao Li. I received my Bachelor degree from Dalian University of Technology (DLUT). My current research interests lie in multimodality learning, forgery detection, and explainable artificial intelligence.
Experience
MEng in Electronic and Information Engineering
September 2020 –
June 2023
Beijing
BEng in Digital Media Technology
September 2016 –
June 2020
Dalian, Liaoning
Recent Publications
Quickly discover relevant content by filtering publications.
Jin Wang, Yao Lai, Aoxue Li, Shifeng Zhang, Jiacheng Sun, Ning Kang, Chengyue Wu, Zhenguo Li, Ping Luo
(2025).
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities.
To appear in NeurIPS 2025 (Spotlight).
Xin Dong, Shichao Dong, Jin Wang, Jing Huang, Li Zhou, Zenghui Sun, Lihua Jing, Jingsong Lan, Xiaoyong Zhu, Bo Zheng
(2025).
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling.
In ICCV 2025.
Jin Wang, Chenghui Lv, Xian Li, Shichao Dong, Huadong Li, Kelu Yao, Chao Li, Wenqi Shao, Ping Luo
(2025).
Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models.
In CVPR 2025.
Shurong Yang, Huadong Li, Juhao Wu, Minhao Jing, Linze Li, Renhe Ji, Jiajun Liang, Haoqiang Fan, Jin Wang
(2025).
Megactor-sigma: Unlocking flexible mixed-modal control in portrait animation with diffusion transformer.
In AAAI 2025.