HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Fri, 17 Oct 2025 02:48:40 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"68f1ae88-2673"
expires: Mon, 29 Dec 2025 06:54:33 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: ECE0:2680BD:86055B:96AA99:69522350
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 06:44:33 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210042-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766990673.173142,VS0,VE212
vary: Accept-Encoding
x-fastly-request-id: 7779515c693e5b962a4e7c60dd177150cb1ffeed
content-length: 2768
Yiming Zhang
News
- [2025.10]
Our paper FoleyCrafter is accepted by IJCV.
- [2024.07]
Release the paper and code of FoleyCrafter.
- [2024.06]
Graduated from Dalian University of Technology.
- [2024.03]
Our paper PIA is accepted by CVPR 2024.
|
|
|
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang*,
Zhening Xing*,
Yanhong Zeng,
Youqing Fang,
Kai Chen
CVPR, 2024
project page /
video /
arXiv /
demo /
code
PIA can animate any images from personalized models by text while preserving high-fidelity details and unique styles.
|
|
|
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Yiming Zhang,
Yicheng Gu,
Yanhong Zeng,
Zhening Xing,
Yuancheng Wang,
Zhizheng Wu,
Kai Chen
IJCV
project page /
video /
arXiv /
demo /
code
FoleyCrafter is a video-to-audio generation framework which can produce realistic sound effects semantically relevant and synchronized with videos.
|
- Invited Talk: Personalized Image Animator (OpenMMLab on Bilibili Live 2024)
- Conference Reviewer: NeurIPS, CVPR.
-
Outstanding Undergraduate in 2024.
-
National Scholarship in 2022 (Top 1% in DUT).
- Merit Students.
-
First Prize Excellence Scholarship.
|