| CARVIEW |
Zhao Zhang (张 钊)
I'm currently working as a Vision-Language Researcher at ByteDance, with a focus on multimodal LLMs and their applications. I completed my Master's degree at Nankai University, where I was under the supervision of Ming-Ming Cheng. Please feel free to contact me at (📮: zzhang🥳mail🔅nankai🔅edu🔅cn)
Recent News
- [🔝/2025] 🌳 We're seeking interns passionate about Graphic Design to collaborate on impactful research projects. Please feel free to contact me via email.
-
[06/2025] We released CreatiPoster
, an AI-driven graphic design generation system for multi-layer and editable compositions with strong visual appeal.
- [05/2025] Our Unified-MLLM for image layer decomposition was was accepted by ICML 2025. The technical report will be released soon.
- [01/2025] RelationLMM was accepted by TPAMI-25.
- [12.2024] Our GRES is selected as an "Excellent Science & Technology Academic Paper" for the 2024 Shenzhen 4th Excellent Science & Technology Academic Paper Selection.
-
[04/2024] 🔥 Graphist
was accepted by AAAI 2025. We have unleashed the potential of MLLM in graphic design.
- [02/2024] One paper was accepted by CVPR 2024.
- [09/2023] D-Cube was accepted by NeurlPS 2023.
- [08/2023] We released Link-Context Learning for MLLMs as well as an interesting dataset ISEKAI.
- [07/2023] GRES was accepted by ICCV 2023.
-
[06/2023] 🔥 We released Shikra
, an awesome MLLM for Referential Dialogue.
- [01/2023] One paper was accepted by TPAMI.
- [08/2022] One paper was accepted by ECCV 2022.
- [07/2022] Two papers was accepted by ACM MM 2022.
- [07/2022] I am working as vision-language researcher in SenseTime Research.
- [06/2022] One paper was accepted by CVMJ.
- [03/2022] One paper was accepted by CVPR 2022 as oral presentation.
- [12/2020] I am working as an intern in Tencent Youtu Lab.
- [12/2020] One paper was accepted by TIP 2021.
- [07/2020] One paper was accepted by ECCV 2020.
- [05/2020] One paper was accepted by TNNLS 2020.
- [04/2020] One paper was accepted by CVPR 2020.
- [09/2019] I have joined the Media Computing Lab under the supervision of Prof. Ming-Ming Cheng!
- [06/2019] I graduated from Yangzhou University, and received my bachelor degree.
- [02/2018] One paper was accepted by ICASSP 2018.
- [07/2017] Two paper were accepted by ICONIP 2017, one of which is an oral paper.
Experiences
-
Expert Researcher in Vision & Language
2023 - NowIntelligent CreationByteDance
-
Researcher in Vision & Language
2022 - 2023Smart City Group (SCG)SenseTime
-
Internship in Computer Vision
2020 - 2021Youtu LabCSIG, Tencent
-
M.S. in Computer Science
2019 - 2022Media Computing Lab (supervised by Prof Ming-Ming Cheng)School of Computer Science, Nankai University
-
B.S. in Computer Science
2015 - 2019College of Innovation and Entrepreneurship (Elite College)School of Information Engineerin, Yangzhou University
Publications
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer
for Creative Graphic Design
Hui Zhang, Dexiang Hong, Maoke Yang, Yutao Cheng, Zhao Zhang
Jie Shao, Xinglong Wu, Zuxuan Wu, and Yu-Gang Jiang
arXiv 2025  
[Repo]
[Project page]
[Paper]
[bib]
Decomposition of Graphic Design with Unified Multimodal Model
Hui Nie, Zhao Zhang, Yutao Cheng, Maoke Yang, Gonglei Shi, Qingsong Xie, Jie Shao, Xinglong Wu
ICML 2025  
[Repo Coming Soon]
Gradient-Induced Co-Saliency Detection
Zhao Zhang*, Wenda Jin*, Jun Xu, Ming-Ming Cheng
ECCV 2020  
[PDF]
[Project]
[Code]
[Short Video]
[Long Video]
[Slides]
[中译版]
[bib]
Services
- Reviewer for T-PAMI, TIP, TMI, TMM, TCSVT, CVPR, ICCV, ECCV, NeurIPS, EMNLP, ACMMM, etc.