| CARVIEW |
AI for Content Creation Workshop
June 12th @ CVPR 2025
Karl F. Dean Grand Ballroom A1, 4th Floor, Music City Center, Nashville, TN, USA
Remote (Zoom): Via CVPR site
Jeong et al., AI4CC 2024
Visual Style Prompting with Swapping Self-Attention
Zheng et al., AI4CC 2024
Towards Safer AI Content Creation by Immunizing Text-to-image Models
Barquero et al., AI4CC 2024
Seamless Human Motion Composition with Blended Positional Encodings
Matsunaga et al., AI4CC 2023
Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models
Zhang et al., AI4CC 2023
Text-to-image Editing by Image Information Removal
Jain et al., AI4CC 2022
Zero-Shot Text-Guided Object Generation with Dream Fields
Lee, Lee, Kim, Choi, & Kim, AI4CC 2022
Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN
Jang, Villegas, Yang, Ceylan, Sun, & Lee, AI4CC 2022
RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects
Poirier-Ginter, Alexandre Lessard, Ryan Smith, Jean-François Lalonde, AI4CC 2022
Overparameterization Improves StyleGAN Inversion
Jahn et al., AI4CC 2021
High-Resolution Complex Scene Synthesis with Transformers
Rombach, Esser, and Ommer, AI4CC 2020
Network Fusion for Content Creation with Conditional INNs
Cha et al., AI4CC 2020
Toward High-quality Few-shot Font Generation with Dual Memory
Sylvain et al., AI4CC 2020
Object-Centric Image Generation from Layouts
Summary
Content creation plays a crucial role in domains such as photography, videography, virtual reality, gaming, art, design, fashion, and advertising design. Recent progress in machine learning and AI has transformed hours of manual, painstaking content creation work into minutes or seconds of automated or interactive work. For instance, generative modeling approaches can produce photorealistic images of 2D and 3D items such as humans, landscapes, interior scenes, virtual environments, clothing, or even industrial designs. New large text, image, and video models that share latent spaces let us imaginatively describe scenes and have them realized automatically—with new multi-modal approaches able to generate consistent video and audio across long timeframes. Such approaches can also super-resolve and super-slomo videos, interpolate and extrapolate between photos and videos with intermediate novel views, decompose scene objects and appearance, and transfer styles to convincingly render and reinterpret content. Learned priors of images, videos, and 3D data can also be combined with explicit appearance and geometric constraints, perceptual understanding, or even functional and semantic constraints of objects. While often creating awe-inspiring artistic images, such techniques offer unique opportunities for generating diverse synthetic training data for downstream computer vision tasks, both in 2D, video, and 3D domains.
The AI for Content Creation workshop explores this exciting and fast-moving research area. We bring together invited speakers of world-class expertise in content creation, up-and-coming researchers, and authors of submitted workshop papers, to engage in a day filled with learning, discussion, and network building.
Welcome! -
Deqing Sun (Google)
Lingjie Liu (University of Pennsylvania)
Krishna Kumar Singh (Adobe)
Lu Jiang (ByteDance)
Jun-Yan Zhu (Carnegie Mellon University)
James Tompkin (Brown University)
Firefly Video (Adobe, 2025), Genie 2 (DeepMind, 2024), SORA (OpenAI, 2024).
2025 Awards
Best paper
VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment
Wenyan Cong (University of Texas at Austin); Hanqing Zhu (University of Texas at Austin); Kevin Wang (University of Texas at Austin); Jiahui Lei (University of Pennsylvania); Colton Stearns (Stanford University); Yuanhao Cai (Johns Hopkins University); Dilin Wang (Meta); Rakesh Ranjan (Meta); Matt Feiszli (Meta); Leonidas Guibas (Stanford University); Atlas Wang (University of Texas at Austin); Weiyao Wang (Meta); Zhiwen Fan (University of Texas at Austin) [https://videolifter.github.io/]Best presentation (shared)
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu (KAIST); Meera Hahn (Google DeepMind); Dan Kondratyuk (Luma AI); Jinwoo Shin (KAIST); Agrim Gupta (Google DeepMind); José Lezama (Google DeepMind); Irfan Essa (Google DeepMind); David Ross (Google DeepMind); Jonathan Huang (Scaled Foundations)Best presentation (shared)
Art3D: Training-Free 3D Generation from Flat-Colored Illustration
Xiaoyan Cong (Brown University); Jiayi Shen (Brown University); Zekun Li (Brown University); Rao Fu (Brown University); Tao Lu (Brown University); Srinath Sridhar (Brown University) [https://joy-jy11.github.io/]Best presentation (shared)
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding (The University of Tokyo); Kiyoharu AIZAWA (The University of Tokyo); Jiafeng MAO (The University of Tokyo)
2025 Schedule
Morning session:| Time CDT | |||
|---|---|---|---|
| 08:45 | Welcome and introductions | 👋 | |
| 09:00 | Maneesh Agrawala (Stanford University) |
|
|
| 09:30 | Kai Zhang (Adobe) | ||
| 10:00 | Coffee break | ☕ | |
| 10:30 | Charles Herrmann (Google) | ||
| 11:00 | Mark Boss (Stability AI) | ||
| 11:30 | Poster session 1 - ExHall D #412-431
|
||
| 12:30 | Lunch break - ExHall C | 🥪 |
Cat4D (Google, 2024), AssetGen (Meta, 2024), DreamFusion (Google, 2022).
Afternoon session:
| Time CDT | |||
|---|---|---|---|
| 13:30 | Oral session + best paper announcement + best presentation competition
|
||
| 14:00 | Yutong Bai (UC Berkeley) |
|
|
| 14:30 | Nanxuan (Cherry) Zhao (Adobe) | ||
| 15:00 | Coffee break | ☕ | |
| 15:30 | Ishan Misra (Meta) |
|
|
| 16:00 | Panel discussion — Open Source in AI and the Creative Industry
|
🗣️ | |
| 17:00 | Poster session 2 - ExHall D #412-431
|
Dall-E 2 (OpenAI, 2022), Imagen (Google, 2022), GauGAN2 (NVIDIA, 2021).
Previous Workshops (including session videos)
- 2024 - AI for Content Creation (Workshop at CVPR 2024).
- 2023 - AI for Content Creation (Workshop at CVPR 2023).
- 2022 - AI for Content Creation (Workshop at CVPR 2022).
- 2021 - AI for Content Creation (Workshop at CVPR 2021).
- 2020 - AI for Content Creation (Workshop at CVPR 2020).
- 2019 - Deep Learning for Content Creation (Tutorial at CVPR 2019)