| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Thu, 25 Jul 2024 01:34:32 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"66a1aba8-2bb1"
expires: Sun, 28 Dec 2025 07:30:43 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: C464:272D88:760D51:8437BC:6950DA4A
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 07:20:43 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210098-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766906443.129985,VS0,VE222
vary: Accept-Encoding
x-fastly-request-id: f145d0920dca6c2ed47543bdb814d58e3e4b26ed
content-length: 2724
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
1Stability AI,
2Northeastern University
*Equal contribution
^Equal advising
×
Abstract
We present Stable Video 4D (SV4D) — a latent video diffusion model for multi-frame and multi-view consistent dynamic 3D content generation.
Unlike previous methods that rely on separately trained generative models for video generation and novel view synthesis, we design a unified diffusion model to generate novel view videos of dynamic 3D objects.
Specifically, given a monocular reference video, SV4D generates novel views for each video frame that are temporally consistent.
We then use the generated novel view videos to optimize an implicit 4D representation (dynamic NeRF) efficiently, without the need for cumbersome SDS-based optimization used in most prior works.
To train our unified novel view video generation model, we curated a dynamic 3D object dataset from the existing Objaverse dataset.
Extensive experimental results on multiple datasets and user studies demonstrate SV4D’s state-of-the-art performance on novel-view video synthesis as well as 4D generation compared to prior works.
Summary Video
Results and Comparison
Novel View Video Synthesis
Comparing our results with baselines.
4D Optimization
Comparing our results with baselines.
More results generated by SV4D
BibTeX
@article{xie2024sv4d,
title={{SV4D}: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency},
author={Yiming Xie and Chun-Han Yao and Vikram Voleti and Huaizu Jiang and Varun Jampani},
journal={arXiv preprint arXiv:2407.17470},
year={2024},
}