| CARVIEW |
Select Language
Check our gallery for more interactive results!
More Results
Input Video
3D Tracks
Novel View
Input Video
3D Tracks
Novel View
Input Video
3D Tracks
Novel View
Input Video
3D Tracks
Novel View
Input Video
3D Tracks
Novel View
3D Tracking Comparison
For each method, we render the video from a novel viewpoint and overlay their predicted 3D tracks onto the novel views. TAPIR + Depth Anything does not produce novel views and we instead overlay their tracks onto our renderings.
HyperNeRF
Deformable-3D-GS
TAPIR + Depth Anything
Ours
HyperNeRF
Deformable-3D-GS
TAPIR + Depth Anything
Ours
HyperNeRF
Deformable-3D-GS
TAPIR + Depth Anything
Ours
Novel View Synthesis Comparison
HyperNeRF
Deformable-3D-GS
Ours
HyperNeRF
Deformable-3D-GS
Ours
2D Tracking Comparison
TAPIR
Ours
TAPIR
Ours
Failure Cases
Fast motion and occlusions are challenging for our method.
Our method relies on off-the-shelf methods, e.g., mono-depth estimation, which can be incorrect.
Related links
We recognize that a few concurrent works address similar problems to ours. We encourage you to check them out:Acknowledgements
We thank Ruilong Li, Brent Yi, Noah Snavely and Aleksander Holynski for helpful discussion. We also thank Brent Yi for helping us set up the interactive demo. We are in memory of our beloved cat Sriracha, who will always be missed and loved. This project is supported in part by DARPA No. HR001123C0021. and IARPA DOI/IBC No. 140D0423C0035. The views and conclusions contained herein are those of the authors and do not represent the official policies or endorsements of these institutions.BibTeX
@inproceedings{som2024,
title = {Shape of Motion: 4D Reconstruction from a Single Video},
author = {Wang, Qianqian and Ye, Vickie and Gao, Hang and Zeng, Weijia and Austin, Jake and Li, Zhengqi and Kanazawa, Angjoo},
booktitle = {International Conference on Computer Vision (ICCV)},
year = {2025}
}