HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 17 Dec 2025 06:32:57 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"69424e99-3a46"
expires: Mon, 29 Dec 2025 21:25:13 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 6312:123DE:94EF3C:A71E16:6952EF60
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 21:15:13 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210091-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767042913.099895,VS0,VE212
vary: Accept-Encoding
x-fastly-request-id: a6edda684e7a3a29a1ad3668c95f5afd007a2439
content-length: 3576
Ziyang Song
OSN: Infinite Representations of Dynamic 3D Scenes from Monocular Videos
Ziyang Song , Jinxi Li, Bo Yang
International Conference on Machine Learning (ICML ) , 2024
arXiv /
Video /
Code
The first framework to represent dynamic 3D scenes in infinitely many ways from a monocular RGB video.
Unsupervised 3D Object Segmentation of Point Clouds by Geometry Consistency
Ziyang Song , Bo Yang
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI ) , 2024
IEEE Xplore /
Video /
Code
The journal version of our OGC (NeurIPS 2022). More experiments and analysis are included.
NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos
Jinxi Li, Ziyang Song , Bo Yang
Advances in Neural Information Processing Systems (NeurIPS ) , 2023
arXiv /
Code
A novel representation of dynamic 3D scenes by disentangling physical velocities from geometry and appearance, enabling: 1) future frame extrapolation, 2) unsupervised semantic scene decomposition, and 3) velocity transfer.
ActFormer: A GAN-based Transformer towards General Action-Conditioned 3D Human Motion Generation
Liang Xu*, Ziyang Song* , Dongliang Wang, Jing Su, Zhicheng Fang, Chenjing Ding, Weihao Gan, Yichao Yan, Xin Jin, Xiaokang Yang, Wenjun Zeng, Wei Wu
International Conference on Computer Vision (ICCV ) , 2023
arXiv /
Project Page /
Code
(* denotes equal contribution)
A GAN-based Transformer for general action-conditioned 3D human motion generation, including single-person actions and multi-person interactive actions.
OGC: Unsupervised 3D Object Segmentation from Rigid Dynamics of Point Clouds
Ziyang Song , Bo Yang
Advances in Neural Information Processing Systems (NeurIPS ) , 2022
arXiv /
Video /
Code
We propose the first unsupervised 3D object segmentation method, learning from dynamic motion patterns in point cloud sequences.