| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Tue, 23 Dec 2025 15:34:49 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"694ab699-a09c"
expires: Sun, 28 Dec 2025 20:39:48 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: D3D4:318CF6:7EA062:8E2E3A:6951933C
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 20:29:48 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210043-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766953788.213417,VS0,VE207
vary: Accept-Encoding
x-fastly-request-id: a2cb3dfb4ff01c31fb7454a7507769699597e6fd
content-length: 9341
Open☀️3D
OpenSUN3D
6th Workshop on Open-World 3D Scene Understanding and Representations
in conjunction with CVPR in Colorado, USA.
Introduction
For intelligent agents to thrive in the physical world, they must not only perceive but also comprehend and act within it.
This year’s workshop unites leading researchers advancing world models and spatial intelligence, from 3D perception and scene understanding to generative world representations.
Through keynotes and a challenge on open 3D interaction perception, we will explore how machines learn to reason about and engage with their environments.
The workshop aims to shape the next generation of spatially grounded AI, bridging scientific discovery, practical deployment, and responsible innovation.
More information on paper submission and challenge track will follow soon.
More information on paper submission and challenge track will follow soon.
Related Works
Below is a collection of concurrent and related works in the field of open-set 3D scene understanding.- ConceptFusion: Open-set Multimodal 3D Mapping RSS'23
- OpenScene: 3D Scene Understanding with Open Vocabularies CVPR'23
- LERF: Language Embedded Radiance Fields ICCV'23
- Decomposing NeRF for Editing via Feature Field Distillation NeurIPS'22
- Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models CoRL'22
- Language-Grounded Indoor 3D Semantic Segmentation in the Wild ECCV'22
- MultiScan: Scalable RGBD Scanning for 3D Environments with Articulated Objects NeurIPS'22
- ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models CVPR'23
- Weakly Supervised 3D Open-Vocabulary Segmentation NeurIPS'23
- Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes
- OpenMask3D: Open-Vocabulary 3D Instance Segmentation NeurIPS'23
- CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
- VL-Fields: Towards Language-Grounded Neural Implicit Spatial Representations ICRA'23
- PLA: Language-Driven Open-Vocabulary 3D Scene Understanding CVPR'23
- RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
- OpenIns3D: Snap and Lookup for 3D open-vocabulary Instance Segmentation
- ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
- Open-Vocabulary Point-Cloud Object Detection without 3D Annotation CVPR'23
- CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection NeurIPS'23
- SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes CVPR'24
- Clio: Real-time Task-Driven Open-Set 3D Scene Graphs RA-L'24
- Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation CVPR'25
Organizers