CARVIEW

MOTORHOMES

Select Language

HTTP/2 200 server: GitHub.com content-type: text/html; charset=utf-8 last-modified: Sun, 31 Aug 2025 16:15:13 GMT access-control-allow-origin: * strict-transport-security: max-age=31556952 etag: W/"68b47511-310a" expires: Mon, 29 Dec 2025 12:19:50 GMT cache-control: max-age=600 content-encoding: gzip x-proxy-cache: MISS x-github-request-id: 1522:1387E:8B7064:9CA9C9:69526F8D accept-ranges: bytes age: 0 date: Mon, 29 Dec 2025 12:09:50 GMT via: 1.1 varnish x-served-by: cache-bom-vanm7210074-BOM x-cache: MISS x-cache-hits: 0 x-timer: S1767010190.025705,VS0,VE206 vary: Accept-Encoding x-fastly-request-id: b9cac2adc29cad8a0f25c0e976d997107d71576b content-length: 3149 GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions

GaussianEditor:
Editing 3D Gaussians Delicately with
Text Instructions

CVPR 2024

Junjie Wang^*, Jiemin Fang^*†, Xiaopeng Zhang, Lingxi Xie, Qi Tian

Huawei Inc.

* denotes equal contributions. † denotes corresponding author.

arXiv Video

Abstract

Recently, impressive results have been achieved in 3D scene editing with text instructions based on a 2D diffusion model. However, current diffusion models primarily generate images by predicting noise in the latent space, and the editing is usually applied to the whole image, which makes it challenging to perform delicate, especially localized, editing for 3D scenes. Inspired by recent 3D Gaussian splatting, we propose a systematic framework, named GaussianEditor, to edit 3D scenes delicately via 3D Gaussians with text instructions. Benefiting from the explicit property of 3D Gaussians, we design a series of techniques to achieve delicate editing. Specifically, we first extract the region of interest (RoI) corresponding to the text instruction, aligning it to 3D Gaussians. The Gaussian RoI is further used to control the editing process. Our framework can achieve more delicate and precise editing of 3D scenes than previous methods while enjoying much faster training speed, i.e. within 20 minutes on a single V100 GPU, more than twice as fast as Instruct-NeRF2NeRF (45 minutes -- 2 hours).

GaussianEditor:
Editing 3D Gaussians Delicately with
Text Instructions

Abstract

Video

360° Scene Editing

Multiple-Round Editing

Comparisons with Instruct-NeRF2NeRF

Complex Multi-Object Scenes

More Examples

More Examples

Extension with GaussianDreamer

Citation

GaussianEditor: Editing 3D Gaussians Delicately withText Instructions

Abstract

Video

360° Scene Editing

Multiple-Round Editing

Comparisons with Instruct-NeRF2NeRF

Complex Multi-Object Scenes

More Examples

More Examples

Extension with GaussianDreamer

Citation

GaussianEditor:
Editing 3D Gaussians Delicately with
Text Instructions