| CARVIEW |
Select Language
HTTP/2 301
server: GitHub.com
content-type: text/html
location: https://junlinhan.github.io/projects/flex3d/
access-control-allow-origin: *
strict-transport-security: max-age=31556952
expires: Sun, 28 Dec 2025 15:13:35 GMT
cache-control: max-age=600
x-proxy-cache: MISS
x-github-request-id: 3EFA:2D64E0:7B0E9A:8A0A77:695146BB
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 15:03:35 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210050-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766934216.621098,VS0,VE200
vary: Accept-Encoding
x-fastly-request-id: f9ba0b18ee630c41b2f11eba533f8f4fdd2b915b
content-length: 162
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Mon, 01 Dec 2025 04:25:01 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"692d189d-2b0d"
expires: Sun, 28 Dec 2025 15:13:35 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: CB49:3827E5:7C89B0:8B9A3E:695146C7
accept-ranges: bytes
age: 0
date: Sun, 28 Dec 2025 15:03:36 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210050-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766934216.834513,VS0,VE209
vary: Accept-Encoding
x-fastly-request-id: 424d930fcd4057b9bf0ca1c8af5e48d9df66b65e
content-length: 3337
Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation
Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation
ICML 2025
¹GenAI, Meta ²University of Oxford
Flex3D comprises two stages:
(1) candidate view generation and selection, and
(2) 3D reconstruction using FlexRM.
In the first stage, an input image or textual prompt drives the generation of a diverse set of candidate views through fine-tuned multi-view and video diffusion models.
These views are subsequently filtered based on quality and consistency using a view selection mechanism.
The second stage leverages the selected high-quality views, feeding them to FlexRM which reconstruct the 3D object using a tri-plane representation decoded into 3D Gaussians.
Summary: Flex3D is a two-stage pipeline that generates high-quality 3D assets from single images or text prompts.
Interactive Results
Explore generation results (Gaussian Splats) below.Method
Acknowledgements
Junlin Han is supported by Meta. We would like to thank Luke Melas-Kyriazi, Runjia Li, Yawar Siddiqui, Minghao Chen, David Novotny, and Natalia Neverova for the helpful discussions and support.