Exporters From Japan
Wholesale exporters from Japan   Company Established 1983
CARVIEW
Select Language

Coarse-to-fine text-to-3D optimization of DreamFlow

Our text-to-3D generation is done in coarse-to-fine manner; we first optimize NeRF, then extract 3D mesh and fine-tune. We use same latent diffusion model (denoiser 1) for first and second stage. Lastly, we refine 3D mesh with high-resolution latent diffusion prior (denoiser 2). At each stage, we optimize with different timestep schedule, which effectively utilize the diffusion priors.

The proposed framework, DreamFlow, perform coarse-to-fine text-to-3D optimization for high-quality 3D content generation. We first optimize NeRF (e.g., using hash-grid encoder) using latent diffusion model (e.g., Stable Diffusion v2.1) with resolution of 256x256, with timesteps decreasing from 1.0 to 0.2. Then, we extract a 3D mesh from stage 1 for efficient 3D modeling, and optimize 3D mesh with resolution of 512x512 using same denoiser of stage 1, with timesteps decreasing from 0.5 to 0.1. Lastly at stage 3, we refine the 3D mesh using diffusion refiner (e.g., Stable Diffusion XL refiner), to generate 3D mesh in resolution of 1024x1024. 3D mesh refinement significantly enhance the photorealism of 3D model, compared to prior methods.

Results

A sliced loaf of fresh bread.

A corgi standing up drinking boba.

An imperial state crown of England.

A beautiful dress made out of garbage bags, on a mannequin.

A 3D model of adorable cottage with a thatched roof.

A silver platter piled high with fruits.

A tarantula, highly detailed.

A tiger eating an ice cream cone.

A tiger dressed as a doctor.

Wedding dress made out of tenacles.

The template for this website is from here, and we appreciate their kindness in open-sourcing it.