DIRECT-3D is a new text-to-3D generative model that directly generates 3D content in a single forward pass, without per-prompt optimization.
[Fast text-to-3D generation without optimization] Driven by text prompts, it generates high-quality 3D objects with accurate geometric details and diverse textures in 12 seconds on a single V100 GPU.
a Wall-E
an astronaut wearing a colorful spacesuit
a Transformed Bumblebee robot with intricate body details
a batman mask
a biplane with yellow wings
a voxelized cupcake made with LEGO
the Statue of Liberty, aerial view
a French throne chair
[Accurate 3D geometry prior] DIRECT-3D also provides an accurate and effective 3D geometric prior that significantly alleviates the Janus problem in 2D-lifting methods. The 3D knowledge is embedded in a Stable Diffusion-like architecture, making the model easy to use and compatible with many existing algorithms.
[Comparison figure: Ours + DreamFusion vs. DreamFusion. The 3D prior from DIRECT-3D eliminates the Janus problem.]
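The page does not specify how the prior is injected into 2D-lifting pipelines, so the following is only a minimal sketch of one way a text-conditioned 3D diffusion prior could be added to a DreamFusion-style score-distillation (SDS) loop: alongside the usual SDS term on rendered views, an analogous term on the geometry tri-plane pulls the optimized NeRF toward shapes the 3D prior considers likely. All callables (render_fn, triplane_fn, denoiser_2d, denoiser_3d) are hypothetical placeholders rather than the released DIRECT-3D API; only the SDS update rule follows the standard formulation.

import torch

def sds_grad(denoiser, x, text_emb, t, alphas_cumprod):
    """Score-distillation (SDS) gradient w.r.t. x under a frozen diffusion denoiser."""
    noise = torch.randn_like(x)
    a_t = alphas_cumprod[t]
    x_noisy = a_t.sqrt() * x + (1.0 - a_t).sqrt() * noise   # forward diffusion q(x_t | x)
    with torch.no_grad():
        eps_pred = denoiser(x_noisy, t, text_emb)            # frozen prior predicts the noise
    return (1.0 - a_t) * (eps_pred - noise)                   # w(t) * (eps_hat - eps), no grad

def lifting_step(render_fn, triplane_fn, denoiser_2d, denoiser_3d, text_emb,
                 t, alphas_cumprod, optimizer, lambda_3d=1.0):
    """One DreamFusion-style update with an extra 3D-prior term on the geometry tri-plane."""
    image = render_fn()        # rendered view(s) of the current NeRF, e.g. (B, 3, H, W)
    triplane = triplane_fn()   # geometry tri-plane features of the current NeRF
    g_2d = sds_grad(denoiser_2d, image, text_emb, t, alphas_cumprod)
    g_3d = sds_grad(denoiser_3d, triplane, text_emb, t, alphas_cumprod)
    # Surrogate losses whose gradients w.r.t. the NeRF parameters equal the SDS gradients.
    loss = (image * g_2d).sum() + lambda_3d * (triplane * g_3d).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return float(loss.detach())

In this sketch the 3D term acts purely as a regularizer; setting lambda_3d to zero recovers a vanilla DreamFusion step.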
Abstract
We present DIRECT-3D, a diffusion-based 3D generative model for creating high-quality 3D assets (represented by Neural Radiance Fields) from text prompts. Unlike recent 3D generative models that rely on clean and well-aligned 3D data, which limits them to single- or few-class generation, our model is directly trained on extensive noisy and unaligned ‘in-the-wild’ 3D assets, mitigating the key challenge (i.e., data scarcity) in large-scale 3D generation. In particular, DIRECT-3D is a tri-plane diffusion model that integrates two innovations: 1) A novel learning framework where noisy data are filtered and aligned automatically during the training process. Specifically, after an initial warm-up phase using a small set of clean data, an iterative optimization is introduced in the diffusion process to explicitly estimate the 3D pose of objects and select beneficial data based on conditional density. 2) An efficient 3D representation that is achieved by disentangling object geometry and color features with two separate conditional diffusion models that are optimized hierarchically. Given a prompt input, our model generates high-quality, high-resolution, realistic, and complex 3D objects with accurate geometric details in seconds. We achieve state-of-the-art performance in both single-class generation and text-to-3D generation. We also demonstrate that DIRECT-3D can serve as a useful 3D geometric prior of objects, for example to alleviate the well-known Janus problem in 2D-lifting methods such as DreamFusion.
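As a concrete illustration of innovation 1), the sketch below shows one plausible way the pose estimation and data selection could be realized: each noisy asset is re-posed over a set of candidate orientations, the pose with the lowest denoising error under the current geometry diffusion model is kept (a proxy for high conditional density), and assets whose best error remains above a threshold are discarded. The repose_fn operator, the tensor layout, and the threshold are assumptions for illustration, not the paper's exact algorithm.

import torch

@torch.no_grad()
def align_and_filter(denoiser, repose_fn, triplane, text_emb, candidate_poses,
                     t, alphas_cumprod, reject_threshold):
    """Return (best_pose, keep) for a single noisy 'in-the-wild' training asset."""
    a_t = alphas_cumprod[t]
    noise = torch.randn_like(triplane)
    best_err, best_pose = float("inf"), None
    for pose in candidate_poses:                          # e.g. a discretized set of azimuths
        x = repose_fn(triplane, pose)                     # re-pose the asset (placeholder op)
        x_noisy = a_t.sqrt() * x + (1.0 - a_t).sqrt() * noise
        eps_pred = denoiser(x_noisy, t, text_emb)
        err = (eps_pred - noise).pow(2).mean().item()     # low denoising error ~ high density
        if err < best_err:
            best_err, best_pose = err, pose
    keep = best_err < reject_threshold                    # drop assets the model cannot explain
    return best_pose, keep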
Method
Given a prompt, we generate a NeRF with two modules: the disentangled tri-plane diffusion module uses two (or four, if the super-resolution plug-in is used) diffusion models to generate the geometry (f_g) and color (f_c) tri-planes separately. Both tri-planes are then reshaped and fed into a NeRF auto-decoder to produce the final outputs. During training, an iterative optimization process is introduced in the geometry diffusion to explicitly model the pose θ of objects and to select beneficial samples, enabling efficient training on noisy ‘in-the-wild’ data. The whole model is end-to-end trainable (with or without the 3D super-resolution plug-in), with only multi-view 2D images as supervision.
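For concreteness, the following sketch outlines this inference pipeline under assumed shapes and interfaces: sample the geometry tri-plane f_g from noise, sample the color tri-plane f_c conditioned on f_g, reshape both into per-plane features, and decode them with the NeRF auto-decoder. sample_geometry, sample_color, and nerf_decoder are placeholders standing in for the actual modules, not the released code.

import torch

def generate_nerf(sample_geometry, sample_color, nerf_decoder, text_emb,
                  plane_res=32, channels=8):
    """Text -> geometry tri-plane -> color tri-plane -> NeRF auto-decoder."""
    noise_g = torch.randn(1, 3 * channels, plane_res, plane_res)
    f_g = sample_geometry(noise_g, text_emb)               # stage 1: geometry tri-plane
    noise_c = torch.randn(1, 3 * channels, plane_res, plane_res)
    f_c = sample_color(noise_c, text_emb, f_g)             # stage 2: color, conditioned on geometry
    # Reshape the stacked channels into three (XY, XZ, YZ) feature planes each,
    # then decode density and RGB with the shared NeRF auto-decoder.
    f_g = f_g.view(1, 3, channels, plane_res, plane_res)
    f_c = f_c.view(1, 3, channels, plane_res, plane_res)
    return nerf_decoder(f_g, f_c)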
Direct Text-to-3D Generation
a baby grand piano
a batman mask
a beautiful white daisy
a biplane with yellow wings
a birthday cupcake
a bowl of food
a brown boot
a campfire
a chair that looks like a tree
a chair that looks like an avocado
a cheeseburger
a diamond ring
a donut with pink icing
a firetruck
a goldfish
a green coffee mug
a hypercube
a kingfisher bird
a lemon cut in half
a minion
a model of a house in Tudor style
a penguin
a pumpkin
a race car
a red convertible car with the top down
a soap dispenser
a spanish galleon sailing on the open sea
a statue of a black dog
a traffic cone
a Transformed Bumblebee robot with intricate body details
a voxelized cupcake made with LEGO
a voxelized dog
a Wall-E
a yellow schoolbus
an astronaut
an elephant
a French throne chair
an orange road bike
the Statue of Liberty, aerial view
Ube ice cream cone
Improving 2D-lifting Methods with a 3D Prior
[Side-by-side comparisons: Ours + DreamFusion vs. DreamFusion]
BibTeX
@inproceedings{liu2024direct,
title={DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data},
author={Liu, Qihao and Zhang, Yi and Bai, Song and Kortylewski, Adam and Yuille, Alan},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={6881--6891},
year={2024}
}