PuzzleFusion
PuzzleFusion: Unleashing the Power of Diffusion Models for Spatial Puzzle Solving
Sepidehsadat Hosseini, Mohammad Amin Shabani, Saghar Irandoust, and Yasutaka Furukawa
PDF Code PuzzleFusion Dataset
Abstract
This paper presents an end-to-end neural architecture based on Diffusion Models for spatial puzzle solving, particularly jigsaw puzzle and room arrangement tasks. In the latter task, for instance, the proposed system takes a set of room layouts as polygonal curves in the top-down view and aligns the room layout pieces by estimating their 2D translations and rotations, akin to solving the jigsaw puzzle of room layouts. A surprising discovery of the paper is that the simple use of a Diffusion Model effectively solves these challenging spatial puzzle tasks as a conditional generation process. To enable learning of an end-to-end neural system, the paper introduces new datasets with ground-truth arrangements:
- 2D Voronoi jigsaw dataset, a synthetic one where pieces are generated by the Voronoi diagram of a 2D point set;
- PuzzleFusion dataset, a real one offered by MagicPlan from its production pipeline, where pieces are room layouts constructed by real-estate consumers using an augmented reality app.

The qualitative and quantitative evaluations demonstrate that our approach outperforms the competing methods by significant margins in all the tasks.
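To make the notion of a piece's "2D translation and rotation" concrete, the following minimal sketch applies a rigid transform to a polygonal room layout. It is an illustration only, not the authors' code: the function name `transform_piece`, the unit-square room, and the chosen pose are all hypothetical, and the sketch assumes rotation about the origin followed by translation.

```python
import math

def transform_piece(vertices, angle_rad, tx, ty):
    """Rotate a polygon about the origin by angle_rad, then translate by (tx, ty).

    `vertices` is a list of (x, y) tuples; a new list is returned.
    """
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in vertices]

# Hypothetical piece: a unit-square "room" with corners in counter-clockwise order.
room = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]

# Hypothetical pose: rotate by 90 degrees, then shift 2 units along x.
placed = transform_piece(room, math.pi / 2, 2.0, 0.0)

for x, y in placed:
    print(f"({x:.2f}, {y:.2f})")
```

In this framing, an arrangement is one such (rotation, translation) pose per piece, which is the quantity the abstract says the system estimates.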
Data
We provide the PuzzleFusion dataset, a large-scale (98,000 samples) real-world dataset with ground-truth labels for house reconstruction and generation tasks. The dataset includes both Manhattan and non-Manhattan samples and was gathered using the MagicPlan app. It is available in both unprocessed and processed versions. The unprocessed version includes more houses, including multi-floor structures. The data reader code is provided here. Please also check the README in the folder before using it.
Citation
@misc{hosseini2023puzzlefusion,
title={PuzzleFusion: Unleashing the Power of Diffusion Models for Spatial Puzzle Solving},
author={Sepidehsadat Hosseini and Mohammad Amin Shabani and Saghar Irandoust and Yasutaka Furukawa},
year={2023},
eprint={2211.13785},
archivePrefix={arXiv},
primaryClass={cs.AI}}