Promising or Elusive? Unsupervised Object Segmentation
from Real-world Single Images
NeurIPS 2022
Abstract
In this paper, we study the problem of unsupervised object segmentation from single images. We do not introduce a new algorithm, but systematically investigate the effectiveness of existing unsupervised models on challenging real-world images. We first introduce four complexity factors to quantitatively measure the distributions of object- and scene-level biases in appearance and geometry for datasets with human annotations. With the aid of these factors, we empirically find that, not surprisingly, existing unsupervised models catastrophically fail to segment generic objects in real-world images, although they can easily achieve excellent performance on numerous simple synthetic datasets, due to the vast gap in objectness biases between synthetic and real images. By conducting extensive experiments on multiple groups of ablated real-world datasets, we ultimately find that the key factors underlying the colossal failure of existing unsupervised models on real-world images are the challenging distributions of object- and scene-level biases in appearance and geometry. Because of this, the inductive biases introduced in existing unsupervised models can hardly capture the diverse object distributions. Our research results suggest that future work should exploit more explicit objectness biases in the network design.
Unsupervised Segmentation Performance
[Figure: unsupervised segmentation results on synthetic datasets (training / testing) versus real-world datasets (training / testing)]
Complexity Factors
Object Color Gradient
Given an RGB image, we first convert it to grayscale, then compute its horizontal and vertical gradients. To avoid the influence of the background, we discard gradients on the object boundary. The final score is the average of the remaining inner gradients.
Object Shape Concavity
Given the binary mask of an object, we first find the smallest convex polygon (convex hull) that encloses the object. The factor value is computed as 1 - (object area / convex hull area).
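For concreteness, the two object-level factors could be computed roughly as follows with NumPy and OpenCV, assuming a per-object binary mask is available; the function names and the boundary-erosion kernel are illustrative choices rather than the authors' reference code.

import cv2
import numpy as np

def object_color_gradient(image_rgb, mask):
    # Average grayscale gradient magnitude inside the object, excluding the boundary.
    gray = cv2.cvtColor(image_rgb, cv2.COLOR_RGB2GRAY).astype(np.float32) / 255.0
    gy, gx = np.gradient(gray)                      # vertical / horizontal gradients
    grad = np.sqrt(gx ** 2 + gy ** 2)
    # Erode the mask so gradients on the object boundary are ignored.
    inner = cv2.erode(mask.astype(np.uint8), np.ones((3, 3), np.uint8)).astype(bool)
    return float(grad[inner].mean()) if inner.any() else 0.0

def object_shape_concavity(mask):
    # 1 - (object area / convex hull area) for a binary object mask.
    mask_u8 = mask.astype(np.uint8)
    contours, _ = cv2.findContours(mask_u8, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    hull = cv2.convexHull(np.concatenate(contours, axis=0))
    hull_mask = np.zeros_like(mask_u8)
    cv2.fillPoly(hull_mask, [hull], 1)
    return 1.0 - mask_u8.sum() / max(hull_mask.sum(), 1)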
Inter-object Color Similarity
Given an image consisting of multiple objects, we first calculate the average RGB color of each object, then average the Euclidean distance in RGB space between every pair of objects. The factor value is computed as 1 - normalized average distance.
Inter-object Shape Variation
We calculate the diagonal length of the bounding box of each object. The averaged variation of these diagonal lengths is normalized to give the final factor value.
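Similarly, the two scene-level factors could be sketched as below, given the image and a list of per-object masks; the exact normalization constants are assumptions, since several choices are possible.

import itertools
import numpy as np

def inter_object_color_similarity(image_rgb, masks):
    # 1 - normalized average pairwise distance between mean object colors in RGB space.
    mean_colors = [image_rgb[m].mean(axis=0) for m in masks]
    dists = [np.linalg.norm(c1 - c2) for c1, c2 in itertools.combinations(mean_colors, 2)]
    max_dist = np.sqrt(3) * 255.0                   # largest possible distance in RGB space
    return 1.0 - float(np.mean(dists)) / max_dist

def inter_object_shape_variation(masks):
    # Normalized variation of bounding-box diagonal lengths across objects.
    diagonals = []
    for m in masks:
        ys, xs = np.nonzero(m)
        h, w = ys.max() - ys.min() + 1, xs.max() - xs.min() + 1
        diagonals.append(np.sqrt(h ** 2 + w ** 2))
    diagonals = np.asarray(diagonals, dtype=np.float32)
    # Spread relative to the mean diagonal (one of several plausible normalizations).
    return float(diagonals.std() / (diagonals.mean() + 1e-8))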
Ablations
C: Single Color Ablation
Remove the color gradient inside each object such that: Object Color Gradient is effectively reduced; Inter-object Color Similarity remains similar.
S: Convex Shape Ablation
Make the shape of each object convex such that: Object Shape Concavity is effectively reduced; Inter-object Shape Variation remains similar.
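The two object-level ablations amount to simple per-object edits. A minimal sketch, assuming ground-truth masks: fill each object with its mean color (C), and replace each mask with its convex hull (S); how the newly covered pixels are colored for S is left unspecified here.

import cv2
import numpy as np

def single_color_ablation(image_rgb, masks):
    # C: paint every object with its mean color, removing color gradients inside objects.
    out = image_rgb.copy()
    for m in masks:
        out[m] = image_rgb[m].mean(axis=0).astype(np.uint8)
    return out

def convex_shape_ablation(masks):
    # S: replace each object mask with its convex hull, removing shape concavity.
    new_masks = []
    for m in masks:
        m_u8 = m.astype(np.uint8)
        contours, _ = cv2.findContours(m_u8, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
        hull = cv2.convexHull(np.concatenate(contours, axis=0))
        hull_mask = np.zeros_like(m_u8)
        cv2.fillPoly(hull_mask, [hull], 1)
        new_masks.append(hull_mask.astype(bool))
    return new_masks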
T: Texture Replaced Ablation
Replace the texture of every object with a distinctive texture such that: Object Color Gradient remains similar; Inter-object Color Similarity is effectively reduced.
U: Uniform Scale Ablation
Rescale all objects to a uniform size such that: Object Shape Concavity remains similar; Inter-object Shape Variation is effectively reduced.
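Of the two scene-level ablations, the uniform-scale one is the easier to illustrate. Below is a rough sketch that rescales every object to a common bounding-box diagonal and pastes it back at its original center; the target size, background handling, and compositing order are all assumptions, and the texture-replaced ablation is omitted since it needs an external set of distinctive textures.

import cv2
import numpy as np

def uniform_scale_ablation(image_rgb, masks, target_diag=100.0):
    # U: re-render every object at a common bounding-box diagonal on a blank canvas.
    canvas = np.zeros_like(image_rgb)
    for m in masks:
        ys, xs = np.nonzero(m)
        y0, y1, x0, x1 = ys.min(), ys.max() + 1, xs.min(), xs.max() + 1
        crop_img = image_rgb[y0:y1, x0:x1]
        crop_msk = m[y0:y1, x0:x1].astype(np.uint8)
        scale = target_diag / np.sqrt((y1 - y0) ** 2 + (x1 - x0) ** 2)
        new_w = max(int((x1 - x0) * scale), 1)
        new_h = max(int((y1 - y0) * scale), 1)
        crop_img = cv2.resize(crop_img, (new_w, new_h), interpolation=cv2.INTER_LINEAR)
        crop_msk = cv2.resize(crop_msk, (new_w, new_h), interpolation=cv2.INTER_NEAREST)
        # Re-center the rescaled object at its original bounding-box center.
        cy, cx = (y0 + y1) // 2, (x0 + x1) // 2
        ty, tx = max(cy - new_h // 2, 0), max(cx - new_w // 2, 0)
        ty2, tx2 = min(ty + new_h, canvas.shape[0]), min(tx + new_w, canvas.shape[1])
        region = crop_msk[: ty2 - ty, : tx2 - tx].astype(bool)
        canvas[ty:ty2, tx:tx2][region] = crop_img[: ty2 - ty, : tx2 - tx][region]
    return canvas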
Qualitative Results from Ablation
Full Ablation
[Figure: qualitative segmentation results under the full ablation on YCB, ScanNet, and COCO]
Object-level Ablation
[Figure: qualitative results under the object-level ablations (C: Single Color, S: Convex Shape)]
Scene-level Ablation
[Figure: qualitative results under the scene-level ablations (T: Texture Replaced, U: Uniform Scale)]
Quantitative Results from Ablation
Complexity Factor Distributions
Quantitative Segmentation Performance
Video
Short Demo (40s)
Long Presentation (11 min)
BibTeX
If you find this work useful for your research, please cite:
@inproceedings{yang2022,
  title={{Promising or Elusive? Unsupervised Object Segmentation from Real-world Single Images}},
  author={Yang, Yafei and Yang, Bo},
  booktitle={NeurIPS},
  year={2022},
}
© This page takes inspiration from https://imagine.enpc.fr/~monniert/DTIClustering/.