| CARVIEW |
Select Language
HTTP/2 301
server: GitHub.com
content-type: text/html
location: https://georgegu1997.github.io/OSSID/
x-github-request-id: 9D62:2B0FD4:90F0E1:A28BBD:6952A33D
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 15:50:22 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210092-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767023422.870069,VS0,VE206
vary: Accept-Encoding
x-fastly-request-id: 2e064fb893863e7662aafed663b2e86813bfa8cd
content-length: 162
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Tue, 05 Apr 2022 21:02:55 GMT
access-control-allow-origin: *
etag: W/"624cae7f-2a8f"
expires: Mon, 29 Dec 2025 16:00:22 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: FD55:3A7A40:901A18:A1B714:6952A33D
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 15:50:22 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210092-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767023422.089637,VS0,VE220
vary: Accept-Encoding
x-fastly-request-id: 15bf0286e6043a9d1147e12eb13052b653816146
content-length: 3171
OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation
OSSID: Online Self-Supervised Instance Detection
(and for) Pose Estimation
(and for) Pose Estimation
|
|
|
|
|
|
|
|
|
| We propose a self-supervised learning pipeline for object instance detection by pose estimation. The results of a zero-shot pose estimation network are used to finetune a zero-shot detector online. Then the detection results in turn provide object bounding boxes and reduce the search space for pose estimation. Without any manual annotation required, both the detector and the pose estimator get better and faster. |
| Real-time object pose estimation is necessary for many robot manipulation algorithms. However, state-of-the-art methods for object pose estimation are trained for a specific set of objects; these methods thus need to be retrained to estimate the pose of each new object, often requiring tens of GPU-days of training for optimal performance. In this paper, we propose the OSSID framework, leveraging a slow zero-shot pose estimator to self-supervise the training of a fast detection algorithm. This fast detector can then be used to filter the input to the pose estimator, drastically improving its inference speed. We show that this self-supervised training exceeds the performance of existing zero-shot detection methods on two widely used object pose estimation and detection datasets, without requiring any human annotations. Further, we show that the resulting method for pose estimation has a significantly faster inference speed, due to the ability to filter out large parts of the image. Thus, our method for self-supervised online learning of a detector (trained using pseudo-labels from a slow pose estimator) leads to accurate pose estimation at real-time speeds, without requiring human annotations. |
Talk
![]() |
Qiao Gu, Brian Okorn, David Held. OSSID: Online Self-Supervised Instance Detection (and for) Pose Estimation In RA-L and ICRA 2022. (hosted on ArXiv) |
Acknowledgements |
