VB-Com: Learning Vision-Blind Composite Humanoid Locomotion Against Deficient Perception

Junli Ren¹,², Tao Huang¹,³, Huayi Wang¹,³, Zirui Wang¹,⁴, Qingwei Ben¹,⁵, Junfeng Long¹, Yanchao Yang², Jiangmiao Pang¹,⁺, Ping Luo¹,²,⁺

¹Shanghai AI Laboratory, ²The University of Hong Kong, ³Shanghai Jiao Tong University,
⁴Zhejiang University, ⁵The Chinese University of Hong Kong

Paper | arXiv | Video | X | Code (coming soon)

Overview

Abstract

The performance of legged locomotion is closely tied to the accuracy and comprehensiveness of state observations. "Blind policies", which rely solely on proprioception, are considered highly robust because proprioceptive observations are reliable. However, these policies significantly limit locomotion speed and often require collisions with the terrain in order to adapt. In contrast, "vision policies" allow the robot to plan motions in advance and respond proactively to unstructured terrains using an online perception module. However, perception is often compromised by noisy real-world environments, potential sensor failures, and the limitations of current simulations in representing dynamic or deformable terrains. Humanoid robots, with high degrees of freedom and inherently unstable morphology, are particularly susceptible to misguidance from deficient perception, which can result in falls or termination on challenging dynamic terrains. To leverage the advantages of both vision and blind policies, we propose VB-Com, a composite framework that enables humanoid robots to determine when to rely on the vision policy and when to switch to the blind policy under perceptual deficiency. We demonstrate that VB-Com effectively enables humanoid robots to traverse challenging terrains and obstacles despite perception deficiencies caused by dynamic terrains or perceptual noise.

Avoid Dynamic and Static Obstacles

Avoid low-speed robots

Recover from high-speed collision

Consecutive Obstacle Avoidance

H1 vs Dynamic Obstacles

Traverse Hurdles

Deficient Perception

Comprehensive Perception

Sudden Falling Hurdles

Step Recovery on Gaps

G1

H1


Framework

VB-Com makes the following contributions:

We develop a perceptive and a non-perceptive humanoid locomotion policy, each of which can traverse gaps and hurdles and avoid obstacles.
We propose a novel hardware-deployable return estimator that predicts the future returns achieved by the current policy, conditioned on proprioceptive state observations.
We design a dual-policy composition system that integrates the perceptive and non-perceptive policies for robust locomotion through dynamic obstacles and terrains where onboard sensors provide deficient external perception.
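
To make the composition idea concrete, the following is a minimal Python sketch of how two return estimators might arbitrate between a vision policy and a blind policy. It is an illustration under stated assumptions, not the authors' implementation: the class name `CompositePolicy`, the `margin` parameter, the switching rule (fall back to the blind policy only when its estimated return exceeds the vision policy's by more than `margin`), and the toy estimators are all hypothetical. The only property taken from the description above is that both return estimates depend on proprioceptive observations alone.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Observation = List[float]
Action = List[float]


@dataclass
class CompositePolicy:
    """Hypothetical dual-policy selector driven by estimated returns."""

    vision_policy: Callable[[Observation, Observation], Action]
    blind_policy: Callable[[Observation], Action]
    # Return estimators see proprioception only, so they remain usable
    # when the external perception is unreliable.
    vision_return: Callable[[Observation], float]
    blind_return: Callable[[Observation], float]
    margin: float = 0.0  # assumed hysteresis term, not from the source

    def act(self, proprio: Observation, perception: Observation) -> Tuple[str, Action]:
        g_vision = self.vision_return(proprio)
        g_blind = self.blind_return(proprio)
        # Assumed rule: prefer vision unless the blind policy is
        # predicted to do clearly better from the current state.
        if g_blind > g_vision + self.margin:
            return "blind", self.blind_policy(proprio)
        return "vision", self.vision_policy(proprio, perception)


# Toy stand-ins: proprio[0] plays the role of an "unexpected contact" signal.
composite = CompositePolicy(
    vision_policy=lambda proprio, perception: [1.0],
    blind_policy=lambda proprio: [0.2],
    vision_return=lambda proprio: 1.0 - proprio[0],
    blind_return=lambda proprio: 0.5,
    margin=0.1,
)

nominal_choice = composite.act([0.0], [0.0])    # perception agrees with reality
collision_choice = composite.act([0.9], [0.0])  # perception missed an obstacle
```

In this toy setup the selector stays with the vision policy while nothing unexpected is felt, and hands control to the blind policy once the proprioceptive signal suggests the perceived terrain was wrong. In practice the estimators would be learned networks rather than hand-written lambdas.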
