Learning Multi-Agent Loco-Manipulation for Long-Horizon Quadrupedal Pushing
Our method coordinates multiple quadrupeds to push a large object to its target location in environments with obstacles.
Abstract
Recently, quadrupedal robots have achieved significant success in locomotion, but their manipulation capabilities, particularly for handling large objects, remain limited, restricting their usefulness in demanding real-world applications such as search and rescue, construction, industrial automation, and room organization. This paper tackles the task of obstacle-aware, long-horizon pushing by multiple quadrupedal robots. We propose a hierarchical multi-agent reinforcement learning framework with three levels of control. The high-level controller integrates an RRT planner and a centralized adaptive policy to generate subgoals, while the mid-level controller uses a decentralized goal-conditioned policy to guide the robots toward these subgoals. A pre-trained low-level locomotion policy executes the movement commands. We evaluate our method against several baselines in simulation, achieving a 36.0% higher success rate and a 24.5% shorter completion time than the best baseline. Our framework successfully enables long-horizon, obstacle-aware manipulation tasks such as Push-Cuboid and Push-T on Go1 robots in the real world.
Methodology
To enable quadrupedal robots to collaboratively perform long-horizon pushing tasks in environments with obstacles, we propose a hierarchical reinforcement learning framework composed of three levels of controllers.
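To make the composition of the three levels concrete, here is a minimal, illustrative Python sketch of how an RRT-planned waypoint path and the three controllers might be wired together at inference time. Everything in it (`rrt_plan`, `high_policy`, `mid_policies`, `low_policy`, the circular-obstacle format, and the workspace bounds) is an assumption made for this example, not the authors' implementation.

```python
import numpy as np

# Illustrative sketch only: all names, the obstacle format, and the workspace
# bounds are assumptions for this example, not the authors' implementation.

def rrt_plan(start, goal, obstacles, step=0.3, max_iters=5000, goal_bias=0.1):
    """Plain RRT in the 2D plane: grow a tree from `start` until a node lands
    within one step of `goal`, then backtrack to recover the waypoint path.
    `obstacles` is a list of (center, radius) circles."""
    start, goal = np.asarray(start, float), np.asarray(goal, float)
    nodes, parents = [start], {0: None}
    rng = np.random.default_rng(0)
    for _ in range(max_iters):
        # Sample a random point, occasionally biased toward the goal.
        sample = goal if rng.random() < goal_bias else rng.uniform(-5.0, 5.0, size=2)
        near = min(range(len(nodes)), key=lambda i: np.linalg.norm(nodes[i] - sample))
        delta = sample - nodes[near]
        new = nodes[near] + step * delta / (np.linalg.norm(delta) + 1e-8)
        if any(np.linalg.norm(new - np.asarray(c, float)) < r for c, r in obstacles):
            continue  # discard samples that land inside an obstacle
        parents[len(nodes)] = near
        nodes.append(new)
        if np.linalg.norm(new - goal) < step:  # goal region reached: backtrack
            path, i = [], len(nodes) - 1
            while i is not None:
                path.append(nodes[i])
                i = parents[i]
            return path[::-1]
    return None  # no path found within the iteration budget

def control_step(waypoints, object_pose, robot_states,
                 high_policy, mid_policies, low_policy):
    """One tick of the hierarchy: the centralized high-level policy turns the
    planned waypoints into per-robot subgoals, each robot's decentralized
    mid-level policy maps its subgoal to a velocity command, and the shared
    pre-trained low-level locomotion policy produces joint targets."""
    subgoals = high_policy(object_pose, waypoints, robot_states)
    joint_targets = []
    for robot, subgoal, mid in zip(robot_states, subgoals, mid_policies):
        vel_cmd = mid(robot, subgoal)                     # decentralized
        joint_targets.append(low_policy(robot, vel_cmd))  # locomotion
    return joint_targets

# Example: plan an object path around a single circular obstacle.
waypoints = rrt_plan(start=[0.0, 0.0], goal=[4.0, 0.0],
                     obstacles=[([2.0, 0.0], 0.5)])
```

In this sketch the high-level policy runs centrally over the full multi-robot state, while each robot evaluates its own mid-level policy, which mirrors the centralized-planning, decentralized-execution split described above.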
Summary of Main Results
Comparisons to Baselines
Push-Cuboid: Ours (✔) · Single-Robot () · High-Level + Low-Level () · Mid-Level + Low-Level ()
Push-T: Ours (✔) · Single-Robot () · High-Level + Low-Level () · Mid-Level + Low-Level ()
Push-Cylinder: Ours (✔) · Single-Robot () · High-Level + Low-Level (✔🕑) · Mid-Level + Low-Level (✔)
Ablation Study: The Occlusion-Based (OCB) Reward
With the OCB Reward: Case 1 (✔) · Case 2 (✔) · Case 3 (✔)
Without the OCB Reward: Case 1 () · Case 2 () · Case 3 ()
Ablation Study: The High-Level Adaptive Policy
With the Adaptive Policy (✔) · RRT-Planned Trajectory