Scaling Short-Term Memory of Visuomotor Policies for Long-Horizon Tasks
Abstract
Method Overview
PRISM applies gated attention to filter historical context and a hierarchical architecture to scale attention over long interaction histories. Together, these let causal transformer policies trained with behavior cloning handle noisy histories while reducing computation.
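As a rough illustration of the idea of gating attention over a history, the sketch below shows one plausible reading: causal attention in which a sigmoid gate scales each past step's contribution. This is not PRISM's actual implementation. The function name `gated_causal_attention`, the per-step `gate_logits` input, and the placement of the gate are all assumptions made for illustration, and the hierarchical component is omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def gated_causal_attention(queries, keys, values, gate_logits):
    """Hypothetical gated causal attention (illustrative only).

    queries, keys, values: lists of T vectors (lists of floats).
    gate_logits: list of T floats, one per history step; in a learned
    model these would be predicted, here they are supplied directly.
    Returns a list of T output vectors.
    """
    scale = 1.0 / math.sqrt(len(queries[0]))
    outputs = []
    for t, q in enumerate(queries):
        # Causal mask: step t may only attend to steps 0..t.
        weights = softmax([dot(q, keys[s]) * scale for s in range(t + 1)])
        out = [0.0] * len(values[0])
        for s, w in enumerate(weights):
            # A gate near 0 suppresses step s; a gate near 1 keeps it.
            g = sigmoid(gate_logits[s])
            for i, v in enumerate(values[s]):
                out[i] += w * g * v
        outputs.append(out)
    return outputs


# Two history steps; the first is gated off, the second is kept.
outputs = gated_causal_attention(
    queries=[[1.0], [1.0]],
    keys=[[0.0], [0.0]],
    values=[[2.0], [4.0]],
    gate_logits=[-50.0, 50.0],
)
```

In this toy setup the gate is applied after the softmax, so a suppressed step contributes almost nothing to the output even when its attention weight is large. A learned model could instead gate the keys or values before attention; the source does not specify which.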
ReMemBench Categories
ReMemBench is designed to evaluate short-term memory in visuomotor policies. Guided by the cognitive science literature, we decompose short-term memory into several functional categories. Covering diverse categories encourages the development of general memory mechanisms rather than custom, non-generalizable solutions for a particular task. Each category is instantiated with two household-manipulation tasks. Below are videos of each category.
Spatial Memory
Prospective Memory
Object-Associative Memory
Object-Set Memory
Real-World Rollouts
We evaluate PRISM on a real-world adaptation of the 'Wash and Return to Container' task from ReMemBench. Below are visualizations of two successful PRISM rollouts.