| CARVIEW |
Temporal Logic Imitation: Learning Plan-Satisficing Motion Policies from Demonstrations
Yanwei Wang
Nadia Figueroa
Shen Li
Ankit Shah
Julie Shah
Interactive Robotics Lab Massachusetts Institute of Technology
Abstract
Paper
Yanwei Wang, Nadia Figueroa, Shen Li, Ankit Shah, Julie Shah
arxiv / review / code / PBS News Coverage
CoRL 2022 (Oral, acceptance rate: 6.5%)
IROS 2023 Workshop (Best Student Paper, Learning Meets Model-based Methods for Manipulation and Grasping Workshop)
Teaser
Talk
|
Generically Learned Motion Policy / Motion Policy with Stability Guarantee
|
|
|
Motion Policy without Mode Invariance / with Mode Invariance
|
|
|
Iterative Boundary Estimation of Unknown Mode with Cutting Planes
|
|
|
To modulate motion policies so that they become mode-invariant,
unknown mode boundary is first estimated. Invariance failures detected by sensors
are used to find cutting planes that bound the mode within which DS flows are modulated
to stay.
Note flows that have left the mode will re-enter the mode due to LTL's reactivity,
and iteratively increasingly better boundary estimation is attained.
|
|
More Videos
|
Generalization to New Tasks by Reusing Learned Skills
|
|||
|
Line Inspection Task
|
|||
|
Color Tracing Task
|
|||
|
Scooping Task
|
|||
MIT Museum Demo
| A permanent interactive exhibition at MIT museum of programming robots via demonstrations |
Related Works - Extending TLI by Learning instead of Engineering Sensor Models
|
Yanwei Wang, Tsun-Hsuan Wang, Jiayuan Mao, Michael Hagenow, Julie Shah arxiv / code / project page ICLR 2024 (Spotlight, acceptance rate: 5%) This work learns grounding classifiers for LLM planning. By locally perturbing a few human demonstrations, we augment the dataset with more successful executions and failing counterfactuals. Our end-to-end explanation-based network is trained to differentiate successes from failures and as a by-product learns classifiers that ground continuous states into discrete manipulation mode families without dense labeling. |