Exporters From Japan
Wholesale exporters from Japan   Company Established 1983
CARVIEW
Select Language

Experiments

AHA-13B Outperforms 6 SoTA VLMs in Failure reasoning for robotic manipulation Across 3 diverse datasets and 4 evaluation metrics.

Scaling law with AHA

AHA scales in performance with data

Example failure reasoning from VLMs

AHA generalizes failure reasoning across embodiments, unseen domains, and novel tasks.

Evaluation Datasets

Maniskill-Fail Dataset



AHA dataset (Test)

RoboFail Dataset

AHA Downstream Robotic Applications


VLM Reward Function Generation (Eureka)

<b>Task: Pick up the mustard bottle</b>, AHA Improved Reward Function:
                        [sep]
                        static/txt/reward_functions/maniskill/task1.txt <b>Task: Push T to shaped T area</b>, AHA Improved Reward Function:
                        [sep]
                        static/txt/reward_functions/maniskill/task2.txt <b>Task: Put sphere into sphere holder</b>, AHA Improved Reward Function:
                        [sep]
                        static/txt/reward_functions/maniskill/task3.txt <b>Task: Stack the red cube onto the green cube</b>, AHA Improved Reward Function:
                        [sep]
                        static/txt/reward_functions/maniskill/task4.txt <b>Task: Open the drawer</b>, AHA Improved Reward Function:
                        [sep]
                        static/txt/reward_functions/maniskill/task5.txt

VLM Task-plan Generation (PRoC3S)

<b>Task: Stack banana onto spam can</b>, AHA Improved Task-plan:
                        [sep]
                        static/txt/reward_functions/tamp/t1.txt <b>Task: Stack two cubes inside a bowl</b>, AHA Improved Task-plan:
                        [sep]
                        static/txt/reward_functions/tamp/t2.txt <b>Task: Move the banana to the centre</b>, AHA Improved Task-plan:
                        [sep]
                        static/txt/reward_functions/tamp/t3.txt

VLM Sub-task Verification (Manipulate Anything)