Hang Yin is currently a PhD student in the Department of Automation, Tsinghua University. His research interests include computer vision and embodied navigation.
We propose a training-free object-goal navigation framework that leverages a large language model (LLM) and vision foundation models (VFMs). We construct a hierarchical 3D scene graph online and prompt the LLM to exploit the structural information contained in its subgraphs for zero-shot decision making.
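To make the idea of prompting with subgraphs concrete, the following is a minimal sketch rather than the framework's actual implementation. It assumes a three-level hierarchy (room, group, object), which the description above does not specify, and every name in it (`Node`, `add_child`, `describe_subgraph`, `build_prompt`) is hypothetical. It only serializes a subgraph into text for a prompt; the LLM call and the perception pipeline that would populate the graph are omitted.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One scene-graph node. The level names are an assumption of this sketch."""
    name: str
    level: str  # assumed levels: "room", "group", "object"
    children: list["Node"] = field(default_factory=list)
    # Same-level relations stored as (relation, other node name) pairs.
    edges: list[tuple[str, str]] = field(default_factory=list)


def add_child(parent: Node, child: Node) -> Node:
    """Attach `child` under `parent`, as would happen when a new node is observed."""
    parent.children.append(child)
    return child


def describe_subgraph(root: Node, indent: int = 0) -> list[str]:
    """Flatten the subgraph rooted at `root` into indented text lines."""
    lines = [f"{'  ' * indent}- {root.level}: {root.name}"]
    for relation, other in root.edges:
        lines.append(f"{'  ' * (indent + 1)}* {relation} {other}")
    for child in root.children:
        lines.extend(describe_subgraph(child, indent + 1))
    return lines


def build_prompt(subgraph_root: Node, goal: str) -> str:
    """Build a hypothetical zero-shot prompt asking about the goal object."""
    description = "\n".join(describe_subgraph(subgraph_root))
    return (
        f"Observed subgraph:\n{description}\n"
        f"Question: how likely is a '{goal}' to be found near this subgraph? "
        "Answer with a number between 0 and 1."
    )


# Toy scene: one room containing one group with two related objects.
living_room = Node("living room", "room")
seating = add_child(living_room, Node("seating area", "group"))
sofa = add_child(seating, Node("sofa", "object"))
sofa.edges.append(("next to", "coffee table"))
add_child(seating, Node("coffee table", "object"))

prompt = build_prompt(living_room, "tv")
print(prompt)
```

In a sketch like this, the text returned by `build_prompt` would be sent to the LLM once per candidate subgraph, and the returned scores could be used to rank where to explore next. That scoring scheme is an illustrative choice, not a detail confirmed by the description above.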