Currently Co-Founder at Synvo AI. We build the contextual intelligence engine. I graduated from MMLab@NTU in June 2025.
Research Projects during my PhD:
Visual Generalist Models (2023-2025): Developing models that process diverse visual data (e.g., images, videos, 3D, audio, IMU) to tackle various tasks in perception, reasoning, generation, robotics, and gaming. Notable projects include EgoLife, Octopus, FunQA, and Otter.
AI Safety for Foundation Models (2023-2024): Investigating how to mitigate hallucinations in large language models (LLMs) and large multimodal models (LMMs). A key contribution is the introduction of UPD (Unsolvable Problem Detection), which tests whether a model withholds its answer when faced with an unsolvable question.
PSG Series (2022-2023): Led the development of the PSG, PVSG, and PSG4D models, focusing on relation modeling for scene understanding. I also collaborated on works like Relate-Anything and PairNet.
OOD Detection (2021-2022): Led a comprehensive survey and developed OpenOOD, a popular codebase for Out-of-Distribution detection in AI safety.
Prompt Tuning (2022): Contributed to foundational works like CoOp and CoCoOp for prompt tuning in vision-language models.