I am currently working as TopMinds at Huawei (base in HK) where I lead a research group of 30+ excellent researchers focusing on Multi-modal (Includes both vision understanding and generation) and AI Agent.
Before that, I was a senior researcher at SenseTime Group where I investigated on-device multi-modal models including vision language models (VLMs) and diffusion models (DMs).