Biqing Qi is currently a XingQi Researcher at the Shanghai AI Lab and the Postdoctoral Fellow at the University of Hong Kong, collaborating with Professor Yi Ma. He received his Ph.D. from the Key Laboratory of Autonomous Intelligent Unmanned Systems (AIUS) at Harbin Institute of Technology, under the joint supervision of the Center for Collaborative & Conversational Intelligence (C3I) at Tsinghua University, guided by Professors Bowen Zhou and Ligang Wu. He serves as a Member of the Technology Center at the Chinese Academy of Engineering (CAE), a Committee Member for the National Supply Chain AI Application Platform, and a Member of the Embodied Intelligence Committee of the Chinese Information Processing Society. His research focuses on machine learning and natural language processing. He has published over 60 papers in top-tier conferences and journals, including NeurIPS, CVPR, ACL, and TPAMI, of which more than 20 are first-author or corresponding-author. Key contributions include: 1) Co-proposing the “General-Specialized Integration Intelligence” pathway towards AGI with Prof. Bowen Zhou’s team, developing dynamic architectures such as Nirvana, and advancing hardware-software co-design and early stroke screening—work covered by People’s Daily; 2) Proposing an interactive continual learning framework and the SDAR model, establishing a leading open-source implementation; 3) Pioneering the early validation of LLM-driven autonomous hypothesis generation, building the MARTI multi-agent training-inference system, and achieving significant breakthroughs in computational chemistry. His work has garnered significant media attention and has been implemented in leading technology companies such as Tencent, ByteDance, and Xianyuan. He has led two national major projects, one Shanghai municipal major project and one project under the National Natural Science Foundation of China.
齐弼卿,上海人工智能实验室星启研究员,港大博士后(合作导师马毅教授),哈工大、清华联培博士,博士生导师周伯文教授与吴立刚教授。现任中国工程院技术专班成员、国家供应链AI平台建设咨询委员会委员、中文信息学会具身智能专委会委员。长期致力于机器学习与自然语言处理前沿研究,在NeurIPS、CVPR、ACL、TPAMI等高水平会议期刊发表论文60余篇,其中第一/通讯作者论文20余篇。主要贡献包括:1)与周伯文教授团队共同提出“通专融合智能”AGI路径,并研发Nirvana等动态通专架构,推动软硬件协同设计与脑卒中早期筛查,成果获《人民日报》报道;2)提出交互式持续学习框架驱动SDAR模型构建,建立领先开源实现,月下载超3万次;3)早期验证大模型驱动独立假设提出研究范式,构建MARTI多智能体训推一体系统,在计算化学领域取得Nature级别科学新发现。相关研究成果在腾讯、阿里、字节及天坛医院等落地闭环。主持4项国家重大课题(亿级)、上海市重大课题(亿级)及国家自然基金项目。
If you are seeking any form of academic collaborations with Shanghai AI Lab or AIUS, SCIR Lab at HIT and Tsinghua C3I Lab, please feel free to email me at qibiqing7@gmail.com or qibiqing@pjlab.org.cn
🔥 News
- 2025.11: 🎉 Two papers are accepted by AAAI 2026
- 2025.11: 🔥”Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism” released on Project Page Paper Link
- 2025.09: 🔥”ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data” released on Project Page Paper Link
- 2025.09: 🎉 Three papers are accepted by NeurIPS 2025
- 2025.09: 🔥”A Survey of Reinforcement Learning for Large Reasoning Models” released on Paper Link
- 2025.08: 🔥”InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency” released on Project Page Paper Link
- 2025.08: 🎉 One paper is accepted by EMNLP 2025
- 2025.08: 🔥”SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)” released on Project Page Paper Link
- 2025.07: 🎉 One paper is accepted by ACM MM 2025
- 2025.06: 🔥”MARTI: A Framework for LLM-based Multi-Agent Reinforced Training and Inference” released on Project Page
- 2025.06: 🔥”Scienceboard: Evaluating multimodal autonomous agents in realistic scientific workflows” released on Project Page
- 2025.05: 🎉 three papers are accepted by ACL 2025 (One oral and be invited to pannel discussion, 0.8%)
- 2025.04: 🎉 One paper is accepted by ICML 2025
- 2025.02: 🎉 One paper is accepted by CVPR 2025 (Highlight, Top 2.5%)
- 2025.02: 🔥”Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling” released on Project Page
- 2025.01: 🎉 Two papers are accepted by ICLR 2025 and TCSVT 2025
- 2024.12: 🎉 Two papers are accepted by AAAI 2025 (One Oral)
- 2024.10: 🎉 Four papers are accepted by NeurIPS 2024(One Dataset Track)
- 2024.09: 🎉 Two papers are accepted by EMNLP 2024 (One Findings)
- 2024.07: 🎉 Two papers are accepted by COLM 2024 and ACM MM 2024
- 2024.05: 🎉 Two papers are accepted by ACL 2024 (One Findings)
- 2024.02: 🎉 Two papers are accepted by CVPR 2024 and SPL 2024
- 2023.10: 🎉 Two papers are accepted by NAACL 2024 (Oral)
- 2023.08: 🎉 Two papers are accepted by NeurIPS 2023 and TNNLS 2023
📝 Publications
- Notes:(*)indicates the equal contributions and(†)indicates the corresponding author.
🎙 Multimodal Foundation Models

Arxiv Position Paper Towards Building Specialized Generalist AI with System 1 and System 2 Fusion, Kaiyan Zhang*, Biqing Qi*, Bowen Zhou.

Arxiv Survey Paper A Survey of Reinforcement Learning for Large Reasoning Models, Kaiyan Zhang, …, Zhiyuan Ma, Ganqu Cui, Zhiyuan Liu, Biqing Qi†, Ning Ding, Bowen Zhou.

Technical Report Multimodal Large Language Models InternVL3. 5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency, Weiyun Wang, …, Biqing Qi, Jiaye Ge, Qipeng Guo, Wenwei Zhang, Wanli Ouyang, Limin Wang, Min Dou, Xizhou Zhu, Tong Lu, Dahua Lin, Jifeng Dai, Bowen Zhou, Weijie Su, Kai Chen, Yu Qiao, Wenhai Wang, Gen Luo.

Technical Report Hybrid Diffusion Language Models SDAR: A Synergistic Diffusion-AutoRegression Paradigm for Scalable Sequence Generation, Shuang Cheng, Yihan Bian, Dawei Liu, Yuhua Jiang, Yihao Liu, Linfeng Zhang, Wenhai Wang, Qipeng Guo, Kai Chen, Biqing Qi†, Bowen Zhou
- Low-Cost AR-to-BlockDiffusion
- 2-4× Faster Inference
- Advanced performance on science reasoning bechmarks (e.g., GPQA and ChemBench)

Arxiv Hybrid Model Architecture Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism, Yuhua Jiang, Shuang Cheng, Yihao Liu, Ermo Hua, Che Jiang, Weigao Sun, Yu Cheng, Feifei Gao, Biqing Qi†, Bowen Zhou

CVPR 2024 Continual Learning Cognition-Inspired Interactive continual learning: Fast and slow thinking, Biqing Qi, Xinquan Chen, Junqi Gao, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou,
- This work was the first to propose the concept of interactive continual learning.
- Instantiated through the Cognitive Complementarity Theory (System1 and System2).
- An advanced continual learning framework with the novel structured key-value pairs memory unit.
- A potential framework to develop Specialized Generalist AI.

ACL 2025 Alignment (Oral) Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process, Eermo Hua, Biqing Qi†, Kaiyan Zhang, Yue Yu, Ning Ding, Xintai Lv, Kai Tian, Bowen Zhou.

NeurIPS 2025 Reasoning Reinforcement Learning TTRL: Test-time reinforcement learning, Yuxin Zuo, Kaiyan Zhang, Shang Qu, Li Sheng, Xuekai Zhu, Biqing Qi, Youbang Sun, Ganqu Cui, Ning Ding, Bowen Zhou.

TCSVT 2025 Continual Learning Contrastive Augmented Graph2Graph Memory Interaction for Few Shot Continual Learning, Biqing Qi, Junqi Gao, Xingquan Chen, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou.

ICML 2025 Position Embedding Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization, Ermo Hua, Che Jiang, Xingtai Lv, Kaiyan Zhang, Ning Ding, Youbang Sun, Biqing Qi†, Yuchen Fan, Xue Kai Zhu, Bowen Zhou.
ArxivDiffusion Language ModelsMask Tokens as Prophet: Fine-Grained Cache Eviction for Efficient dLLM Inference, Jianuo Huang, Yaojie Zhang, Yicun Yang, Benhao Huang, Biqing Qi, Dongrui Liu, Linfeng ZhangArxivDiffusion Language ModelsSelf Speculative Decoding for Diffusion Large Language Models, Yifeng Gao, Ziang Ji, Yuxuan Wang, Biqing Qi, Hanlin Xu, Linfeng ZhangArxivDiffusion Language ModelsSequential Diffusion Language Models, Yangzhou Liu, Yue Cao, Hao Li, Gen Luo, Zhe Chen, Weiyun Wang, Xiaobo Liang, Biqing Qi, Lijun Wu, Changyao Tian, Yanting Zhang, Yuqiang Li, Tong Lu, Yu Qiao, Jifeng Dai, Wenhai Wang-
ArxivDiffusion Language ModelsThinking Inside the Mask: In-Place Prompting in Diffusion LLMs, Xiangqi Jin, Yuxuan Wang, Yifeng Gao, Zichen Wen, Biqing Qi, Dongrui Liu, Linfeng Zhang NeurIPS 2024Countinual LearningAn Efficient Memory Module for Graph Few-Shot Class-Incremental Learning, Dong Li, Aijia Zhang, Junqi Gao, Biqing Qi†.NAACL 2024ReasoningPaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning, Xuekai Zhu, Biqing Qi, Kaiyan Zhang, Xinwei Long, Zhouhan Lin, Bowen Zhou.ArxivAlignmentOnline DPO: Online Direct Preference Optimization with Fast-Slow Chasing, Biqing Qi, Pengfei Li, Fangyuan Li, Junqi Gao, Kaiyan Zhang, Bowen Zhou.ACL 2024 (Findings)Model ArchitectureSMR: State Memory Replay for Long Sequence Modeling, Biqing Qi, Junqi Gao, Kaiyan Zhang, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou.EMNLP 2024 (Findings)Model ArchitectureOn the token distance modeling ability of higher RoPE attention dimension, Xiangyu Hong, Che Jiang, Biqing Qi†, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou.NeurIPS 2024Model ArchitectureNeural Residual Diffusion Models for Deep Scalable Vision Generation,Zhiyuan Ma, Liangliang Zhao, Biqing Qi, Bowen Zhou.ACM MM 2025Sturctured MemoryT-GRAG: Temporal Graph Retrieval Augmented Generation, Dong Li, Yichen Niu, Ying Ai, Xiang Zou, Biqing Qi†, Jianxing Liu.AAAI 2025Optimizer(Oral) Fast and Slow Gradient Approximation for Binary Neural Network Optimization, Xinquan Chen, Junqi Gao, Biqing Qi†, Dong Li, Yiang Luo, Fangyuan Li, Pengfei Li.
🌱 Multi-Agents Systems

Technical Report Multi Agent Systems Marti: A framework for multi-agent llm systems reinforced training and inference, Kaiyan Zhang, …, Youbang Sun, Zhiyuan Ma, Ganqu Cui, Lei Bai, Ning Ding, Biqing Qi†, Bowen Zhou.

CVPR 2025 Model Merging (Highlight) Less is More: Efficient Model Merging with Binary Task Switch, Biqing Qi, Fangyuan Li, Zhen Wang, Junqi Gao, Dong Li, Peng Ye, Bowen Zhou.

NeurIPS 2025 Model Merging Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration, Junqi Gao, Zhichang Guo, Dazhi Zhang, Dong Li, Runze Liu, Pengfei Li, Kai Tian, Biqing Qi†.

Arxiv Test Time Scaling Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling, Runze Liu, Junqi Gao, Jian Zhao, Kaiyan Zhang, Xiu Li, Biqing Qi†, Wanli Ouyang and Bowen Zhou.

AAAI 2026 Test Time Scaling GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning, Jian Zhao, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, Zhouyi Qian, Biqing Qi†, Xiu Li, Bowen Zhou.

ACL 2025 Test Time Scaling Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning, Junqi Gao, Xiang Zou, Ying Ai, Dong Li, Yichen Niu, Biqing Qi†, Jianxing Liu.
👄 Applications

COLM 2024 Scientific Discovery Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation, Biqing Qi, Kaiyan Zhang, Kai Tian, Haoxiang Li, Zhang-Ren Chen, Sihang Zeng, Ermo Hua, Hu Jinfang, Bowen Zhou.

ACL 2025 Scientific Discovery Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System, Haoyang Su, Renqi Chen, SHIXIANG TANG, Zhenfei Yin, Xinzhe Zheng, Jinzhe Li, Biqing Qi, Qi Wu, Hui Li, Wanli Ouyang, Philip Torr, Bowen Zhou, Nanqing Dong.

EMNLP 2025 Scientific Discovery ReviewRL: Towards Automated Scientific Review with RL, Sihang Zeng, Kai Tian, Kaiyan Zhang, Yuru wang, Junqi Gao, Runze Liu, Sa Yang, Jingxuan Li, Xinwei Long, Jiaheng Ma, Biqing Qi†, Bowen Zhou.

Arxiv Gui Agents ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data, Zhaoyang Liu, JingJing Xie, Zichen Ding, Zehao Li, Bowen Yang, Zhenyu Wu, Xuehui Wang, Qiushi Sun, Shi Liu, Weiyun Wang, Shenglong Ye, Qingyun Li, Zeyue Tian, Gen Luo, Xiangyu Yue, Biqing Qi, Kai Chen, Bowen Zhou, Yu Qiao, Qifeng Chen, Wenhai Wang.

Arxiv GUI Agents Scienceboard: Evaluating multimodal autonomous agents in realistic scientific workflows Qiushi Sun, Zhoumianze Liu, Chang Ma, Zichen Ding, Fangzhi Xu, Zhangyue Yin, Haiteng Zhao, Zhenyu Wu, Kanzhi Cheng, Zhaoyang Liu, Jianing Wang, Qintong Li, Xiangru Tang, Tianbao Xie, Xiachong Feng, Xiang Li, Ben Kao, Wenhai Wang, Biqing Qi, Lingpeng Kong, Zhiyong Wu.

Arxiv GUI Agents OS-MAP: How Far Can Computer-Using Agents Go in Breadth and Depth?, Xuetian Chen, Yinghao Chen, Xinfeng Yuan, Zhuo Peng, Lu Chen, Yuekeng Li, Zhoujia Zhang, Yingqian Huang, Leyan Huang, Jiaqing Liang, Tianbao Xie, Zhiyong Wu, Qiushi Sun, Biqing Qi†, Bowen Zhou.

NeurIPS 2024 D&B Track Scientific Discovery (Spotlight) UltraMedical: Building Specialized Generalists in Biomedicine, Kaiyan Zhang, Sihang Zeng, Eermo Hua, Ning Ding, Zhang-Ren Chen, Zhiyuan Ma, Hhaoxiang Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, Bowen Zhou, .

Arxiv Scientific Discovery MolSpectLLM: A Molecular Foundation Model Bridging Spectroscopy, Molecule Elucidation, and 3D Structure Generation, Shuaike Shen, Jiaqing Xie, Zhuo Yang, Antong Zhang, Shuzhou Sun, Ben Gao, Tianfan Fu,Biqing Qi†, Yuqiang Li.

Arxiv Scientific Discovery Chem3DLLM: 3D Multimodal Large Language Models for Chemistry, Lei Jiang, Shuzhou Sun, Biqing Qi, Yuchen Fu, Xiaohua Xu, Yuqiang Li, Dongzhan Zhou, Tianfan Fu.

Arxiv Scientific Discovery SpectrumWorld: Artificial Intelligence Foundation for Spectroscopy, Zhuo Yang, Jiaqing Xie, Shuaike Shen, Daolang Wang, Yeyun Chen, Ben Gao, Shuzhou Sun, Biqing Qi, Dongzhan Zhou, Lei Bai, Linjiang Chen, Shufei Zhang, Jun Jiang, Tianfan Fu, Yuqiang Li.

Arxiv Embodied Agents CliMRS: Cooperative Large-Language-Model Drriven Hyterogeneous Multi-robot Systems, Siqi Song, Xuanbing Xie, Zonglin Li, Yuqiang Li, Shijie Wang, Biqing Qi†.

EMNLP 2024 Embodied Agents MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making, Dayuan Fu*, Biqing Qi†, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou.
🌃 Teams
Team members
- Shijie Wang, Ph.D., Reseacher, Institute of Automation.
Interns
Foundation Models
- Haixv Song, 2025.12-, 4th-yr Ph.D. candidate, Tsinghua University.
- Ermo Hua, 2025,07-, 3th-yr Ph.D. candidate, Tsinghua University.
- Yuhua Jiang, 2025.02-, 2nd-yr Ph.D. candidate, Tsinghua Univeristy.
- Yicheng Gu, 2025.06-, 1st-yr Ph.D. candidate, Tsinghua University.(Joint Supervison)
- Shuang Cheng, 2024.11-, 1st-yr Ph.D. candidate, Zhejiang University.(Joint Supervison)
- Dawei Liu, 2024.11-, 1st-yr Ph.D. candidate, Shanghai Jiao Tong University.(Joint Supervison)
- Haozhen Hou, 2025.12-, 1st-yr Ph.D. candidate, Harbin Institute of Technology.(Joint Supervison)
Multi-Agents Systems
- Yikun Fu, 2025.09-, 1st-yr Ph.D. candidate, Shanghai Jiao Tong Univeristy.(Joint Supervison)
- Xiaowei Sun, 2025.9-, 1st-yr Ph.D. candidate, Fudan University.
Visiting Students
- Junqi Gao, 3th-yr Ph.D. candidate, Harbin Institute of Technology.
- Dong Li, 3th-yr Ph.D. candidate, Harbin Institute of Technology.
- Jian Zhao, 1st-yr Ph.D. candidate, Tsinghua University, IIIS.
Alumni Interns and Visiting Students
- Yihao Liu, Cheng Yang, Yihan Di, Yanlin Pan, Tianhe Lin, Yizhuo Di, Xuetian Chen, Xingfeng Yuan, Yinghao Cheng, Linan Chang, Runze Liu, Xunzhe Zhou, Jing Xiao, Yu Zhang, Yongjia Yu, Qianru Lin, Yifan Hu, Gunbing Zhang.
⚔ Projects
Commodity Price Risk Prediction and Demonstration Application Sep.2023-Sep.2026
- (Key Participants) National Science and Technology Major Project:
- Responsible for the technical planning of Project 2 and leading the team in advancing the construction of the labeling system within LLMs.
Research on Theory and Applications of Human-AI Collaboration with LLMs Jan.2024-Jan.2027
- (Key Participants) National Science and Technology Major Project:
- Responsible for designing the project architecture, planning technical aspects, and overseeing the development of human-machine collaborative systems, along with conducting applied research in knowledge discovery for Project 3.
Cognitive Load Optimization in Human-Machine Collaboration Mar.2023-Dec.2026
- (Participated) Key Research Program of the Ministry of Science and Technology in 2030:
- Responsible for project management within Tsinghua Group, as well as interaction modeling and reflective framework optimization in LLMs.
Research for Product Insight, Design, Development to Marketing Innovation Sep.2023-Dec.2025
- Participated)Beijing Municipal Science and Technology Commission Key Project.
- Responsible for project architecture, planing technical aspects.
Proteomics Data based Knowledge Discovery Mar.2022-Dec.2023
- (Student Lead) Preliminary Research Project for Major Scientific Plan.
- Responsible for project architecture, planning technical aspects, and guiding the design of human-AI systems with respect to hypothesis proposers.
Demonstration of Personified Human-Machine Dialogue System Mar.2020-Dec.2023
- (Participated) Key Research Program of the Ministry of Science and Technology in 2030:
- Responsible for the development of a robust dialogue intent detection method.