| CARVIEW |
Zuxin Liu
Email: zuxin1997 [at] gmail [dot] com
I am a researcher at OpenAI, working on reasoning model post-training. Previously I worked for Salesforce AI Research on LLM Agent. I received my PhD and MS degrees from Carnegie Mellon University, focusing on RL and robotics. I finished my bachelor's degree with honor (President's Award) from Beihang University.
News & Updates
- [2025/05] Our APIGen-MT pipeline and xLAM-2 models have been highlighted in major tech media including VentureBeat, CIO, and ZDNET!
- [2025/04] Our new paper on the agentic data synthesis that powers our xLAM-2 models is released! Check out APIGen-MT for more details.
- [2025/04] Our xLAM-2 models just got an upgrade with multi-turn support! Our 70B model ranks #1 and 32B model ranks #2 on the BFCL function-calling leaderboard—beating GPT-4o, Gemini, Qwen & more. Even our smaller models like xLAM-8B-r lands at #4 , ahead of GPT-4o.
- [2025/01] Our SWE-agent paper: Diversity Empowers Intelligence (DEI), is accepted by ICLR 2025!
- [2024/10] Our APIGen paper is accepted by NeurIPS 2024!
- [2024/09] Check out our xLAM blog post and Technical Report Paper for insights into our Salesforce's Large Action Models.
- [2024/08] We are thrilled to announce the release of the entire xLAM family, our suite of Large Action Models! From the "tiny giant" 1B model to industrial powerhouses 8x22B model. These models have achieved impressive rankings, placing #1 and #6 on the Berkeley Function-Calling Leaderboard. Explore our Hugging Face collection for more details.
- [2024/07] We are excited to announce the release of our two function-calling models: xLAM-1b-fc-r and xLAM-7b-fc-r. These models have achieved impressive rankings, placing #3 and #25 on the Berkeley Function-Calling Leaderboard, outperforming many significantly larger models.
- [2024/06] Check our latest work APIGen, the best open-sourced models for function calling. Our dataset is currently among the Top-3 trending datasets on HuggingFace as of July 4, 2024. See also the Twitter by Salesforce CEO, VentureBeat and 新智元.
- [2024/05] Our paper for efficient and safe RL is accepted by ICML 2024!
- [2024/04] Our RL dataset and benchmark paper is accepted by DMLR Journal! Checkout the website for details!
- [2024/02] We release our multi-LLM-Agent framework AgentLite library and paper!
- [2024/01] I joined Salesforce AI Research as a Research Scientist! Looking forward to working with the amazing team members on LLM Agent!
- [2024/01] Our two papers, one about efficient foundation model adaptation, and one about offline RL, are accepted by ICLR 2024!
- [2024/01] Our paper about robustness certification is accepted by AISTATS 2024!
- [2023/09] Our two papers for safe RL, one about versatile policy learning, and one about inverse constraint learning, are accepted by NeurIPS 2023!
- [2023/06] Our comprehensive datasets, benchmarks, and algorithms for offline safe learning are released! Checkout our website for details!
- [2023/05] A fast safe reinforcement learning framework is released! Checkout our GitHub repo for details!
- [2023/04] Our two papers for safe RL, one about robustness and one about offline learning, are accepted by ICML 2023!
- [2023/01] Our paper about observational robustness in safe RL is accepted by ICLR 2023!
- [2022/12] Our paper about robustness in safe RL win the AI Risk Analysis Award at the 2022 NeurIPS ML Safety Workshop!
- [2022/09] Our work about robustness certification in visual perception is accepted by CoRL 2022.
- [2022/09] Our work about safety evaluation for self-driving vehicles is accepted by NeurIPS 2022.
- [2022/07] Our paper about robustness in safe RL win the best paper runner-up in the SL4AD Workshop at ICML 2022!
- [2022/07] I am glad to present my work about safe RL at Google DeepMind robotics team.
- [2022/05] Our paper about variational inference approach for off-policy safe RL is accepted by ICML 2022!
- [2022/05] I give a talk about recent advances in safe RL at Prof. Fei Fang's lab.
- [2022/04] Our work about safe learning for delivery robot is featured on the front page of CMU news!
- [2022/03] Our paper about LiDAR sensing in autonomous vehicle is accepted by CVPR 2022!
- [2021/11] The autonomous delivery robot that we have built for one year is featured by CMU Engineering.
- [2021/07] We win the Hackathon during my intern at Nuro! Really enjoyed to solve challenging real-world problems for self-driving.
Research Interests
My long-term ambition is to develop AI agents capable of achieving and surpassing human-level
performance in various daily tasks, ultimately freeing humans from repetitive work and enhancing
productivity. Beyond the commonly recognized capabilities like reasoning and planning, I believe
that continual self-evolution (in terms of training) and
self-reflection (during deployment) are also essential traits of truly intelligent
agents. This aligns with the core principles of reinforcement learning (RL), which
I view as a guiding philosophical framework for thinking and studying AI agents.
My research aims to apply the foundational principles—not just the methods—of RL to large language
model (LLM)-based agents, contributing to the promising future of AI Agent systems that that
evolve, learn, and interact in ways that
complement and enhance human capabilities. Currently, I am developing scalable approaches, such as
utilizing synthetic data, to improve models' agentic abilities, and leveraging environmental
feedback to better enable self-learning and reflection, as exemplified like the Software Engineering
Agent.
Publications ( show selected / show all by date / show all by topic )
(* indicates equal contribution.)Topics:
LLM Agent /
Foundation Model /
Reasoning /
RL Algorithms
Past topics:
Embodied AI & Robotics /
Computer Vision & Autonomous Vehicles /
Fun Undergrads Projects
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay
Akshara Prabhakar*, Zuxin Liu*, Ming Zhu, Jianguo Zhang, Tulika Awalgaonkar, Shiyu Wang, Zhiwei Liu, Haolin Chen, Thai Hoang, Juan Carlos Niebles, Shelby Heinecke, Weiran Yao, Huan Wang, Silvio Savarese, Caiming Xiong
NeurIPS 2025 Dataset and Benchmark Track |
Paper |
Website |
Dataset |
Models
Media coverage:
Official
Blog |
VentureBeat
|
CIO
|
ZDNET
|
Yahoo
Finance |
Marktechpost
APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-calling Datasets
Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong
NeurIPS 2024 Dataset and Benchmark Track |
Paper |
Website |
Dataset
|
Model
Media coverage:
VentureBeat
|
The
Stack |
InfoWorld
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Jielin Qiu, Zuxin Liu, Zhiwei Liu, Rithesh Murthy, Jianguo Zhang, Haolin Chen, Shiyu Wang, Ming Zhu, Liangwei Yang, Juntao Tan, Roshan Ram, Akshara Prabhakar, Tulika Awalgaonkar, Zixiang Chen, Zhepeng Cen, Cheng Qian, Shelby Heinecke, Weiran Yao, Silvio Savarese, Caiming Xiong, Huan Wang
xLAM: A Family of Large Action Models to Empower AI Agent Systems
Jianguo Zhang*, Tian Lan*, Ming Zhu*, Zuxin Liu*, Thai Hoang*, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, et. al.
NAACL
2025 (Oral and Spotlight) |
Paper |
Website
|
Model
Media coverage:
Official
Blog |
VentureBeat
|
InfoWorld
|
Investing.com
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Zhiwei Liu, Weiran Yao, Jianguo Zhang, Rithesh Murthy, Liangwei Yang, Zuxin Liu, Tian Lan, Ming Zhu, Juntao Tan, Shirley Kokane, Thai Hoang, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving
Haohong Lin, Wenhao Ding, Zuxin Liu, Yaru Niu, Jiacheng Zhu, Yuming Niu, Ding Zhao
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
Yihang Yao*, Zuxin Liu*, Zhepeng Cen, Jiacheng Zhu, Wenhao Yu, Tingnan Zhang, Ding Zhao.
Learning Shared Safety Constraints from Multi-task Demonstrations
Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Zhiwei Steven Wu.
Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations
Hanjiang Hu, Zuxin Liu, Linyi Li, Jiacheng Zhu, Ding Zhao
AISTATS 2023 | Paper | Code
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu, Zijian Guo, Zhepeng Cen, Huan Zhang, Jie Tan, Bo Li, Ding Zhao.
ICLR 2023 |
Paper
2022 ICML SL4AD Workshop (Best Paper Runner-up)
2022 NeurIPS ML Safety Workshop (AI Risk Analysis
Award)
Paper |
Website |
Code
Learning to Explore (L2E): Deep Reinforcement Learning-based Autonomous Exploration for Household Robot
Zuxin Liu, Mohit Deshpande, Xuewei Qi, Ding Zhao, Rajasimman Madhivanan, Arnie Sen.
SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles
Chejian Xu, Wenhao Ding, Weijie Lyu, Zuxin Liu, Shuai Wang, Yihan He, Hanjiang Hu, Ding Zhao, Bo Li.
NeurIPS 2022 | Paper | Website
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes
Mengdi Xu, Wenhao Ding, Jiacheng Zhu, Zuxin Liu, Baiming Chen, Ding Zhao.
NeurIPS 2020 | Paper | Code
Robot's Eyes and Brain: Visual Semantic SLAM System, 2018
Our visual semantic SLAM system empowers robots to construct detailed semantic maps of their surroundings, identifying and remembering the locations of objects. With voice command capabilities, the robot can efficiently locate and autonomously navigate to specific items. This project secured the top prize at the 2018 ICOPEN 3D Sensor Application Design Competition, standing out among 20 teams worldwide. | Video
VR Multicopter System, 2017
Ever wanted to experience the sensation of flying? Our VR Multicopter system offers an immersive flying experience. By using a VR device, users can control the orientation of a gimbal mounted on our multicopter. The stereo camera on the gimbal streams real-time video back to the VR equipment, allowing you to simply move your head and enjoy breathtaking aerial views. This project earned the 1st prize at the 2017 International Design and Innovation Competition among 14 global teams. | Video
Autonomous Navigation Robot, 2017
I led a team to build a mobile robot platform which could achieve autonomous
navigation and obstacle avoidance based on RTAB-Map SLAM and ROS Navigation
Stack. Just with one-click, the robot can autonomously navigate to wherever you
want.
Video
Automatic AI Robot System, 2017
This project is designed for ICRA DJI Robotmaster AI challenge. The robots are required to autonomously find enemy robots and hit them (shoot rubber ball). More exciting information and videos about this robot platform and the relevant robot competition can be found here.

