Wei Zhan is Co-Director of Berkeley DeepDrive, a leading research center on automotive AI involving many Berkeley faculty and industrial partners. He is also Co-Director of BAIR Center for Humanoid Intelligence, a new center combining leading humanoid robotics research labs towards robotic generalists. He is an Assistant Professional Researcher at UC Berkeley leading a team of Ph.D. students and Postdocs conducting research. He also teaches AI for Autonomy at UC Berkeley.

He is also Chief Scientist of Applied Intuition, a vehicle intelligence company delivering autonomy, toolchain and OS to various industrial verticals including passenger/trucking automotive and mining. He leads the research efforts towards next-generation autonomy and toolchain with cutting-edge AI.

His research is focused on AI for scalable autonomous systems leveraging robotics, computer vision, machine learning and control techniques to tackle challenges with complex scenes, dynamics and human behavior with applications to autonomous driving and general robotics. He received his Ph.D. degree from UC Berkeley. Four of his publications were awarded in flagship conferences and journals.

Hiring!

Wei Zhan is actively hiring Research Scientists, Research Engineers and Research Interns in Applied Intuition. Apply to the roles in “AI Research” section if you are interested in conducting research on AI for autonomous systems, robotics and simulation, or supporting research on AI infrastructure and software/hardware.

Selected Awards

Selected Research

Reinforcement Learning, Control, Autonomous Racing

Generative Model, 3D Reconstruction, Neural Simulation

  • X-Drive – Cross-modality Consistent Data Generation with Diffusion: ICLR’25
  • DeSiRe-GS – 4D Gaussians for Decomposition and Mesh: CVPR’25
  • CompGS – Compositional Text-to-3D Gaussians: CVPR’25
  • Q-SLAM – quadric representations for monocular SLAM: CoRL’24
  • S3 Gaussian – Self-Supervised Street Gaussian: arxiv, Code

Manipulation, Diffusion Policy, Robot Learning from Human

  • Generalizable representation learning human demonstrations: RSS’24, Website
  • Open X-Embodiment – Robotic Learning Datasets and RT-X Models: ICRA’24 (Best Paper Award), Blog, Dataset, Website, Code
  • Sparse Diffusion Policy – Flexible Policy with Mixture of Experts (MoE): CoRL’24
  • DexHandDiff – Interaction-aware Diffusion for Adaptive Manipulation: CVPR’25
  • PhyGrasp – grasping with physics-informed large models: IROS’25, Website

3D Perception, Fusion, Data Engine

Behavior Generation, Language Reasoning, Diagnosis

  • WOMD-Reasoning – language Dataset for interaction reasoning: ICML’25, Website
  • LANGTRAJ: language-conditioned generation model and dataset: ICCV’25
  • Efficient Diffusion Models for Prediction and Controllable Generation: ECCV’24
  • Code diagnosis and repair of motion planners by LLM: RA-Letters’24
  • Guided diffusion for traffic simulation with controllable criticality: ECCV’24

Prediction, INTERACTION Dataset and Benchmark

Planning, Behavior Design, Inverse Reinforcement Learning