| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
date: Sun, 28 Dec 2025 19:23:51 GMT
content-type: text/html; charset=utf-8
last-modified: Thu, 20 Feb 2025 04:01:20 GMT
vary: Accept-Encoding
access-control-allow-origin: *
etag: W/"67b6a910-225e"
expires: Sun, 28 Dec 2025 19:33:51 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: D1BA:3B65E7:3905873:3D89A92:695183C6
Yan (Rocky) Duan
Yan (Rocky) Duan
I'm building intelligent softwares and algorithms for robots at Amazon, Frontier AI & Robotics (FAR) lab. Prior to that I was one of the Co-founders and CTO at covariant.ai.
I obtained my PhD degree from UC Berkeley, advised by Pieter Abbeel. My thesis is Meta Learning for Control.
Previously, I also worked at OpenAI as a research scientist.
I'm interested in reinforcement learning, robotics, unsupervised learning, and meta learning. Also check out my Google Scholar page.
I co-organized the NIPS 2017 Deep RL Symposium and the first Deep RL Bootcamp (now Full Stack Deep Learning Bootcamp).
I served as GSI of the very first offering of the Deep RL Course at UC Berkeley (now a regular offering).
You can reach me at rockyduan94@gmail.com.
Software
- rllab: a framework for developing and evaluating reinforcement learning algorithms. The proposed benchmark problems were later incorporated into OpenAI Gym. Created in 2015, the project remains popular today, and has 2000+ stars and 600+ forks on github.
Preprints
-
Adversarial Attacks on Neural Network PoliciesSandy Huang, Nicolas Papernot, Ian Goodfellow, Yan Duan, Pieter Abbeel
-
RL2: Fast Reinforcement Learning via Slow Reinforcement LearningYan Duan, John Schulman, Xi Chen, Peter L Bartlett, Ilya Sutskever, Pieter Abbeel
Publications
-
Model-Ensemble Trust-Region Policy OptimizationThanard Kurutach, Ignasi Clavera, Yan Duan, Aviv Tamar, Pieter AbbeelInternational Conference on Learning Representations (ICLR), 2018
-
Variance Reduction for Policy Gradient with Action-Dependent Factorized BaselinesOral (Top 2%)Cathy Wu, Aravind Rajeswaran, Yan Duan, Vikash Kumar, Alexandre M Bayen, Sham Kakade, Igor Mordatch, Pieter AbbeelInternational Conference on Learning Representations (ICLR), 2018
-
One-Shot Imitation LearningYan Duan, Marcin Andrychowicz, Bradly Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech ZarembaNeural Information Processing Systems (NIPS), 2017
-
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement LearningHaoran Tang, Rein Houthooft, Davis Foote, Adam Stooke, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter AbbeelNeural Information Processing Systems (NIPS), 2017
-
Variational Lossy AutoencoderXi Chen, Diederik P Kingma, Tim Salimans, Yan Duan, Prafulla Dhariwal, John Schulman, Ilya Sutskever, Pieter AbbeelInternational Conference on Learning Representations (ICLR), 2017
-
Stochastic Neural Networks for Hierarchical Reinforcement LearningCarlos Florensa, Yan Duan, Pieter AbbeelInternational Conference on Learning Representations (ICLR), 2017
-
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial NetsXi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter AbbeelNeural Information Processing Systems (NIPS), 2016
-
VIME: Variational Information Maximizing ExplorationRein Houthooft, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter AbbeelNeural Information Processing Systems (NIPS), 2016
-
Benchmarking Deep Reinforcement Learning for Continuous ControlYan Duan, Xi Chen, Rein Houthooft, John Schulman, Pieter AbbeelInternational Conference on Machine Learning (ICML), 2016
-
Deep Spatial Autoencoders for Visuomotor LearningChelsea Finn, Xin Yu Tan, Yan Duan, Trevor Darrell, Sergey Levine, Pieter AbbeelInternational Conference on Robotics and Automation (ICRA), 2016
-
Motion Planning with Sequential Convex Optimization and Convex Collision CheckingJohn Schulman, Yan Duan, Jonathan Ho, Alex Lee, Ibrahim Awwal, Henry Bradlow, Jia Pan, Sachin Patil, Ken Goldberg, Pieter AbbeelInternational Journal of Robotics Research (IJRR), Vol. 33, No. 9, pp. 1251-1270, Aug. 2014
-
Gaussian Belief Space Planning with Discontinuities in Sensing DomainsSachin Patil, Yan Duan, John Schulman, Ken Goldberg, Pieter AbbeelInternational Conference on Robotics and Automation (ICRA), 2014
-
Planning Locally Optimal, Curvature-Constrained Trajectories in 3D using Sequential Convex OptimizationYan Duan, Sachin Patil, John Schulman, Ken Goldberg, Pieter AbbeelInternational Conference on Robotics and Automation (ICRA), 2014
-
Sigma Hulls for Gaussian Belief Space Planning for Imprecise Articulated Robots amid ObstaclesAlex Lee, Yan Duan, Sachin Patil, John Schulman, Zoe McCarthy, Jur van den Berg, Ken Goldberg, Pieter AbbeelInternational Conference on Intelligent Robots and Systems (IROS), 2013