I am a senior staff research scientist at Google DeepMind. My recent work focuses on meta reinforcement learning, LLM post-training, and LLM agents. Before joining Google DeepMind, I completed my Ph.D. at the University of Michigan, where I was co-advised by Honglak Lee and Satinder Singh.
News
Nov 2025: DiscoRL is now published in Nature. The majority of the work was finished in 2022.
Invited Talks
Jul 2022: ICML 2022 Workshop on Decision Awareness in Reinforcement Learning, Baltimore, MD.
Jun 2019: RE-WORK Deep Reinforcement Learning Summit, San Francisco, CA.
Jul 2018: ICML 2018 Workshop on Prediction and Generative Modeling in Reinforcement Learning, Stockholm, Sweden. [Slides]
Jan 2018: RE-WORK Deep Learning Summit, San Francisco, CA.
Nov 2017: Ann Arbor Deep Learning Event, Ann Arbor, MI.
Oct 2017: Amazon Graduate Research Symposium, Seattle, WA.
Mar 2017: RE-WORK Machine Intelligence Summit, San Francisco, CA.
Dec 2016: NIPS 2016 Workshop on Deep Reinforcement Learning, Barcelona, Spain. [Slides]
Nov 2016: Ann Arbor Deep Learning Event, Ann Arbor, MI.
Publications
Junhyuk Oh, Iurii Kemaev, Greg Farquhar, Dan A Calian, Matteo Hessel, Luisa Zintgraf, Satinder Singh, Hado van Hasselt, David Silver
Nature, 2025

Dan A Calian, Gregory Farquhar, Iurii Kemaev, Luisa M. Zintgraf, Matteo Hessel, Jeremy Shar, Junhyuk Oh, András György, Tom Schaul, Jeffrey Dean, Hado van Hasselt, David Silver
Neural Information Processing Systems (NeurIPS), 2025

Gemini Team
Tech Report, 2025

Abbas Abdolmaleki, Bilal Piot, Bobak Shahriari, Jost Tobias Springenberg, Tim Hertweck, Rishabh Joshi, Junhyuk Oh, Michael Bloesch, Thomas Lampe, Nicolas Heess, Jonas Buchli, Martin Riedmiller
International Conference on Learning Representations (ICLR), 2024 (Spotlight)

Gemini Team
Tech Report, 2024

Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto
Neural Information Processing Systems (NeurIPS), 2023 (Spotlight)

Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih
International Conference on Learning Representations (ICLR), 2023 (Oral)

Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram Friesen, Junhyuk Oh, Yutian Chen
AAAI Conference on Artificial Intelligence (AAAI), 2022

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh
Neural Information Processing Systems (NeurIPS), 2021

Dan A. Calian, Daniel J Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy Mann
International Conference on Learning Representations (ICLR), 2021

Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2021

Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver
Neural Information Processing Systems (NeurIPS), 2020
[Paper] [Press: VentureBeat] [Press: Analytics India] [Press: SingularityHub]

Zhongwen Xu, Hado van Hasselt, Matteo Hessel, Junhyuk Oh, Satinder Singh, David Silver
Neural Information Processing Systems (NeurIPS), 2020

Tom Zahavy, Zhongwen Xu, Vivek Veeriah, Matteo Hessel, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh
Neural Information Processing Systems (NeurIPS), 2020

Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh
International Conference on Machine Learning (ICML), 2020

Oriol Vinyals, Igor Babuschkin, Wojciech M Czarnecki, Michaël Mathieu, Andrew Dudzik, Junyoung Chung, David H Choi, Richard Powell, Timo Ewalds, Petko Georgiev, Junhyuk Oh, Dan Horgan, Manuel Kroiss, Ivo Danihelka, Aja Huang, Laurent Sifre, Trevor Cai, John P Agapiou, Max Jaderberg, Alexander S Vezhnevets, Rémi Leblond, Tobias Pohlen, Valentin Dalibard, David Budden, Yury Sulsky, James Molloy, Tom L Paine, Caglar Gulcehre, Ziyu Wang, Tobias Pfaff, Yuhuai Wu, Roman Ring, Dani Yogatama, Dario Wünsch, Katrina McKinney, Oliver Smith, Tom Schaul, Timothy Lillicrap, Koray Kavukcuoglu, Demis Hassabis, Chris Apps, David Silver
Nature, 2019

Vivek Veeriah, Matteo Hessel, Zhongwen Xu, Richard Lewis, Janarthanan Rajendran, Junhyuk Oh, Hado van Hasselt, David Silver, Satinder Singh
Neural Information Processing Systems (NeurIPS), 2019

Daniel J. Mankowitz, Augustin Žídek, André Barreto, Dan Horgan, Matteo Hessel, John Quan, Junhyuk Oh, Hado van Hasselt, David Silver, Tom Schaul
Reinforcement Learning and Decision Making (RLDM), 2019

Jongwook Choi, Yijie Guo, Marcin Moczulski, Junhyuk Oh, Neal Wu, Mohammad Norouzi, Honglak Lee
International Conference on Learning Representations (ICLR), 2019

Zeyu Zheng, Junhyuk Oh, Satinder Singh
Neural Information Processing Systems (NeurIPS), 2018

Sungryull Sohn, Junhyuk Oh, Honglak Lee
Neural Information Processing Systems (NeurIPS), 2018

Vivek Veeriah, Junhyuk Oh, Satinder Singh
NeurIPS Workshop on Deep Reinforcement Learning, 2018

Yijie Guo, Junhyuk Oh, Satinder Singh, Honglak Lee
NeurIPS Workshop on Deep Reinforcement Learning, 2018

Junhyuk Oh, Yijie Guo, Satinder Singh, Honglak Lee
International Conference on Machine Learning (ICML), 2018

Junhyuk Oh, Satinder Singh, Honglak Lee
Neural Information Processing Systems (NeurIPS), 2017

Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli
International Conference on Machine Learning (ICML), 2017

Junhyuk Oh, Valliappa Chockalingam, Satinder Singh, Honglak Lee
International Conference on Machine Learning (ICML), 2016
[Paper] [Code] [Video] [Press: MIT Technology Review] [Press: Daily Mail]

Seunghoon Hong, Junhyuk Oh, Bohyung Han, Honglak Lee
Computer Vision and Pattern Recognition (CVPR), 2015 (Spotlight)

Junhyuk Oh, Xiaoxiao Guo, Honglak Lee, Richard Lewis, Satinder Singh
Neural Information Processing Systems (NeurIPS), 2015 (Spotlight)