Sotabase
Home
Researchers
Career
·
Cofounder and Chief Scientist
,
Thinking Machines
2025–
·
Research Scientist
,
Anthropic
2024–
·
Co-founder and Lead of Reinforcement Learning Team
,
OpenAI
2015–
Publications
(186)
GPT-4 Technical Report
2023
21,596
cited
Trust Region Policy Optimization
International Conference on Machine Learning · 2015
7,583
cited
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
Neural Information Processing Systems · 2016
4,432
cited
High-Dimensional Continuous Control Using Generalized Advantage Estimation
International Conference on Learning Representations · 2015
4,104
cited
GPT-4o System Card
arXiv.org · 2024
2,980
cited
Concrete Problems in AI Safety
arXiv.org · 2016
2,821
cited
On First-Order Meta-Learning Algorithms
arXiv.org · 2018
2,476
cited
Let's Verify Step by Step
International Conference on Learning Representations · 2023
2,424
cited
Theano: A Python framework for fast computation of mathematical expressions
arXiv.org · 2016
2,366
cited
Benchmarking Deep Reinforcement Learning for Continuous Control
International Conference on Machine Learning · 2016
1,779
cited
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Robotics: Science and Systems · 2017
1,276
cited
RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning
arXiv.org · 2016
1,109
cited
Motion planning with sequential convex optimization and convex collision checking
Int. J. Robotics Res. · 2014
911
cited
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Neural Information Processing Systems · 2016
828
cited
VIME: Variational Information Maximizing Exploration
Neural Information Processing Systems · 2016
814
cited
Transparent Water-in-Oil Dispersions: the Oleopathic Hydro-Micelle
Nature · 1943
779
cited
Variational Lossy Autoencoder
International Conference on Learning Representations · 2016
695
cited
Mechanism of Formation and Structure of Micro Emulsions by Electron Microscopy
1959
601
cited
Finding Locally Optimal, Collision-Free Trajectories with Sequential Convex Optimization
Robotics: Science and Systems · 2013
531
cited
Gradient Estimation Using Stochastic Computation Graphs
Neural Information Processing Systems · 2015
403
cited
Show all 186 papers →
Sotabase
John Schulman | Researcher Profile | Sotabase | Sotabase