John Schulman

Thinking Machines Lab

Specialties

Reinforcement LearningAI Safety

Location

USA

Education

PhD in Computer Science, UC Berkeley (2016).

Links & Profiles

Website Scholar

Biography

Co-founder of OpenAI and creator of PPO. Left Anthropic in early 2025 to co-found Thinking Machines Lab, where he is chief scientist. Pioneer of deep reinforcement learning.

Key Influence

Invented PPO, the RL algorithm behind RLHF training of ChatGPT. Co-founded OpenAI.

Connected Contributors

IS

Ilya Sutskever

#6 · AI Safety & Alignment

DA

Dario Amodei

#7 · AI Safety & Alignment

PA

Pieter Abbeel

#18 · Robotics & Autonomous Systems

#25

#27

Back to AI Notebook