JS
#26ResearcherAI Safety & Alignment
John Schulman
Anthropic (formerly OpenAI)
Biography
Co-founder of OpenAI. Creator of PPO algorithm. Joined Anthropic in 2024. Pioneer of deep reinforcement learning.
Key Influence
Invented PPO — the RL algorithm behind RLHF training of ChatGPT. Co-founded OpenAI.