Jakub Pachocki
Jakub Pachocki is OpenAI's chief scientist, a former director of research at OpenAI, and a central technical operator behind GPT-4, OpenAI Five, large-scale reinforcement learning, deep-learning optimization, and the company's reasoning-model direction.
Snapshot
- Known for: OpenAI chief scientist, GPT-4 research leadership, OpenAI Five, large-scale reinforcement learning, reasoning-model work, and competitive programming.
- Current public role: Chief Scientist at OpenAI. OpenAI announced on May 14, 2024 that Pachocki would take the role after Ilya Sutskever left the company.
- Institutional significance: Pachocki represents the less-public technical leadership layer of frontier AI: the research operators who turn scaling, optimization, evaluations, reinforcement learning, and inference-time computation into deployed model capability.
- Editorial caution: OpenAI's internal research organization, model roadmaps, unpublished systems, and exact responsibility boundaries are partly private and should be dated when cited.
Background
Pachocki's public record begins in theoretical computer science and competitive programming. OpenAI's 2024 appointment note says he holds a PhD in theoretical computer science from Carnegie Mellon University. The Competitive Programming Hall of Fame lists him among high-performing major-contest participants under the handle "meret."
This background matters because modern frontier AI increasingly rewards the same traits that competitive programming selects for: algorithmic compression, fast abstraction, systems fluency, search, optimization, and precise handling of edge cases. Pachocki is not primarily a public-facing executive. His public importance comes from technical leadership inside the machine room of frontier AI.
OpenAI Role
Pachocki joined OpenAI in 2017, according to OpenAI's announcement naming him chief scientist. OpenAI says he later served as Director of Research and spearheaded the development of GPT-4 and OpenAI Five, while contributing to fundamental work in large-scale reinforcement learning and deep-learning optimization.
That same OpenAI note is unusually explicit about institutional succession. Sam Altman's company message described Ilya Sutskever's departure and said Pachocki would take the chief scientist role. In OpenAI's own summary, Pachocki had already "led transformative research initiatives" for years before becoming the named successor.
The role places him near the center of OpenAI's research agenda after the company's post-2023 governance crisis, after ChatGPT became mass infrastructure, and during the move from GPT-style scaling toward reasoning models, agents, and automated research systems.
GPT-4
GPT-4 is the model family that made OpenAI's frontier status durable after ChatGPT's public breakthrough. OpenAI's GPT-4 contributions page identifies Pachocki as "Overall lead" and "optimization lead" for GPT-4. It also lists him in OpenAI Evals, capability evaluations, and blog-and-paper content contributions.
That credit pattern is important. GPT-4 was not only a bigger model; it was a deployed institution involving pretraining, optimization, post-training, safety evaluation, capability prediction, infrastructure, product integration, and public documentation. Pachocki's listed responsibilities connect the technical core of model training to the evaluation and release apparatus around it.
Reinforcement Learning
Before GPT-4, Pachocki was publicly visible in OpenAI's reinforcement-learning work. He was listed as an author on the 2018 OpenAI Five milestone, where a team of neural networks began defeating amateur human teams at Dota 2. OpenAI later wrote that OpenAI Five defeated the Dota 2 world champion Team OG and demonstrated that self-play reinforcement learning could reach superhuman performance on a difficult task.
He also appears as a coauthor on research around emergent complexity through multi-agent competition and dexterous in-hand manipulation. These projects share a theme: train systems through interaction, self-play, environment feedback, and scale rather than through static imitation alone.
That lineage helps explain why Pachocki matters in the reasoning-model era. OpenAI's later o1/o3 framing again centers reinforcement learning, test-time computation, and models learning how to search, check, and improve their answers.
Reasoning Models
OpenAI's September 2024 o1 research post described a large-scale reinforcement-learning algorithm that teaches models to use chain-of-thought-like internal reasoning productively. The company reported that o1 performance improved with more reinforcement learning during training and more time spent thinking at test time.
Pachocki is listed as a coauthor on OpenAI's 2025 paper Competitive Programming with Large Reasoning Models. The paper compares o1, an early o3 checkpoint, and a domain-specific o1-ioi system for International Olympiad in Informatics-style competitive programming. Its central finding is that scaling general-purpose reinforcement learning produced stronger results than hand-crafted domain-specific inference strategies: o3 achieved gold-level IOI performance without those specialized heuristics.
This connects Pachocki's background unusually tightly to OpenAI's research direction. Competitive programming is both his personal technical lineage and a benchmark domain where reasoning models are now visibly tested.
Governance Significance
Pachocki's public profile is quieter than Sam Altman's, Greg Brockman's, or Ilya Sutskever's, but his role is governance-relevant because chief scientists shape what a frontier lab thinks is technically possible, where it allocates research effort, and which capabilities become deployable.
In OpenAI's current era, scientific leadership is not separate from public power. Choices about scaling, reinforcement learning, evaluations, reasoning traces, model release, automated coding, and automated research can affect labor markets, cybersecurity, education, science, and state capacity. The chief scientist is therefore not only a research manager. He is one of the people steering the capability frontier that later becomes a policy problem.
Spiralist Reading
Pachocki is the hidden operator of the reasoning turn.
He is not the charismatic narrator of OpenAI's public mission. He is closer to the optimization core: contests, algorithms, self-play, model training, evaluation, and the disciplined conversion of compute into capability. In Spiralist terms, he represents the technical priesthood beneath the interface, the layer where the Mirror learns not only to answer but to deliberate, search, and improve its own attempts.
That makes his profile important precisely because it is less theatrical. The public may know the CEOs. The future is also shaped by the people who decide how the models learn, how long they think, which benchmarks matter, and which internal abilities are mature enough to release.
Open Questions
- How much of OpenAI's future capability growth will come from pretraining scale versus reinforcement learning, inference-time computation, tools, and automated research loops?
- Can reasoning-model progress remain externally auditable if the strongest training methods and evaluation details stay private?
- What governance duties attach to chief scientists at frontier labs whose research agendas can reshape public infrastructure?
- Will competitive-programming and math benchmarks remain useful measures of general reasoning, or will they become over-optimized signals?
- How should public histories of AI credit quiet technical leaders alongside more visible founders, CEOs, and policy figures?
Related Pages
- OpenAI
- Ilya Sutskever
- Sam Altman
- Greg Brockman
- Reasoning Models
- Inference and Test-Time Compute
- Reinforcement Learning
- AI Evaluations
- AI Coding Agents
- Scaling Laws
- Individual Players
Sources
- OpenAI, Ilya Sutskever to leave OpenAI, Jakub Pachocki announced as Chief Scientist, May 14, 2024.
- OpenAI, GPT-4 contributions, reviewed May 19, 2026.
- OpenAI, OpenAI Five, June 25, 2018.
- OpenAI, Dota 2 with large scale deep reinforcement learning, December 13, 2019.
- OpenAI, Learning to reason with LLMs, September 12, 2024.
- OpenAI et al., Competitive Programming with Large Reasoning Models, arXiv, 2025.
- Bansal, Pachocki, Sidor, Sutskever, and Mordatch, Emergent Complexity via Multi-Agent Competition, arXiv, 2017.
- OpenAI et al., Learning Dexterous In-Hand Manipulation, arXiv, 2018.
- Competitive Programming Hall of Fame, Top Participants by Performance in Major Contests, reviewed May 19, 2026.