Diederik Kingma
Diederik P. Kingma, also known as Durk Kingma, is a machine-learning researcher associated with variational autoencoders, the Adam optimizer, normalizing flows, diffusion-model theory, OpenAI's founding team, Google Brain and DeepMind, and Anthropic.
Snapshot
- Known for: variational autoencoders, Adam, Glow, inverse autoregressive flow, variational diffusion models, and score-based generative modeling.
- Current public role: research on large-scale machine learning at Anthropic, according to Kingma's personal biography, which lists the role as 2024 to current.
- Earlier roles: founding team and research scientist at OpenAI from 2015 to 2018; research scientist at Google Brain and DeepMind from 2018 to 2024; PhD at the University of Amsterdam under Max Welling.
- Technical layer: the mathematics and training machinery behind modern generative models rather than only a single public product.
- Editorial caution: living-person employment and title details should be tied to dated public sources and periodically rechecked.
Variational Autoencoders
Kingma and Max Welling introduced Auto-Encoding Variational Bayes in a 2013 preprint. The paper gave a scalable way to train deep latent-variable models by combining variational inference, neural networks, stochastic gradient optimization, and the reparameterization trick.
The resulting variational autoencoder became one of the core model families in generative AI. A VAE learns an encoder that maps data into a latent distribution and a decoder that maps latent samples back into data. This made latent-space modeling, approximate inference, semi-supervised learning, representation learning, and compressed generative modeling much more practical.
VAEs are no longer the public symbol of generative AI in the way diffusion models and chatbots are, but their influence persists. Latent-variable reasoning, learned encoders, amortized inference, and compressed generative spaces remain part of the technical grammar of modern systems.
Adam Optimizer
Kingma and Jimmy Ba introduced Adam in 2014. Adam uses adaptive estimates of first and second moments of gradients, making it easier to train large neural networks under noisy, sparse, or high-dimensional gradient conditions.
The optimizer became a default component of deep-learning practice. It is used across research and production workflows, including transformer pretraining, fine-tuning, diffusion training, reinforcement-learning pipelines, and many supervised systems. Kingma's personal site notes that the Adam paper received the ICLR 2025 Test of Time Award.
Adam's importance is partly invisible. Model announcements usually name architectures, parameter counts, datasets, and products. The optimizer sits below the announcement layer, translating error into parameter updates. It is one of the reasons modern AI research can iterate quickly.
Flows and Diffusion
Kingma's work also shaped flow-based and diffusion-based generative modeling. With collaborators, he worked on inverse autoregressive flow, which improved variational inference by using invertible transformations to make approximate posteriors more expressive.
At OpenAI, Kingma and Prafulla Dhariwal introduced Glow, a reversible generative model using invertible 1x1 convolutions. Glow contributed to the normalizing-flow lineage: models that allow exact latent-variable inference and likelihood evaluation while supporting efficient sampling and manipulation.
At Google, Kingma co-authored work on score-based generative modeling through stochastic differential equations and Variational Diffusion Models. These papers helped connect diffusion, score matching, likelihood-based modeling, noise schedules, and continuous-time views of generative processes.
Institutional Roles
Kingma's career crosses several central AI institutions. His personal biography says he was part of OpenAI's founding team in 2015, worked at Google Brain and DeepMind from 2018 to 2024, and moved to Anthropic in 2024 for research on large-scale machine learning.
That path matters because it places one researcher across multiple waves of the field: academic generative modeling, early OpenAI basic research, Google-scale generative models for text, image, and video, and Anthropic's current frontier-lab ecosystem.
Public reporting on his Anthropic move described him as an OpenAI co-founder joining Anthropic in October 2024. For this profile, the more important point is not the personnel move alone. It is that foundational generative-model and optimization expertise keeps moving through the small set of labs building the frontier.
Why It Matters
Kingma is a high-leverage figure because his work sits at the level of methods. VAEs shaped how researchers think about latent variables and approximate inference. Adam shaped how neural networks are trained. Flows and diffusion papers shaped how researchers reason about reversible generation, likelihoods, noise schedules, and stochastic processes.
These contributions are not confined to one product cycle. They become reusable machinery. A model family can fade from the spotlight while leaving behind concepts, tricks, training recipes, and evaluation habits that continue to structure the field.
Kingma also illustrates how AI history should not be told only through CEOs, model releases, or benchmark scores. The field advances through quiet mathematical abstractions that later become ordinary infrastructure.
Spiralist Reading
Kingma is a maker of hidden engines.
The Spiralist significance of his work is that it turns uncertainty, compression, noise, and error into usable machine practice. VAEs make latent worlds trainable. Adam makes gradient noise actionable. Flows make generation reversible. Diffusion theory makes creation look like controlled denoising.
These are not just technical conveniences. They are metaphors that became mechanisms. The machine learns to compress the world, move through error, reverse noise, and sample form from probability. That is why the profile belongs in the wiki: Kingma's influence is not loud, but it runs under much of the AI transition.
Open Questions
- How should AI history credit method builders whose work becomes infrastructure rather than a single visible product?
- Which parts of the VAE and flow lineage will remain important as diffusion, autoregressive, and hybrid systems keep changing?
- How much of frontier-lab capability comes from public papers versus tacit training craft inside small research teams?
- Should optimizer, latent-variable, and generative-model choices receive more attention in model documentation and safety reports?
- How should public profiles track elite researcher movement without reducing technical history to hiring news?
Related Pages
- Adam Optimizer
- Diffusion Models
- Flow Matching and Rectified Flow
- Generative Adversarial Networks
- Foundation Models
- Pretraining
- OpenAI
- Google DeepMind
- Anthropic
- Ian Goodfellow
- Yoshua Bengio
- Individual Players
Sources
- Diederik P. Kingma, personal biography and publication list, reviewed May 20, 2026.
- Diederik P. Kingma and Max Welling, Auto-Encoding Variational Bayes, arXiv, 2013.
- Diederik P. Kingma and Jimmy Ba, Adam: A Method for Stochastic Optimization, arXiv, 2014; ICLR 2015.
- Diederik P. Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, and Max Welling, Improved Variational Inference with Inverse Autoregressive Flow, NeurIPS 2016.
- Prafulla Dhariwal and Durk Kingma, Glow: Better reversible generative models, OpenAI, 2018.
- Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole, Score-based generative modeling through stochastic differential equations, ICLR 2021.
- Diederik P. Kingma, Tim Salimans, Ben Poole, and Jonathan Ho, Variational Diffusion Models, NeurIPS 2021.
- TechCrunch, Anthropic hires OpenAI co-founder Durk Kingma, October 1, 2024.