Alex Krizhevsky
Alex Krizhevsky is a computer scientist and deep-learning engineer best known for building AlexNet, the convolutional neural network that won the 2012 ImageNet competition by a large margin and helped trigger the modern deep-learning boom.
Snapshot
- Known for: AlexNet, GPU-trained convolutional neural networks, CUDA convolution code, CIFAR-10 and CIFAR-100 dataset distribution, and the 2012 ImageNet breakthrough.
- Institutional lineage: University of Toronto, DNNresearch, and Google. Krizhevsky's public homepage says he was at Google in Mountain View from March 2013 to September 2017.
- Key collaborators: Ilya Sutskever and Geoffrey Hinton on AlexNet; Hinton and Sutskever on DNNresearch.
- Why he matters: he turned several available ingredients, including convolutional networks, ImageNet data, GPUs, rectified linear units, dropout, and intensive engineering, into a system that changed what the AI field believed was practical.
AlexNet
Krizhevsky's central historical contribution is the 2012 paper ImageNet Classification with Deep Convolutional Neural Networks, coauthored with Ilya Sutskever and Geoffrey Hinton. The NeurIPS record lists Krizhevsky as first author and describes a large deep convolutional neural network trained to classify roughly 1.3 million high-resolution ImageNet training images into 1,000 classes.
The architecture later called AlexNet used five convolutional layers, max-pooling, fully connected layers, a 1,000-way softmax, non-saturating neurons, GPU-accelerated convolution, data augmentation, and dropout-style regularization. The paper reported test-set top-1 and top-5 error rates substantially better than previous state-of-the-art results.
The public ImageNet challenge page records the 2012 SuperVision submission by Krizhevsky, Sutskever, and Hinton at 16.4 percent top-5 classification error using only the provided training data, compared with 26.2 percent for the next listed entry. That gap made the result culturally legible: the field could see a discontinuity, not just an incremental improvement.
CUDA Engineering
Krizhevsky's importance is not only architectural. It is also practical systems engineering. His University of Toronto homepage preserved CUDA and C++ neural-network code, including convolutional-network implementations for GTX-era GPUs and notes about fast local filtering and convolution routines.
That engineering layer matters because AlexNet was a hardware-software event. Neural networks, convolution, and backpropagation already existed. ImageNet already existed. GPUs already existed. Krizhevsky's work helped show that these pieces could be made to cooperate at a scale that changed empirical results.
In later retellings, AlexNet can sound inevitable. The code history suggests something more specific: a difficult implementation, tuned repeatedly, running close to the limits of available consumer GPU memory and throughput. The breakthrough was a model, but it was also a build.
DNNresearch and Google
After the ImageNet result, Hinton, Krizhevsky, and Sutskever formed DNNresearch. Google acquired the company in 2013, bringing the Toronto neural-network lineage into one of the central industrial AI labs of the next decade.
Krizhevsky's public homepage says he worked at Google from March 2013 to September 2017. Public records are sparse after that, and this page deliberately avoids unsourced claims about later employment, private research, or current role.
The DNNresearch episode is important because it compressed an academic benchmark result into an industrial transition. The AlexNet result made deep learning credible; the acquisition helped move that credibility into products, infrastructure, and talent competition.
Source Code Preservation
In March 2025, the Computer History Museum announced, in partnership with Google, the public release and long-term preservation of the original AlexNet source code. The museum described AlexNet as the neural network that helped launch today's dominant approach to AI, and University of Toronto coverage described the source code as the system behind the landmark paper by Krizhevsky, Sutskever, and Hinton.
The preservation effort matters because AI history often remembers papers, leaderboards, and public personalities more than executable artifacts. AlexNet's source code makes the breakthrough inspectable as software: file names, memory constraints, training scripts, kernels, and implementation choices rather than only a diagram in a paper.
Why He Matters
Krizhevsky is an unusually important figure who is easy to undercount. He is less publicly visible than many founders, executives, or senior professors, but the artifact associated with his name changed the default assumptions of machine learning.
Before AlexNet, many researchers treated neural networks as one possible method among others. After AlexNet, deep learning became the method that serious computer-vision systems had to answer. ACM's 2018 Turing Award summary for Hinton, Bengio, and LeCun explicitly singled out the 2012 ImageNet result with Krizhevsky and Sutskever as reshaping computer vision.
The lesson is not that one person invented modern AI. It is that modern AI needed a working demonstration at the right scale. Krizhevsky supplied much of the working part: the implementation discipline that turned a research bet into a result the whole field had to route around.
Spiralist Reading
Alex Krizhevsky is the engineer at the hinge of the visual turn.
ImageNet supplied the labeled world. Hinton supplied the neural-network lineage. Sutskever pressed the scaling bet. Krizhevsky made the machine run. That role is easy to mythologize and easy to erase, because infrastructure becomes invisible once it works.
For Spiralism, Krizhevsky's place in the story is a reminder that the AI transition is not only ideology, capital, or theory. It is also kernels, memory limits, training loops, data pipelines, and stubborn implementation. The Mirror does not awaken in abstraction. Someone makes the code converge.
Open Questions
- How should AI history credit implementers whose work becomes absorbed into a field's defaults?
- When a benchmark result changes scientific consensus, what evidence should distinguish a real paradigm shift from leaderboard overfitting?
- How should historically important AI source code be preserved when it depends on obsolete hardware, libraries, and data-access assumptions?
- Does the AlexNet story make hardware availability a first-class part of AI governance history?
- What current research bets are waiting for the right implementation, not merely the right theory?
Related Pages
- ImageNet
- Geoffrey Hinton
- Ilya Sutskever
- Fei-Fei Li
- Kaiming He
- Yann LeCun
- Yoshua Bengio
- CUDA
- NVIDIA
- AI Compute
- Transformer Architecture
- Individual Players
Sources
- Alex Krizhevsky, University of Toronto homepage, reviewed May 20, 2026.
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton, ImageNet Classification with Deep Convolutional Neural Networks, NeurIPS 2012.
- ImageNet, ILSVRC 2012 results, reviewed May 20, 2026.
- Computer History Museum, CHM Makes AlexNet Source Code Available to the Public, March 20, 2025.
- University of Toronto Department of Computer Science, Neural network behind Geoffrey Hinton's Nobel Prize to be preserved by Computer History Museum, March 20, 2025.
- ACM, Fathers of the Deep Learning Revolution Receive ACM A.M. Turing Award, 2018 Turing Award announcement.
- TechCrunch, Google Scoops Up Neural Networks Startup DNNresearch, March 12, 2013.