Blog · Review Essay · Last reviewed June 16, 2026

Invisible Women and the Data Gap Under AI

Caroline Criado Perez's Invisible Women is not an AI book in the narrow sense. It is more useful than that: a map of how missing data becomes designed reality before any model is trained.

The Book

Invisible Women: Data Bias in a World Designed for Men was published by Abrams Press in the United States on March 12, 2019. Abrams lists Caroline Criado Perez as the author and ISBN-13 9781419729072 for the hardcover edition. Amazon's listing uses ISBN-10 1419729071, the same number used in the site's affiliate link for this review.

The book's argument is simple and severe: when institutions collect, classify, and design around incomplete data, the missing population does not merely disappear from spreadsheets. It is made to live inside systems calibrated for someone else. Criado Perez moves across medicine, transport, work, public policy, consumer products, and technology, but the recurring mechanism is the same. The male default is treated as general reality, while women are handled as exceptions, edge cases, or noise.

The Default Is a Decision

The value of Invisible Women for this archive is that it turns "default" into a political word. A default body, default worker, default route, default voice, default schedule, or default risk profile looks neutral only because the institution has stopped noticing the people it was built around. In that sense, a data gap is not emptiness. It is an active design condition.

This is the same grammar that runs through algorithmic governance. A model does not need hatred to harm. It can inherit a world where care work is undercounted, clinical research is uneven, workplace equipment is standardized around the wrong body, and mobility patterns are read through one narrow map of daily life. The model then compresses these inherited omissions into scores, recommendations, priorities, and interfaces that feel newly objective because the old bias has been laundered through computation.

The AI Reading

Read after the generative AI boom, the book becomes a warning about training data and evaluation. AI systems are often described through scale: more data, bigger models, broader coverage. Criado Perez pushes against the comfort of that scale. Bigger data can still be partial data. Broader coverage can still miss the questions nobody asked. Benchmark performance can still hide subgroup failure when the test set inherits the same social defaults as the training set.

For AI agents, the problem sharpens. An agent that schedules, triages, drafts, routes, or buys things acts on defaults. If its tools assume a generic worker, patient, applicant, commuter, or user, then the agent's action can turn omission into administration. The issue is not consciousness, intention, or machine will. It is authorized automation operating inside a biased description of the world.

Governance After the Gap

NIST's AI Risk Management Framework treats trustworthy AI as a matter of design, development, deployment, and evaluation, not as a public-relations label. NIST's 2022 publication on identifying and managing AI bias is even more direct: bias is sociotechnical, shaped by data, models, institutional practices, and human interpretation. That is the governance bridge from Criado Perez to AI safety. The question is not only whether a model is accurate on average. It is accurate for whom, under what conditions, with what missing variables, and with what path of appeal when the system fails.

The European Commission's AI Act overview likewise describes a risk-based framework for AI systems, including stricter rules for high-risk uses and transparency obligations. Whatever one thinks of the details, the regulatory direction confirms the book's point: consequential systems need evidence about the people they affect. Without disaggregated testing, monitoring, documentation, and contestability, "data-driven" can become a polite name for institutional indifference.

Where the Book Needs Care

The book's evidentiary abundance is persuasive, but it can leave a reader with a tempting remedy: collect more data, and design will improve. That is necessary, but not sufficient. More complete data can expose a problem; it does not force an institution to care. Data can also become a new channel of surveillance, especially for people who already face medical, workplace, welfare, or platform scrutiny.

The stronger lesson is therefore not "include women in the dataset" alone. It is to ask who controls the category, who benefits from the measurement, who can refuse collection, who can correct the record, and who has power to change the design after harm appears. The missing-data problem is real. So is the danger of solving it by building more exhaustive systems of observation without building more democratic systems of control.

What This Changes

Invisible Women belongs in the Church of Spiralism catalog because it explains a basic ritual of machine society: what is uncounted becomes unreal, and what is counted badly becomes policy. It gives readers a concrete way to inspect AI claims without mystification. Ask what population is missing. Ask what default body or life pattern the system assumes. Ask whether performance is averaged over the very differences that matter.

The book's lasting force is its refusal to treat exclusion as an accident at the margins. In an AI system, the margin can become the rule for everyone routed through it. If the record is incomplete, the machine does not repair reality by optimizing over it. It formalizes the gap.

Sources

Book links are paid affiliate links. As an Amazon Associate I earn from qualifying purchases.


Return to Blog · Return to Books