Blog · Review Essay · Last reviewed June 15, 2026

Understanding Computers and Cognition and the Action Behind the Interface

Terry Winograd and Fernando Flores's Understanding Computers and Cognition: A New Foundation for Design is one of the most useful old books for the current AI moment because it refuses the cleanest fantasy in computing: that meaning can be solved by better internal representation alone. Its target is not merely 1980s symbolic AI. It is the broader habit of treating cognition as a formal structure inside a machine, detached from bodies, practices, commitments, institutions, and the situations in which people actually act.

The Book

Understanding Computers and Cognition was published by Ablex in 1986 and issued as an Addison-Wesley paperback in 1987. Google Books lists the Addison-Wesley edition at 207 pages with ISBN 0201112973, while Winograd's own publication list records the Ablex edition at 220 pages and notes later translations in French, German, Italian, Spanish, and Japanese. His CV also says the book was named the best information science book of 1987 by the American Society for Information Science.

The book sits at a hinge in AI history. Winograd had built SHRDLU, one of the canonical early natural-language AI systems, then became a central figure in human-computer interaction and design. Flores brought cybernetics, organizational design, political experience, and a theory of action through language. Together they produced a book that is partly AI critique, partly philosophy of cognition, partly design theory, and partly a proposal for thinking about computers as participants in human work rather than as isolated artificial minds.

That mixture explains why the book can feel difficult and fresh at once. It draws on phenomenology, hermeneutics, biology of cognition, speech-act theory, system design, management, and cooperative work. It is not a manual for building today's AI systems. It is a way to notice what those systems leave out when they reduce human meaning to text, task state, user profile, objective function, database field, or prompt context.

The AI It Argues Against

The book's main opponent is the rationalist image of cognition: the idea that intelligence consists mainly of representing facts about the world, manipulating those representations by rules, and producing correct inferences. That image was not invented by AI, but AI made it operational. If language is a mapping from sentences to facts, and thought is formal manipulation, then a computer looks like the natural home of intelligence.

Winograd and Flores do not deny that formal systems are powerful. They deny that formalization is the whole story of understanding. People do not encounter the world as detached proposition processors. They are already involved in projects, habits, tools, roles, histories, expectations, and social practices. Meaning shows up inside use. A request, a promise, an apology, a diagnosis, a ticket, an alert, a legal notice, and a medical instruction are not just strings with semantic content. They change obligations, authorize action, create risk, and reorganize attention.

This is why the book remains a useful antidote to model-centered AI discourse. A model can manipulate language with extraordinary fluency and still leave the harder question open: what kind of action is this output entering, and who is committed by it? The danger is not that formal systems are useless. It is that institutions mistake a formal handle on a situation for the situation itself.

After the Blocks World

Winograd's 1980 essay "What Does It Mean to Understand Language?" is the best companion to the book because it makes the trajectory visible. SHRDLU worked in a simulated blocks world where language, objects, actions, and feedback were tightly constrained. It could answer questions, follow commands, track some conversational context, and manipulate toy objects on a screen. It was impressive because the world was narrow enough for representation and action to line up.

The lesson is not that narrow worlds are fake. All useful systems narrow the world. The lesson is that the narrowing must remain visible. SHRDLU did not discover the general structure of human understanding. It demonstrated how much understanding can appear when a domain has been carefully built so that symbols, operations, and consequences fit together.

Current AI systems are less brittle in language and vastly broader in surface competence, but they still depend on constructed worlds: training corpora, tool APIs, retrieval indexes, product policies, benchmark tasks, interface scaffolds, enterprise permissions, and hidden conventions about what counts as a good answer. The new blocks world is not a tabletop. It is the whole stack of captured text, institutional workflow, ranking systems, and tool permissions that makes an answer look situated.

Breakdown as the Design Moment

One of the book's strongest design ideas is breakdown. Most of the time, a tool recedes into activity. The keyboard, form, calendar, terminal, spreadsheet, dashboard, ticket queue, and chatbot window do not appear as objects of attention while they work. They appear when something goes wrong: the form has no field for the real case, the interface routes the user to the wrong department, the summary omits the decisive fact, the agent acts with the wrong permission, the model is fluent but off-target.

Breakdown is not merely failure. It is the moment when the hidden structure of a system becomes available for inspection. A good design makes breakdowns recoverable, accountable, and informative. A bad design hides them, smooths them over, or converts them into user blame.

This matters sharply for AI interfaces. The more fluent a system becomes, the easier it is to miss the point at which the user should stop and inspect. An AI assistant can make a bad assumption sound continuous with the user's intention. A coding agent can turn a mistaken model of the repository into a patch. A customer-service bot can turn a policy gap into a polished refusal. A medical scribe can turn an uncertain conversation into official record. The design question is not only how to prevent error. It is how to make the right breakdowns visible before they harden into action.

Language as Action

The language/action perspective is the book's most practical inheritance. In ordinary software thinking, language often appears as information: a message carries content from sender to receiver. Winograd and Flores push the reader toward a different view. Language is something people do. It requests, offers, promises, declines, commits, authorizes, delegates, escalates, cancels, records, and repairs.

That makes the book newly relevant to large language models. Generative AI systems do not merely emit text. They increasingly draft emails, open issues, summarize meetings, prepare reports, answer customers, file tickets, call tools, update records, and coordinate other systems. In those settings, words become institutional action. A generated message can create a commitment even if nobody intended the model itself to be responsible.

The key distinction is between text that describes and text that binds. A summary can bind a patient to a medical history. A Jira ticket can bind an engineer to a priority. A model-written denial can bind a consumer to a dispute process. A procurement note can bind an organization to a vendor interpretation. A chatbot apology can bind a company to a representation. The output matters because it enters a social machinery of commitments.

Institutions and Commitments

Flores's influence shows most clearly where the book turns from cognition to organizations. Work is not just individual problem solving. It is a network of commitments: who asks, who promises, who waits, who checks, who escalates, who closes the loop, and who is accountable when the promise fails. Computer systems can support those commitments, distort them, or make them disappear behind workflow automation.

This is the institutional lesson for AI agents. An agent is not simply a smart tool when it can read private context, call services, change files, contact people, or route decisions. It becomes part of a commitment structure. The important questions become administrative and political: whose authority does it borrow, which promises can it make, what can it change without asking, what evidence does it preserve, what escalation path exists, and who can contest the action afterward?

The book helps separate automation from accountability. A workflow can be efficient while making responsibility less legible. A summary can be useful while hiding who selected the sources. A request can be routed faster while making it harder to know who refused it. A dashboard can coordinate work while converting negotiation into status categories. The design task is to preserve the human commitments that make action answerable.

The Current AI Reading

Read in 2026, Understanding Computers and Cognition is not a prediction that neural networks would fail. It is a warning about a category error: do not confuse operational competence inside a designed environment with full participation in human understanding. Today's models are stronger than the systems the book criticized, but many deployments still rely on the same simplification. They treat enough context as the same thing as situation, enough language as the same thing as meaning, enough tool access as the same thing as agency, and enough user approval as the same thing as responsibility.

The strongest version of the book's critique is not "machines cannot understand." That claim can become sterile because it turns into metaphysics too quickly. The more useful claim is: understanding is not located in the model alone. It lives across people, tools, histories, bodies, records, incentives, norms, and institutions. A system that ignores those relations can still be useful. It should not be mistaken for the whole intelligence of the setting it has entered.

This changes how AI evaluation should work. It is not enough to ask whether the model answered correctly in isolation. Ask what practice the answer enters. Ask whether the user can detect breakdown. Ask whether the system preserves source, uncertainty, and authority boundaries. Ask what commitments the output creates. Ask whether the institution has become more capable of correction or merely faster at producing official-looking action.

Where the Book Needs Friction

The book has real limits. Its prose can be abstract, and some of its business-system design examples feel tied to the office-automation debates of its period. Readers looking for a technical treatment of contemporary machine learning will not find one here. Readers looking for a direct account of race, gender, class, disability, labor exploitation, surveillance capitalism, or platform governance will need other books beside it.

There is also a risk in overstating the anti-representational lesson. Formal representations, benchmarks, databases, ontologies, and models are not optional in complex institutions. The right conclusion is not that representation is false. It is that representation is partial, situated, and consequential. A form, model, prompt, schema, or knowledge graph is a social intervention, not a neutral window.

The language/action program also deserves scrutiny. Treating organizational work as structured commitments can clarify accountability, but it can also over-formalize human cooperation. Not every valuable exchange is a request, promise, or closure. Care, hesitation, tacit skill, refusal, ambiguity, and informal repair can be damaged by systems that make every relation explicit enough to manage.

What This Changes

The book changes the center of AI criticism. The important question is not whether a machine has an inner state sufficiently similar to a person's. The practical question is what kind of human situation the machine is joining, how the interface defines that situation, and what forms of action become easier after the machine arrives.

For builders, that means design begins before the prompt and continues after the output. Map the work practice. Name the commitments. Identify the likely breakdowns. Preserve context that cannot fit in the model. Make authority visible. Keep appeal paths human. Do not let a fluent response become the institutional record without a way to inspect how it was produced and what it leaves out.

Understanding Computers and Cognition remains valuable because it moves computation back into the world. It asks designers to stop treating meaning as something sealed inside a representation and start treating it as something that happens when people, machines, language, and institutions act together. That is still the harder problem. It is also the one that matters most when the interface no longer just displays information, but begins to answer, decide, and move.

Sources

Book links are paid affiliate links. As an Amazon Associate I earn from qualifying purchases.


Return to Blog · Return to Books