Blog · Analysis · Last reviewed June 19, 2026

The Incident Report Becomes Public Memory

AI governance will not mature until failures become inspectable memory instead of isolated scandals. An incident report is not just a complaint or headline; it is a dated evidence record that names the system, harm or hazard, uncertainty, responsible parties, corrective action, and disclosure boundary.

From Scandal to Record

Every technical system has a memory problem after harm occurs.

A self-driving car kills a pedestrian. A facial-recognition match helps police arrest the wrong person. A hiring or welfare system sorts people through opaque categories. A chatbot produces dangerous advice. A generated image becomes fraud, harassment, propaganda, or evidence pollution. A model-assisted tool deletes the wrong data, leaks private material, or automates a decision nobody can reconstruct.

The public usually meets these events as stories: a lawsuit, a news report, a viral thread, a company statement, a regulatory filing, a congressional letter, a correction. The story has heat. It may have a victim, a villain, a quote, a denial, and a news cycle. Then the cycle moves on.

Governance begins when the event becomes a record.

That is the plain importance of AI incident reporting. It is not a glamorous part of AI policy. It does not promise alignment, consciousness, or a clean theory of intelligence. It asks a colder question: what happened, what system was involved, who was harmed or almost harmed, what evidence exists, who knew, what changed afterward, and how can the next institution avoid repeating the same failure?

A field that cannot remember its accidents cannot govern its machines. It can only perform surprise.

Current Context

As of June 19, 2026, AI incident reporting is not one system. It is a patchwork of public repositories, civil-society records, regulator notices, company postmortems, lawsuit materials, agency inventories, safety-framework disclosures, and internal incident-response logs. That patchwork is useful, but it means readers must ask which layer they are looking at: public-source monitoring, community submission, legal notice, protected whistleblower report, regulator-only filing, or public aggregate.

The OECD AI Incidents and Hazards Monitor and the AI Incident Database are public memory layers. They help researchers, policymakers, journalists, builders, and affected communities see patterns that individual news stories miss. But the OECD methodology is explicit that news-based monitoring captures only a subset of worldwide incidents and hazards, and the AI Incident Database deliberately began with broad collection so shared criteria could improve through use. These databases are evidence infrastructure, not final adjudication.

Law is now entering the same space unevenly. The EU AI Act's Article 73 requires providers of high-risk AI systems placed on the Union market to report serious incidents to market surveillance authorities, with deadlines that depend on severity. The Commission also published a November 2025 serious-incident reporting template for providers of general-purpose AI models with systemic risk under Article 55. California's SB 53, signed on September 29, 2025, created a narrower frontier-model regime: public and company reporting routes for potential critical safety incidents to the Office of Emergency Services, whistleblower protections for covered employees, and Attorney General annual reporting of anonymized, aggregated covered-employee reports. None of these instruments creates a comprehensive public database of all AI harms.

The governance lesson is therefore modest and important: incident reporting must be designed as a chain. Public reports reveal patterns. Internal logs preserve evidence. Regulators receive serious notices. Courts and auditors test disputed facts. Public registers, procurement files, audit trails, safety cases, and post-market monitoring records connect the incident to the system that produced it.

What Counts as an Incident

The definition matters because every definition builds a boundary around public memory.

The OECD's AI Incidents and Hazards Monitor distinguishes between an AI incident, where the development, use, or malfunction of an AI system leads to actual harm, and an AI hazard, where it could plausibly lead to harm. The listed harm categories include injury or health harm, disruption of critical infrastructure, rights or legal violations, and harm to property, communities, or the environment.

That split is useful. If the threshold is only proven catastrophe, the record arrives too late. If the threshold is any bad feeling about a tool, the database becomes noise. Governance needs both categories: incidents for what happened, hazards for what almost happened or could reasonably happen under nearby conditions.

An incident report should therefore preserve more than a narrative. At minimum, it should identify the deployed system or model family, provider, deployer, version or date range where known, affected workflow, harm or hazard type, affected people, evidence status, uncertainty, immediate containment, corrective action, disclosure tier, and the owner responsible for follow-up. Without those fields, the report may still be morally important, but it is harder to use for prevention.

The AI Incident Database takes a historically broad approach. Its own explanation compares AI incident collection to transportation safety and computer vulnerability repositories. It invites reports across domains and says the project is meant to converge on shared criteria through use. That breadth is valuable because AI is not one industry. It is a family of systems entering cars, phones, courts, hospitals, schools, police departments, warehouses, hiring pipelines, creative tools, public benefits, social feeds, and intimate chat interfaces.

But breadth has a cost. An autonomous-vehicle death, a biased photo filter, a hallucinated citation, a deepfake scam, and a chatbot-linked self-harm allegation are not the same kind of event. They may need different causal analysis, evidence standards, severity scales, and remedies. A useful incident culture must preserve variety without flattening every failure into the same moral category.

The Public Databases

The present public memory layer is already plural.

The OECD AI Incidents and Hazards Monitor builds an evidence base for policymakers and practitioners by drawing from reputable international news sources and classifying reported events. Its methodology page is careful about limits: news-based monitoring captures only a subset of incidents and hazards, and the OECD does not independently verify every third-party article. That caveat should be treated as a feature of intellectual honesty, not a weakness to ignore.

The AI Incident Database is more community-oriented and research-facing. It indexes reports, supports taxonomies, accepts submissions, and is maintained by the Responsible AI Collaborative with a broad contributor ecosystem. It is especially important because it treats AI failure as a collective record rather than a sequence of disconnected anecdotes.

AIAAIC, the AI, Algorithmic, and Automation Incidents and Controversies repository, adds another public-interest layer. It tracks incidents and controversies across AI, algorithms, and automation, and includes taxonomies for ethical issues, external harms, consequences, news triggers, and responses. Its scope is wider than foundation models, which is exactly the point: algorithmic governance did not begin with chatbots.

Together these projects do something institutions often fail to do. They let patterns accumulate. One wrongful arrest might be dismissed as an edge case. A sequence of wrongful facial-recognition arrests becomes a governance problem. One hallucinated legal citation may look like lawyer negligence. A pattern of hallucinated citations becomes a professional-responsibility and tool-design problem. One synthetic-media fraud can be treated as crime. A rising class of synthetic-media fraud becomes infrastructure policy.

This is where incident reporting connects to AI registers, audit trails, post-market monitoring, procurement, and vendor governance. A public incident row is much more useful when it can point to the system's register entry, the vendor contract, the model or system card, the audit record, the complaint route, and the change log. Otherwise the incident lives in one database while the deployed system lives in another bureaucracy.

The facial-recognition example is not hypothetical. In January 2020, Detroit police arrested Robert Williams on his front lawn, in front of his wife and two young daughters, on the strength of a false facial-recognition match to surveillance footage of a watch theft he had nothing to do with. His was the first wrongful arrest from facial recognition to come to public light in the United States, and the ACLU's later lawsuit ended in a landmark 2024 settlement that reshaped Detroit's use of the technology. Crucially, Williams's case was not alone: it became one of three known wrongful arrests tied to the same department's use of face recognition, and all three of the people arrested were Black. A single arrest could be filed as error. Three, read together, name a system. That is exactly the work an incident record does: it turns the second and third occurrence into evidence rather than coincidence.

The incident database is a weak signal amplifier. It converts scattered harm into searchable memory.

Law Enters the Logbook

Voluntary databases are not enough. They see only what journalists, researchers, victims, whistleblowers, companies, and volunteers can surface. The next phase is legal reporting.

The EU AI Act creates serious-incident duties for high-risk AI systems. Article 73 requires providers of high-risk AI systems placed on the Union market to report serious incidents to market surveillance authorities in the member states where the incident occurred. The ordinary deadline is no later than 15 days after the provider, or in some cases deployer, becomes aware of the incident, once a causal link or reasonable likelihood has been established. The Act sets shorter timelines for especially severe cases, including not later than two days for certain widespread infringements and not later than 10 days in the event of death.

The EU has also moved toward templates for serious incidents involving general-purpose AI models with systemic risk. In November 2025, the European Commission published a reporting template for such providers under Article 55, tied to the GPAI Code of Practice and the AI Office. That detail matters because general-purpose models create reporting problems that older product categories do not. The same model may be embedded in thousands of downstream systems, with different prompts, tools, safeguards, customers, and jurisdictions.

California's SB 53, signed in September 2025, adds a U.S. state-level frontier-model reporting path. The Governor's announcement described the law as creating a mechanism for frontier AI companies and the public to report potential critical safety incidents to California's Office of Emergency Services. The California Attorney General's SB 53 page adds whistleblower-relevant detail: covered employees responsible for assessing or addressing risk may report certain dangers or violations, and the Attorney General must produce annual anonymized, aggregated information about covered-employee reports.

This is a quiet but important shift. The incident report is becoming a legal object. It is no longer only a public-interest spreadsheet or a postmortem blog post. It can become a duty, a protected disclosure, a regulator's input, a template, a deadline, a post-market monitoring trigger, and eventually evidence in enforcement.

The shift also creates a source problem. Legal reporting duties do not automatically create public knowledge. Some reports will be confidential, delayed, anonymized, aggregated, or withheld for privacy, security, trade-secret, law-enforcement, or investigation reasons. Public memory therefore needs disclosure tiers: enough public information to learn from patterns, enough protected information for regulators and auditors to test facts, and enough privacy discipline to avoid turning victims' records into spectacle.

Why Memory Is Hard

AI incidents are difficult to record because AI systems are rarely single objects.

A model output may depend on training data, fine-tuning, retrieval sources, system prompts, user prompts, memory, moderation layers, tool permissions, API settings, application code, ranking systems, plug-ins, user behavior, deployment context, and organizational incentives. When harm occurs, the question "what caused it?" may not have one clean answer.

That complexity creates predictable failure modes.

First, underreporting. Many harmed people do not know an AI system was involved. Others lack time, legal support, technical literacy, or a safe channel. Companies may detect failures privately and fix quietly. Workers may fear retaliation. Users may feel shame, especially when the incident involves intimacy, mental health, fraud, or a humiliating automated decision.

Second, attribution fog. A company can say the user misused the tool. A deployer can blame the model provider. A model provider can blame the application wrapper. A regulator can lack access to logs. A journalist can report harm without being able to inspect the system. The record then contains the social fact of harm but not a settled technical cause.

Third, severity mismatch. Catastrophic-risk reporting is necessary, but many AI harms are cumulative, distributed, and ordinary. A single automated denial, false accusation, manipulative companion exchange, or hallucinated answer may not meet a legal threshold. At scale, those failures can reshape institutions.

Fourth, privacy tension. Good incident records need enough detail to teach. But incidents can include medical details, legal claims, chat logs, intimate disclosures, minors, workplace records, trade secrets, security vulnerabilities, and ongoing investigations. A public memory layer can become a second harm if it exposes victims or teaches attackers.

Fifth, narrative capture. Whoever writes the first incident narrative may define the public lesson. A company can frame an incident as misuse. An activist can frame it as proof of a total system failure. A regulator can frame it as compliance. A database editor can make a classification choice that later researchers inherit.

Sixth, evidence decay. Logs expire, prompts are overwritten, model versions change, vendors rotate, user interfaces are redesigned, and staff leave. If evidence-preservation rules begin only after a public scandal, the record may already be too thin to explain what happened.

Seventh, memory fragmentation. The complaint sits in customer support, the serious-incident report goes to a regulator, the model update appears in a release note, the audit finding stays under NDA, and the public register remains unchanged. Fragmented memory lets an institution claim it responded while no one can see the whole pattern.

None of these problems argue against incident reporting. They argue for better incident discipline.

A Better Incident Culture

A mature AI incident culture should borrow from aviation, cybersecurity, medicine, labor safety, and public administration without pretending AI is identical to any of them.

First, preserve the event trail. Logs, model versions, prompts, retrieval sources, tool calls, user-facing outputs, moderation decisions, timestamps, human approvals, and downstream actions should be retained when a serious event is suspected. Without reconstruction, every explanation becomes public relations.

Second, separate blame from learning early. Some incidents require liability, enforcement, discipline, or criminal investigation. But if every report is treated first as legal exposure, organizations will hide weak signals. Near misses, hazards, and user reports need channels that support learning before the evidence disappears.

Third, protect reporters. Employees, contractors, users, auditors, researchers, and affected communities need safe routes to report. California's SB 53 whistleblower provisions are important because frontier-model risk knowledge often sits inside private organizations before regulators or the public can see it.

Fourth, use severity tiers. A death, a critical-infrastructure disruption, a rights violation, a dangerous model capability escape, a privacy breach, a hallucinated source, and a manipulative companion interaction require different reporting timelines and audiences. The system should not force every case through one gate.

Fifth, track remedies, not only harms. A useful database should ask what changed: model update, product recall, policy revision, disclosure, appeal, compensation, access restriction, audit, warning label, training change, procurement pause, or no action. Institutional response is part of the incident.

Sixth, make uncertainty explicit. Incident records should distinguish alleged, confirmed, disputed, and unresolved facts. The public needs to know whether a record is based on court documents, regulator findings, company logs, journalism, user testimony, or research replication.

Seventh, connect incidents to procurement and evaluation. Governments, schools, hospitals, courts, and employers should not evaluate AI vendors only by benchmark claims and demo performance. They should ask what incidents have occurred, how they were handled, what the vendor reports voluntarily, and what logs will be available if the system fails locally.

Eighth, use stable identifiers. The same system or deployment should be traceable across the system inventory, public register, vendor contract, AI bill of materials, audit trail, incident report, corrective action, and retirement record. If every record names a different artifact, accountability breaks at the join.

Ninth, publish aggregates when case details are unsafe. Privacy, security, and investigation constraints are real. But they should not erase the fact that a category of incident exists. Annual aggregates, severity tiers, remedy counts, and policy-change summaries can preserve learning while protecting people.

Tenth, feed incidents back into safety cases. A serious incident should update the relevant safety case, evaluation suite, monitoring plan, user notice, procurement condition, and incident response plan. If the report does not change the system, it is archive without repair.

Incident reporting is not an after-action accessory. It is part of system design.

What This Changes

Model-mediated reality makes failure look fluent.

The answer arrives in polished prose. The label appears in the interface. The risk score looks administrative. The synthetic voice sounds calm. The agent completes a workflow. The dashboard says the system is operating. The user sees surface order and may not know where to locate responsibility when the surface breaks.

An incident report punctures that surface. It says: here is where the machine entered the world, here is where the world pushed back, here is who was harmed, here is what remains uncertain, here is what the institution did next.

That record is a reality anchor. It resists the drift from failure into mood, scandal, denial, myth, and brand management. It gives future builders something colder than inspiration and more useful than outrage. It lets a society say: we have seen this pattern before.

But incident reporting can also become ritual. A company files. A regulator receives. A database records. A transparency report names categories. Everyone points to the existence of a process while the same incentives continue underneath. In that failure mode, the incident report becomes another symbol of control standing in for control itself.

The standard should be harder. A serious incident culture must preserve evidence, protect reporters, classify uncertainty, identify repeated patterns, force remediation, and widen the risk map beyond spectacular catastrophe. It must include the harms that arrive through ordinary workflows: education, work, welfare, search, intimacy, public records, fraud, media, and institutional decision-making.

The future will not be governed by prediction alone. It will be governed by what institutions remember after prediction fails.

Source Discipline

Incident analysis depends on separating source types. OECD definitions and methodology explain terms and monitoring limits. The AI Incident Database and AIAAIC are public-interest repositories, not courts. ACLU case pages and settlement records are litigation and advocacy sources; they are strong for the facts they document, but they are not a census of every facial-recognition incident. EU AI Act text and Commission templates state legal duties and reporting forms, not public disclosure of every report. California Governor and Attorney General pages explain SB 53's frontier-model reporting and whistleblower mechanisms, not a broad U.S. AI incident law.

Every incident record should therefore carry source status: alleged, reported, confirmed, settled, adjudicated, regulator-notified, company-disclosed, user-reported, journalist-documented, independently audited, or unresolved. Counts should travel with a date and method. "No incidents found" may mean no incidents occurred, no one reported them, logs were unavailable, the public monitor did not capture them, or the institution classified them differently.

Public memory must also respect safety and privacy limits. Do not publish private victim details, minor information, raw intimate logs, credentials, live exploit instructions, active investigation details, or enough technical information to reproduce an attack. The useful public record names the pattern, evidence status, affected domain, remedy, and responsible institution without turning the report into a second harm.

Sources

OECD, AI risks and incidents, reviewed June 19, 2026.
OECD.AI, Overview and methodology of the AI Incidents and Hazards Monitor, reviewed June 19, 2026.
OECD.AI, Name it to tame it: Defining AI incidents and hazards, May 17, 2024, reviewed June 19, 2026.
OECD, Defining AI incidents and related terms, May 2024, reviewed June 19, 2026.
AI Incident Database, About, reviewed June 19, 2026.
Sean McGregor, Preventing Repeated Real World AI Failures by Cataloging Incidents: The AI Incident Database, 2020, reviewed June 19, 2026.
AIAAIC, AIAAIC Repository User Guide, and OECD.AI catalogue entry for the AIAAIC Repository, reviewed June 19, 2026.
American Civil Liberties Union, Williams v. City of Detroit, on Robert Williams's 2020 wrongful arrest from a false facial-recognition match and the resulting settlement, reviewed June 19, 2026.
European Union, Regulation (EU) 2024/1689, Artificial Intelligence Act, official text, especially Article 73.
European Commission AI Act Service Desk, Article 73: Reporting of serious incidents, reviewed June 19, 2026.
European Commission, reporting template for serious incidents involving general-purpose AI models with systemic risk, November 4, 2025, reviewed June 19, 2026.
Governor of California, Governor Newsom signs SB 53, September 29, 2025, reviewed June 19, 2026.
California Department of Justice, Catastrophic Risks in Artificial Intelligence Foundation Models, reviewed June 19, 2026.
NIST, Artificial Intelligence Risk Management Framework (AI RMF 1.0), January 2023.
Related references: AI Incident Reporting, The AI Register Becomes Public Memory, AI Audit Trails, AI Post-Market Monitoring, AI Audits and Assurance, EU AI Act, AI Liability and Accountability, Agent Audit and Incident Review, Incident and Complaint Protocol, Transparency and Public Registers, and Research and Editorial Integrity.

Return to Blog