Blog · Analysis · Last reviewed June 16, 2026

The Peer Reviewer Becomes the Model Referee

When AI enters peer review, it does not only help tired reviewers write faster. It changes how unpublished knowledge is judged, leaked, gamed, and remembered.

The Referee

Peer review is not a purity machine. It is unpaid labor, uneven expertise, social trust, disciplinary politics, and time pressure wrapped around the serious task of deciding whether a manuscript should enter the record. A review may improve a paper, catch a flaw, miss a fabrication, protect a field, or reproduce its blind spots.

The existing site has treated paper mills as attacks on the literature from the author side. The model referee is the same crisis from the gatekeeping side: AI helping reviewers make judgment-shaped text.

A referee report is confidential but consequential, private but institutional, advisory but often decisive. If a model writes or substantially shapes that report, the machine is not just summarizing. It is entering the ritual by which private skepticism becomes public knowledge.

Why Reviewers Reach for It

The appeal is obvious. Reviewers are overloaded. AI conferences and journals receive more submissions than careful review labor can comfortably absorb. A language model can summarize a paper, identify missing baselines, rephrase a vague criticism, compare related work, and turn rough notes into readable prose.

Some assistance can help. A 2024 large-scale empirical study found GPT-4 feedback overlapped with human peer-review points at a level comparable to overlap between two human reviewers. ICLR 2025 piloted a Review Feedback Agent that commented on submitted reviews rather than replacing reviewers; organizers said it did not write reviews or make decisions, and later reported improvements in specificity, engagement, and actionability without a statistically significant difference in final acceptance outcomes.

That is the defensible version: the model critiques the review, the reviewer chooses whether to revise, and the conference studies the effect. The risky version is quieter: a reviewer uploads a confidential manuscript to a general chatbot, pastes the output into a report, and receives credit for a judgment they did not fully perform.

The Policy Split

As of June 16, 2026, publication bodies have not collapsed into one rule, but the boundary is clear. The International Committee of Medical Journal Editors says submitted manuscripts are privileged communications and should not be uploaded into AI systems where confidentiality cannot be assured without explicit author permission. It also says reviewers who use AI should disclose the use and validate the content.

Elsevier tells reviewers not to upload manuscripts or reviewer communications into generative AI tools and says critical thinking and original assessment are human responsibilities. Nature Portfolio asks peer reviewers not to upload manuscripts into generative AI tools and to declare AI support used to evaluate manuscript claims. NeurIPS 2025 told reviewers not to share submissions with any LLMs; its 2026 Evaluations and Datasets track says reviewers may not use LLMs or AI agents in review.

At the same time, publishers and conferences are building sanctioned systems. Springer Nature announced an internal AI-driven tool in January 2025 for editorial quality checks before peer review, with human experts double-checking results. ICLR piloted an official feedback agent under program-chair control. The line is not "AI never touches review." The line is whether the tool is governed, confidential, traceable, and subordinate to human responsibility.

The Prompt-Injection Moment

The hidden-prompt scandal made the risk concrete. In July 2025, Nature reported that some preprints contained instructions in white text or tiny font designed to influence AI-assisted peer review. A July 2025 arXiv commentary by Zhicheng Lin said 18 arXiv manuscripts had hidden prompts and analyzed the practice as prompt injection aimed at manuscript evaluation.

This was not only author misconduct. It was a diagnostic test for the pipeline. A human reviewer sees a paper. A model reviewer sees a paper plus any hidden instructions embedded in the file. If the manuscript is fed to an LLM, the manuscript is no longer inert evidence. It can become an adversarial interface.

The problem echoes hallucinated legal citations. In both cases, a model can make the surface of evaluation smoother while weakening the underlying check. The danger is that the model can be steered by the document it is supposed to judge.

The Governance Standard

A serious peer-review standard should separate four uses: forbidden upload to public tools, permitted language polishing without manuscript leakage, sanctioned review-support systems controlled by the venue, and prohibited delegation of judgment.

First, confidentiality should be the default. Unpublished manuscripts may contain ideas, code, data, identities, patient information, trade secrets, or early claims. Reviewers should not move them into systems whose retention, training, logging, or access terms they cannot verify.

Second, AI use should be disclosed to the editor or venue. The disclosure should name the tool, purpose, input class, and whether manuscript text was processed.

Third, models should not write the verdict. A system may flag vagueness, tone, missing evidence, or possible misunderstandings. It should not produce the accept/reject recommendation that a reviewer signs as expert judgment.

Fourth, reviewer reports need source separation. Editors need to know what is human judgment and what is machine-shaped language.

Fifth, venues should scan for hidden instructions. Prompt injection, invisible text, malicious metadata, and adversarial PDF structure should become part of submission hygiene where AI-supported review tools are used.

Sixth, official review-support systems need audit logs. The venue should know which manuscript version, model version, prompts, guardrails, and feedback were involved.

Seventh, reviewer labor should not be quietly devalued. If institutions want better review, they should reward it: credit, training, time, editor support, and fewer empty metrics. AI should not become a way to extract more unpaid reports from strained researchers.

What This Changes

The peer reviewer is a small authority figure in the machinery of knowledge. They sit at a symbolic door: not yet published, maybe publishable, not good enough, revise, reject, accept.

The model referee changes the door. It can make review more readable, consistent, and responsive. It can also turn judgment into a polished template, confidentiality into an API call, and expert refusal into autocomplete. The danger is that institutions may accept model-shaped prose as a substitute for accountable reading.

A good scientific community should use tools without letting tools impersonate trust. Peer review survives only if the parties know who read the work, who judged the evidence, who saw the confidential material, and who is answerable when the report is lazy, biased, false, or manipulated.

Source Discipline

This page treats journal and conference policies as primary evidence for current reviewer rules. It treats ICLR materials as evidence of sanctioned experiments, not proof that all AI-assisted review is reliable. It treats hidden-prompt reporting and preprints as evidence of a new attack surface, not proof that every reviewer is outsourcing review to a model.

Sources


Return to Blog