Blog · arXiv Analysis · Last reviewed June 25, 2026

The Belief Trace Becomes the Persuasion Ledger

PersuasionTrace matters because it asks where belief moved inside the conversation, not only whether a pre/post score changed after the conversation ended.

Beyond the Endpoint

Persuasion is usually counted after the fact: ask what someone believed before an intervention, expose them to a message, ask again, and calculate the movement. That is useful, but it flattens the conversation. It does not show whether belief moved after the first appeal, drifted back after resistance, changed only when the topic became personal, or stayed stable while the final answer shifted from fatigue or politeness.

That flattening becomes dangerous when persuasive systems are optimized. If a model, product team, campaign, or safety researcher sees only the endpoint, it can miss the process by which influence happened. A persuasion trace records when the interface asked for belief and how the reported belief changed.

The Paper

arXiv lists A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing as arXiv:2606.05330v1 [cs.CL], submitted June 3, 2026. The authors are Jared Moore, Noah Goodman, Nick Haber, and Max Kleiman-Weiner, with Stanford University and University of Washington listed on the paper.

The paper introduces PersuasionTrace, a web-based framework for human-LLM persuasion studies. It records standard pre/post belief change and turn-level belief reports during the dialogue. It also annotates persuader messages along rhetorical dimensions: logos, pathos, and ethos.

For human data collection, the paper used LLM persuaders and human targets recruited through Prolific. The reported analyses cover 255 completed rounds, where one round means one pre-survey, dialogue, and post-survey on a single proposition. The main text says participants were U.S.-based and English-speaking, and that the study was approved by the authors' institutional IRB.

What the Trace Records

Each target first reports belief in a proposition on a 0 to 100 scale. The persuader is then assigned to support or oppose the proposition based on the target's starting belief. After each persuader message, the target reports belief again, producing a trajectory from pre-belief through intermediate turns to post-belief.

The paper's default human analyses used text-based four-turn dialogues, a 10-minute cap, DebateGPT propositions, and an LLM persuader identified as gpt-5-2025-08-07. It also studied personalized propositions, where participants provided personally relevant decisions that were validated and rephrased into agree/disagree propositions, and an audio condition where participants could speak while seeing transcripts.

What It Found

The behavioral findings are careful but pointed. Across the standard, personalized, and audio cohorts in Figure 2, the authors report that LLM persuaders outperformed control dialogues, with p-values below 0.001 for standard text, below 0.001 for personalized text, and 0.002 for audio.

The trace adds information that a post-score would hide. In one trajectory analysis, the paper reports two separable update patterns: a low-shift cluster with 44 cases and mean end-delta 0.039, and a larger-shift cluster with 40 cases and mean end-delta 0.437. The higher-shift group moved early, then partly drifted back. That is the governance value of the trace: it preserves the shape of influence rather than only the final displacement.

The rhetoric findings are narrower. On the H-STANDARD cohort with N = 32, ethos was negatively associated with persuasion delta, while logos and pathos were not distinguishable from zero. The paper also finds a negative ethos association in the larger DebateGPT comparison. The authors treat this as exploratory.

The Simulated Target

The second half of the paper builds a Bayesian-network simulated target. Instead of prompting a generic LLM to role-play a persuaded person, the simulator maintains an explicit latent belief state over related proposition nodes, updates those beliefs from message atoms, and then verbalizes a response.

In an LLM-judge human-likeness evaluation, the Bayesian-network target scored 81.3 compared with a human reference score of 80.0, while the unstructured LLM target scored 64.7 and the structure-conditioned LLM target scored 64.2. Those numbers should not be read as proof that the simulator is human. They show that simulator choice can materially change evaluation results, which matters if simulated targets are later used as optimization objectives.

Governance Reading

This page belongs beside persuasion contests, partisan persona tests, teen manipulation datasets, memory-based persuasion, and AI persuasion. The fresh contribution is process evidence. A system that changes belief should leave more than a before-and-after score.

Persuasion traces could support audits: when did belief move, what appeal preceded movement, did the system exploit a personal decision, did the user resist and then relent, did audio change the interaction, and did simulated-target optimization transfer to real people? The same trace could also become a manipulation dashboard. That is why the paper's dual-use warning matters. Measurement should constrain persuasion systems, not merely improve them.

Limits

The paper states several limits directly. The primary outcome is self-reported numeric belief, repeatedly measured during a dialogue; the measurement itself can affect behavior. Many propositions are subjective, so there is no ground-truth "correct" belief to incentivize. Building proposition-specific Bayes nets may be difficult at scale, and the simulator does not model many social and affective mechanisms such as relational trust, identity threat, or peripheral-route influence.

Several findings are descriptive rather than causal. The rhetoric analysis is correlational and based on a small annotated subset. Cross-cohort comparisons should be read cautiously because cohorts were collected in different time windows with quota-based assignment. The page-level lesson is that belief movement needs a process record before anyone treats persuasive optimization as evidence of benefit.

Persuasion Receipt

A persuasion receipt should record: proposition, starting belief, assigned persuader stance, modality, model and prompt version, user-provided personal topic status, each persuader turn, each reported belief, rhetorical annotations, control condition, simulator type, judge configuration, consent and debrief status, and limits on use. The audit-grade sentence is: this belief movement was observed under these conditions, with these measurement effects, and should not be generalized into permission to optimize influence without safeguards.

Sources

Jared Moore, Noah Goodman, Nick Haber, and Max Kleiman-Weiner, A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing, arXiv:2606.05330v1 [cs.CL], submitted June 3, 2026.
Primary arXiv versions checked: PDF and experimental HTML, reviewed for title, authorship, date, participant counts, conditions, belief-tracing protocol, rhetoric annotations, simulator results, ethics statement, data/code note, and limitations.
Related pages: The Persuasion Contest Becomes the Expert Benchmark, The Partisan Persona Becomes the Persuasion Test, The Teen Message Becomes the Manipulation Dataset, The Persuasion Engine Gets a Memory, and AI Persuasion.

Return to Blog