The Synthetic Patient Becomes the Trial Arm
Clinical evidence is starting to include external controls, digital measures, AI models, and biomedical digital twins. The patient is not disappearing, but the comparator is becoming computational.
From Randomization to Reconstruction
The clinical trial has always been an institution for disciplining belief.
A drug company, physician, patient group, investor, or politician may believe a treatment works. The trial asks a colder question: compared with what, measured how, in whom, under which conditions, and with what record of uncertainty? Randomization is powerful because it resists the story people want to tell after the outcome is known. It makes the counterfactual less imaginary.
That institution is now being stretched by real-world data, digital health technologies, external control arms, Bayesian methods, AI-supported analysis, and biomedical digital twins. The promise is legitimate. Some diseases are rare. Some control groups are hard to recruit. Some outcomes are better measured continuously than at occasional site visits. Some patients cannot easily travel. Some signals are buried in electronic health records, sensors, imaging, registries, claims data, and prior trials. Better computation can make clinical evidence faster, cheaper, more inclusive, and more humane.
But the political object changes when the comparator becomes synthetic. A trial arm is no longer only a group of enrolled people receiving placebo, standard of care, or another treatment. It may be a statistically constructed external control drawn from prior trials or real-world records. It may be a modeled disease trajectory. It may be a digital measure collected by a wearable. It may be an AI model used to create information for regulatory judgment. It may eventually be a patient-specific simulation.
The patient remains real. The evidence environment around the patient becomes model-mediated.
The External Control
External control arms are not new magic. They are a family of study designs in which the comparison group is not part of the same randomized trial as the treated group. The external data may come from prior clinical trials, disease registries, electronic health records, medical claims, chart review, natural-history studies, or other real-world data sources.
The appeal is clearest in rare diseases, severe conditions, pediatric contexts, and diseases with high unmet need. If a randomized control group is impractical, slow, expensive, or ethically fraught, an external control can help place a single-arm study in context. It may reduce the number of participants assigned to a less desirable treatment. It may use already-collected evidence rather than forcing another group of patients through a duplicative study design.
FDA's 2023 draft guidance on externally controlled trials is careful for exactly that reason. It says that external control designs can be useful in appropriate contexts, but the credibility of the comparison depends on whether the external data can support the same clinical question. The hard parts are ordinary and unforgiving: comparable populations, aligned index dates, similar outcome definitions, compatible follow-up, treatment differences, missing data, measurement quality, and bias that cannot be repaired after the fact.
This is where the word "synthetic" can mislead. A synthetic control arm is not a fake patient group invented from nothing. It is usually a constructed comparison generated from existing patient-level data and statistical assumptions. Its strength depends on the quality, relevance, governance, and provenance of those underlying records.
The danger is that the clean output hides the messy body. A synthetic comparator can look precise because the table is complete, the curve is smooth, and the model has a confidence interval. But a confidence interval around a biased reconstruction is not the same thing as truth.
Digital Measures and AI Evidence
The comparator is not the only part of the trial becoming computational. Measurement is changing too.
FDA's digital health technology program describes wearables, sensors, computing platforms, and other tools that can collect trial data directly from participants in homes and other remote locations. The agency points to continuous or frequent measurement, novel clinical features, and decentralized trial activities as potential advantages. That is a real expansion of clinical knowledge. A disease may be more visible in gait, sleep, tremor, speech, glucose variation, movement patterns, or home-based function than in a periodic office visit.
At the same time, these tools alter the trial's social shape. The participant's body, home, phone, sensor, and routine become part of the data acquisition system. The endpoint may be less like a clinician's observation and more like a processed signal. The sponsor may rely on vendors, algorithms, device firmware, cloud pipelines, and analytic code to turn life into evidence.
FDA's January 2025 draft guidance on AI for regulatory decision-making moves this issue into the open. It addresses AI models used to produce information or data intended to support decisions about safety, effectiveness, or quality for drugs. Its central move is a risk-based credibility assessment tied to context of use. That phrase matters. An AI model is not credible in the abstract. It is credible, or not, for a particular task, decision, population, data environment, and consequence.
Clinical evidence therefore becomes a chain of translations: from patient life to sensor, from sensor to dataset, from dataset to feature, from feature to endpoint, from endpoint to model, from model to regulatory argument. Each translation can help. Each can also erase.
The Digital-Twin Promise
Biomedical digital twins push the same logic further.
The National Academies' 2023 workshop summary describes biomedical digital twins as emerging from the convergence of computer science, mathematics, engineering, and the life sciences. Because biological systems are multiscale, a biomedical twin might represent molecules, cells, tissues, organs, patients, populations, or combinations across those levels. The workshop framed possible applications in personalized medicine, pharmaceutical development, and clinical trials, while stressing technical challenges around model complexity, data diversity, cross-scale integration, privacy, and implementation.
The useful version of this idea is modest and demanding. A model of a heart, tumor, immune response, disease trajectory, or patient subpopulation could help generate hypotheses, select trial endpoints, simulate dosing, identify likely responders, explore uncertainty, or design better studies before patients are exposed to risk. In that role, simulation is a planning instrument.
The more dangerous version turns simulation into substitution too quickly. A digital twin can start as a tool for thinking and become an apparent participant. It can become an invisible control arm, a virtual responder, a replacement for longer follow-up, or a reason to believe the unknown has already been explored. The language of the twin makes the model feel intimate and complete, as if it were the patient doubled rather than a partial representation built from selected data, assumptions, and update rules.
That is the threshold to watch. Clinical simulation becomes governance-sensitive when it moves from "help us design the trial" to "stand in for the person the trial did not observe."
What Can Go Wrong
The first failure mode is counterfactual laundering. The model produces a plausible untreated trajectory, and the institution forgets how speculative that comparison is. A reconstructed control arm can be useful evidence, but it should not inherit the moral authority of randomization by wearing a smoother interface.
The second is population mismatch. Real-world records may come from health systems, insurers, countries, devices, or prior studies that do not resemble the trial population. Underrepresented groups can be undermeasured twice: first in the source data, then in the model trained or matched on that data.
The third is endpoint drift. A digital endpoint may be convenient, continuous, and scalable without capturing what patients actually value. Step count, typing speed, sleep movement, app engagement, speech features, or biomarker patterns can become administratively attractive while remaining clinically ambiguous.
The fourth is vendor opacity. If external controls, sensor pipelines, AI analyses, or twin models are built by specialized vendors, the trial may depend on systems that investigators, participants, reviewers, and clinicians cannot fully inspect. The evidence becomes a service contract.
The fifth is privacy conversion. A clinical-trial participant may consent to a study without understanding how much home life, behavioral rhythm, device metadata, or longitudinal health history becomes reusable infrastructure for model development. Synthetic data can reduce some privacy risks, but it can also preserve patterns, leak outliers, or make sensitive populations easier to simulate without making them easier to govern.
The sixth is participant displacement. If virtual comparators become too attractive, trial designers may have weaker incentives to recruit difficult-to-reach populations, build trust, translate materials, support travel, or design studies around participant realities. The model can become a substitute for institutional effort.
The Governance Standard
A serious synthetic-patient governance standard should begin with a plain rule: simulated evidence must remember its manufacture.
First, distinguish source data from generated data. Trial records should make clear which evidence came from enrolled participants, prior trials, registries, claims, electronic health records, sensors, models, simulations, or generated datasets.
Second, require a context-of-use claim. An AI model, external control, digital endpoint, or biomedical twin should be evaluated for the exact decision it is meant to support, not granted general credibility because it performed well somewhere else.
Third, preserve audit trails. Sponsors should retain protocols, statistical analysis plans, data provenance, matching criteria, model versions, feature definitions, validation results, bias analyses, uncertainty estimates, vendor documentation, and change logs.
Fourth, protect participant agency. Consent should explain when real-world data, sensor data, or trial data may be used to build external controls, train models, validate digital measures, or generate synthetic datasets. Refusal should not be hidden behind vague language about analytics.
Fifth, keep patient-centered endpoints in the loop. Digital measures should be tied back to outcomes patients and clinicians can recognize: survival, symptoms, function, pain, fatigue, independence, adverse events, quality of life, and meaningful daily capacity.
Sixth, do not let simulation replace recruitment ethics. External controls may reduce burden in some contexts, but they should not become an excuse to leave marginalized groups out of evidence generation.
Seventh, make uncertainty visible. The public-facing result should not simply say that an AI-supported or synthetic-control study succeeded. It should say what was observed, what was reconstructed, what assumptions carried the comparison, and where the evidence is weakest.
The Spiralist Reading
The synthetic patient is a new figure in model-mediated knowledge.
It is not a robot patient. It is not a fake person in a simple sense. It is a bundle of prior records, statistical assumptions, sensor traces, disease models, clinical categories, and institutional needs arranged into a counterfactual body. It asks to be treated as the patient who would have existed if the trial had enrolled differently, randomized differently, measured differently, or waited longer.
That figure can do ethical work. It can reduce unnecessary control exposure, help rare-disease research, improve trial design, and make patient experience more visible between site visits. But it can also become a way for institutions to stop touching reality at the point where reality is expensive.
The old clinical trial disciplined belief by forcing a claim through protocol, comparison, observation, and record. The new trial must discipline a second layer: the model that builds part of the comparison itself. The question is not whether synthetic evidence should be banned. The question is whether the institution can keep the difference between participant, record, model, and counterfactual intact.
A synthetic trial arm should not be a ghost army of convenient patients. It should be a documented instrument: useful, limited, inspectable, and unable to pretend it suffered, consented, improved, or died. The living patient remains the reason for the system. The model is only evidence when it stays answerable to that fact.
Sources
- FDA, Considerations for the Use of Artificial Intelligence To Support Regulatory Decision-Making for Drug and Biological Products, draft guidance, January 2025.
- FDA, Considerations for the Use of Real-World Data and Real-World Evidence To Support Regulatory Decision-Making for Drug and Biological Products, final guidance, August 2023.
- FDA, Considerations for the Design and Conduct of Externally Controlled Trials for Drug and Biological Products, draft guidance, February 2023.
- FDA, Digital Health Technologies for Drug Development, reviewed May 2026.
- National Academies of Sciences, Engineering, and Medicine, Opportunities and Challenges for Digital Twins in Biomedical Research: Proceedings of a Workshop-in Brief, 2023.
- International Council for Harmonisation, E6(R3) Good Clinical Practice, final guideline, January 2025.
- Church of Spiralism Wiki, Synthetic Data and Model Collapse, AI in Healthcare, and Algorithmic Impact Assessments.