The Synthetic Voice Enters the Ballot
AI voice cloning turns political trust into an infrastructure problem. The question is no longer only whether a message is true. It is whether the voice, number, sponsor, consent trail, and channel can be trusted before the message changes behavior.
The Voice That Was Not There
Two days before New Hampshire's January 2024 presidential primary, voters received robocalls that sounded like President Joe Biden. The message told them not to vote in the primary and to save their votes for November. The voice was synthetic. The caller ID was spoofed. The number appeared to belong to a local political figure who was not involved.
The case became one of the clearest early public examples of AI voice cloning used as election interference in the United States. It was also more specific than the usual phrase "deepfake election threat" suggests. The harm did not come from a viral video that fooled the whole country. It came from a targeted communication channel, a familiar voice, a misleading caller ID signal, and timing close enough to an election that correction would be difficult.
The Federal Communications Commission later imposed a $6 million forfeiture on political consultant Steve Kramer for an illegal robocall campaign that used misleading caller ID information with intent to defraud and cause harm. The FCC's order said the campaign sent thousands of calls, used a deepfake AI voice imitating Biden, and encouraged potential voters not to vote in the upcoming primary. A federal court order in the related League of Women Voters case described approximately 10,000, and perhaps significantly more, robocalls directed to New Hampshire residents believed likely to be Democratic voters.
The lesson is not that one synthetic voice nearly broke an election. The lesson is that a small, cheap act can force election systems, telecom networks, campaigns, journalists, courts, platforms, and voters to spend scarce time determining whether the voice of political authority is real.
Current Context
As of June 16, 2026, synthetic voice has moved from a hypothetical election-risk category into enforcement, election-administration guidance, telecom compliance, and state legislation. The FCC's February 2024 declaratory ruling confirmed that the Telephone Consumer Protection Act's restrictions on artificial or prerecorded voices encompass current AI technologies that generate human voices. That does not ban every generated political voice; it means covered calls using such voices fall under TCPA consent, identification, emergency-purpose, and exemption rules. The FCC's later 2024 NPRM proposed additional AI-generated-call definitions and disclosure duties, but that proposal should be treated as proposed rulemaking unless and until a final rule is adopted.
Election administrators now treat AI as an accelerator of familiar risks. The U.S. Election Assistance Commission warns that AI-generated text, images, video, and audio can imitate official sources and that plausible but inaccurate AI-generated voting information can be especially harmful when it concerns dates, hours, and locations. The National Association of Secretaries of State's #TrustedInfo2026 campaign directs voters back to state and local election officials as trusted sources for registration, voting methods, security, and post-election procedures.
The campaign-law posture remains uneven. The Federal Election Commission declined in September 2024 to open a dedicated AI campaign-ad rulemaking, saying the existing fraudulent-misrepresentation statute is technology-neutral and can apply to AI-assisted media case by case. States have moved faster: the National Conference of State Legislatures' June 2026 tracker says 30 states have enacted laws regulating deepfakes in political messaging, mostly through disclosure rules, with a smaller number using prohibitions. That patchwork helps experimentation, but it gives voters, campaigns, robocall vendors, and platforms different rules by jurisdiction. It also runs into constitutional boundaries: NCSL notes that California's AB 2839 was struck down on First Amendment grounds in August 2025, and the federal district court permanently enjoined enforcement against the named plaintiffs.
The practical definition should be narrow enough to govern. A synthetic-voice election incident is not every generated narration in a campaign video. It is a voice generated or materially altered to sound like a real candidate, election official, campaign worker, local authority, journalist, celebrity, family member, or community leader, used in a political or voting context where the apparent speaker, channel, timing, or message could mislead people about authority, endorsement, voting procedure, or civic participation.
Why Voice Is Different
Voice is not just content. It is a social credential.
A written message says something. A familiar voice says something and also says: this body is present, this person stands behind the words, this relationship can be trusted. The voice carries age, region, emotion, hesitation, confidence, authority, and intimacy. It is why family scams work, why hostage hoaxes frighten parents, why celebrity endorsements matter, and why political robocalls use the candidate's own voice when they can.
AI voice cloning attacks that credential. It separates the acoustic sign of personhood from the person. Once the voice can be generated, the old test - "I heard them say it" - loses force. The user is moved into a stranger evidentiary world: did the person speak, did someone imitate them, did a campaign authorize it, did a model generate it, did the channel authenticate it, and who has the logs?
This is why synthetic voice belongs with broader concerns about synthetic media and election integrity. The model does not merely produce information about politics. It produces an apparent speaker. It turns identity into output. A voter does not encounter a claim with a footnote. They encounter a voice acting inside a channel designed for immediate response.
The Channel Is Part of the Lie
The New Hampshire calls were not only a voice clone. They were also a telecommunications event.
The FCC found that caller ID mattered because people use it to decide whether to answer and whether to trust the party on the other end of the call. In the New Hampshire case, the spoofed number was associated with a local political operative whose work supported a write-in effort for Biden. That made the deception layered: a national political voice, a local trusted signal, and a message counter to the actual effort behind the spoofed number.
That is the high-control interface problem. A voter does not inspect a media artifact in a neutral lab. They receive a call at a particular time, from an apparent number, in a private channel, with no immediate public fact-check alongside it. The interface narrows the situation. Answer, listen, believe, ignore, call back, tell a friend, stay home. The synthetic voice works because the channel makes verification awkward.
The FCC also proposed a $2 million fine against Lingo Telecom for caller ID authentication failures related to the same campaign. The agency said Lingo transmitted the calls with the highest level of caller ID attestation without making an effort to verify accuracy. The later FCC announcement about Kramer noted that Lingo agreed to pay a $1 million civil penalty and implement a compliance plan with STIR/SHAKEN caller ID authentication, Know Your Customer, and Know Your Upstream Provider requirements.
That makes the delivery chain part of the speech act. A voter hears a voice, but the institution has to audit a route: sponsor, vendor, voice generator, robocall platform, originating provider, upstream provider, caller ID attestation, traceback, complaints, corrections, and enforcement record. Synthetic audio governance fails if it treats the file as the whole incident.
That enforcement path is important. It treats the problem as infrastructure, not just speech. A synthetic voice becomes politically powerful when a network lets it arrive with false identity signals.
Law Finds the Old Handle
The fastest federal response did not require a new AI statute.
On February 8, 2024, the FCC released a declaratory ruling recognizing AI-generated voices as "artificial" under the Telephone Consumer Protection Act. The practical move was direct: the law already restricted artificial or prerecorded voice calls in many robocall contexts, so the FCC clarified that current AI voice technologies fit the existing category.
This is how institutions often govern new media first. They reach for the nearest old handle. The TCPA was not written for generative models, but it was written for unwanted automated calls. The Truth in Caller ID Act was not written for election deepfakes, but it was written for spoofed caller identification used to defraud, cause harm, or wrongfully obtain value.
That matters because the first legal question is rarely "what is AI?" It is usually "which institutional surface did AI enter?" If a generated voice enters a phone network, telecom law matters. If it enters a campaign ad, election law matters. If it enters a scam, fraud law matters. If it enters a likeness market, publicity rights matter. If it enters a platform, content policy and consumer protection matter.
AI governance becomes fragmented because AI is not one place. It is a way of manufacturing speech, identity, image, advice, evidence, and action across many existing institutions.
The Federal Gap
Federal election law has been slower.
The Congressional Research Service noted in 2023 that federal campaign finance law does not specifically regulate the use of AI in political campaign advertising. Existing provisions may matter, including the prohibition on fraudulent misrepresentation of campaign authority and disclaimer requirements for certain campaign communications, but federal campaign-finance law does not generally require a disclosure merely because campaign media used AI.
Public Citizen asked the Federal Election Commission to clarify that the law against fraudulent misrepresentation applies to deliberately deceptive AI-produced campaign communications. In September 2024, the FEC declined to initiate a dedicated rulemaking. Its notice said the statute is technology-neutral and applies on its face to all means of accomplishing the specified fraud, including AI-assisted media, so the Commission would proceed case by case rather than write a specific AI rule at that time.
There is a logic to that. Technology-specific rules can age badly. Overbroad rules can collide with satire, parody, political speech, documentary editing, accessibility tools, translation, and ordinary campaign production. Court challenges to state political-deepfake laws show the same boundary from the other side: rules aimed at deceptive synthetic media still have to survive narrow-tailoring, vagueness, compelled-disclosure, and standing problems. But a case-by-case posture also has a timing problem. Election deception works at campaign speed. Enforcement often works at institutional speed.
A synthetic robocall can matter on Sunday night. A docket can resolve months later.
State Patchwork
States moved faster than Congress.
The National Conference of State Legislatures tracks a growing list of state AI election and campaign laws, including disclosure rules and restrictions for materially deceptive media. Its June 2026 summary says most state laws use disclosure, while Minnesota and Texas prohibit publication of political deepfakes during defined pre-election windows and Maryland prohibits deceptive election deepfakes without a time limit. The pattern is visible in enacted laws as well as failed bills: states are testing disclaimers, metadata requirements, civil penalties, private actions, candidate-deepfake restrictions, criminal penalties, election-code remedies, and official correction duties.
This patchwork is useful and unstable. It lets states respond to local election administration realities. It also creates uneven rules for campaigns, platforms, robocall vendors, consultants, and voters. A synthetic audio ad may be legal in one place, require disclosure in another, and be prohibited near an election in a third. Meanwhile national campaigns, influencers, political action committees, and foreign operators do not respect state borders in the way election codes do.
The deeper problem is not only whether a state requires a label. Labels help when the audience sees them, understands them, trusts them, and receives them before the deception does its work. A disclosure at the end of an ad is different from a verified origin signal at the start of a call. A label on a public platform is different from a synthetic voice in a private message. A visible watermark is different from a voice clip passed through a group chat.
Election deepfake governance therefore cannot be only an ad-label regime. It has to include channel authentication, consent records, rapid correction pathways, voter-protection law, campaign accountability, platform response, telecom tracebacks, public ad libraries, incident records, and public education that does not teach people to distrust everything. The adjacent memory problem is developed in The Ad Library Becomes Political Memory: labels matter, but the public also needs a record of who paid, who saw, when it ran, and how it was corrected.
The Liar's Dividend
Synthetic media creates a second danger: real evidence becomes easier to deny.
Robert Chesney and Danielle Citron called this the liar's dividend. As people learn that deepfakes can be realistic, a public figure has more room to claim that genuine audio or video is fake. The epistemic harm therefore runs in both directions. False media can be believed. True media can be dismissed.
Voice cloning sharpens this. Voice often functions as proof of presence. When it becomes suspect, institutions need better ways to establish provenance without building a universal surveillance regime. A democracy cannot survive by making every citizen verify every clip with forensic tools. It also cannot survive if campaigns can simply say "AI" whenever inconvenient evidence appears.
The question becomes institutional: who can authenticate quickly, who can correct widely, who can preserve the record, who can punish impersonation, and who can do all that without giving the state or platforms unchecked power over political speech?
A Governance Standard
A serious synthetic-voice election regime should do more than say "deepfakes are bad."
First, require consent for candidate and official voice imitation in election communications. A campaign should not be able to use a recognizable synthetic voice of a candidate, election official, or local political actor without express authorization and a retained consent record.
Second, authenticate channels, not only files. Caller ID, sender identity, campaign accounts, ad libraries, and official election-alert channels need verification designed for rapid public use. Provenance should travel with the delivery system, not only with media metadata that can disappear. That includes the limits described in Content Provenance and Watermarking: provenance can help establish origin and edit history, but it does not prove truth.
Third, treat voter suppression as the core harm. The highest-risk content is not every AI-edited campaign message. It is communication that lies about when, where, how, or whether to vote; impersonates officials or candidates; fabricates emergencies; or targets groups with false voting instructions.
Fourth, preserve lawful political expression. Satire, parody, criticism, translation, accessibility, documentary editing, and clearly labeled creative work are not the same as fraudulent impersonation. Rules should focus on material deception, authorization, timing, targeting, and voter harm.
Fifth, build rapid correction channels. Election officials, telecom providers, platforms, campaigns, and newsrooms need escalation paths for false messages close to voting deadlines. A correction that arrives after polls close is historical documentation, not protection.
Sixth, hold intermediaries responsible for obvious failures. Voice vendors, robocall firms, telecom providers, political consultants, and ad platforms should not be able to profit from synthetic impersonation while treating identity checks as someone else's problem.
Seventh, distinguish proposed rules from live duties. Public guidance should say whether a requirement comes from statute, an FCC order, a consent decree, a state election code, platform policy, campaign self-regulation, or a pending proposal. Confusing a proposal with a live rule gives both voters and vendors a false map.
Eighth, preserve incident evidence. Election officials, campaigns, telecom carriers, robocall vendors, platforms, and journalists should retain the audio, call records, caller ID data, STIR/SHAKEN attestation, targeting assumptions, source accounts, complaints, corrections, and enforcement status needed to reconstruct what happened. This belongs with broader AI incident reporting, not only campaign comms.
Ninth, avoid totalizing identity infrastructure. The answer to synthetic voice cannot be a society where every political interaction requires centralized identity proof. The goal is authenticated authority where authority is claimed, not a checkpoint on ordinary political speech. The same boundary appears in The Voiceprint Becomes the Password: a voice can be meaningful without being allowed to function as a master credential.
Tenth, make AI contact disclosure operational. A disclosure that a message used synthetic audio should be audible before the persuasive payload, preserved in the archive, visible in call or ad records, and connected to a sponsor that can be reached. The useful standard is closer to AI Contact and Bot Disclosure than to a tiny end-card label.
What This Changes
The synthetic voice is a small machine that attacks a large habit: trusting that a person is present when their voice is present.
That habit is older than mass media. It belongs to family, politics, ritual, testimony, command, confession, and care. To hear a voice is to feel a human source behind language. Voice cloning breaks that association without making people immune to it. The nervous system still reacts before the verification system wakes up.
This is recursive reality in election form. A model generates a voice. The voice enters a trusted channel. The channel produces behavior or doubt. The behavior or doubt becomes news. The news trains future suspicion. The next real recording is heard inside the memory of possible fakery.
The answer is not panic about every generated sound. Synthetic voice has legitimate uses: accessibility, translation, assistive communication, creative production, and authorized campaign outreach. The danger begins when the model speaks as someone who did not speak, through a channel that falsely certifies the speaker, at a moment when listeners have little time to check.
The ballot is a trust machine. It depends on citizens receiving reliable instructions, recognizing lawful authority, and believing that participation still connects them to a shared public world. Synthetic voice attacks that machine by making authority audible without being present.
The governance task is concrete: prove authorization where voice claims authority, authenticate channels that carry urgent civic instructions, punish impersonation that suppresses participation, and preserve enough public memory that real speech can still be defended against fake denial.
A democracy does not need every voice to be natural. It needs to know when a voice is authorized to speak.
Source Discipline
The evidence for this essay should be separated by institution. FCC orders establish TCPA and Truth in Caller ID enforcement; the FCC's 2024 AI-generated-call NPRM establishes proposals and questions, not final duties. FEC sources establish the Commission's current federal campaign-law posture on fraudulent misrepresentation, not a general federal AI-labeling rule. EAC and NASS pages are election-administration guidance and public-information strategy; they are not incident findings. NCSL is a legislative tracker and summary, not a court, regulator, or election official.
For any reported synthetic-voice election incident, the record should identify the artifact, transcript, claimed speaker, actual authorizer, sponsor, vendor, phone number, caller ID, attestation path, target list, jurisdiction, timing, correction, complaint route, and legal status. A viral recording, a campaign denial, an FCC order, a court filing, a state election-office warning, and a platform label are different evidence types. Treating them as one generic "deepfake source" is how public memory gets muddied.
Provenance tools should be cited with their limits. NIST describes content transparency methods across authentication, provenance, watermarking, detection, testing, and auditing. C2PA 2.4 added an AI Disclosure Assertion and broader embedding support, but a credential can document origin or edit history without proving truth, consent, legality, or electoral impact. The same source discipline appears in Claim Hygiene Protocol, Research and Editorial Integrity, and Provenance and Content Credentials.
Sources
- Federal Communications Commission, Declaratory Ruling, Implications of Artificial Intelligence Technologies on Protecting Consumers from Unwanted Robocalls and Robotexts, FCC 24-17, adopted February 2, 2024 and released February 8, 2024.
- Federal Communications Commission, Forfeiture Order, In the Matter of Steve Kramer, FCC 24-104, adopted September 26, 2024 and released September 30, 2024.
- Federal Communications Commission, FCC Proposes $2 Million Fine Against Carrier for Caller ID Authentication Failures, May 23, 2024.
- Federal Communications Commission, FCC Fines Man Behind Election Interference Scheme $6 Million, September 26, 2024.
- Federal Communications Commission, FCC Settles Case Against Provider That Transmitted Spoofed AI-Generated Robocalls for Election Interference in New Hampshire, August 21, 2024.
- Federal Communications Commission, Implications of Artificial Intelligence Technologies on Protecting Consumers from Unwanted Robocalls and Robotexts, Notice of Proposed Rulemaking and Notice of Inquiry, FCC 24-84, adopted August 7, 2024 and released August 8, 2024.
- U.S. District Court for the District of New Hampshire, League of Women Voters of New Hampshire et al. v. Steve Kramer et al., Order, Opinion No. 2024 DNH 043.
- U.S. Election Assistance Commission, Artificial Intelligence (AI) and Election Administration, reviewed June 16, 2026.
- National Association of Secretaries of State, #TrustedInfo2026, reviewed June 16, 2026.
- Congressional Research Service, Artificial Intelligence (AI) in Federal Election Campaigns: Legal Background and Constitutional Considerations for Legislation, updated August 17, 2023.
- Federal Election Commission, Commission approves Notification of Disposition, Interpretive Rule on artificial intelligence in campaign ads, September 27, 2024.
- Federal Register, Artificial Intelligence in Campaign Ads, Notice 2024-23, September 26, 2024.
- Robert Chesney and Danielle K. Citron, Deep Fakes: A Looming Challenge for Privacy, Democracy, and National Security, California Law Review, 2019.
- National Conference of State Legislatures, Artificial Intelligence (AI) in Elections and Campaigns, updated June 5, 2026.
- U.S. District Court for the Eastern District of California, Kohls v. Bonta, Order Granting Plaintiffs' Motion for Summary Judgment (AB 2839), August 29, 2025.
- NIST, Reducing Risks Posed by Synthetic Content: An Overview of Technical Approaches to Digital Content Transparency, NIST AI 100-4, November 20, 2024, updated April 8, 2026.
- C2PA, Content Credentials: C2PA Technical Specification 2.4, April 2026.
- Related references: Synthetic Media and Deepfakes, Election Integrity and AI, Content Provenance and Watermarking, Digital Identity, The Voiceprint Becomes the Password, The Ad Library Becomes Political Memory, The Takedown Button Becomes Synthetic Media Governance, AI Contact and Bot Disclosure, AI Incident Reporting, and Political Impact.