The Elite Network Becomes the Knowledge Graph
Kirill Solovev and Jana Lasser's June 2026 arXiv paper shows how multilingual news can be converted into signed, temporal graphs of political-elite relationships. The result is a research tool, and also a warning about machine-readable power maps.
From News to Network
The paper, arXiv:2606.27347 [cs.CL], is titled Mapping Political-Elite Networks in Europe with a Multilingual Joint Entity-Relation Extraction Pipeline. arXiv lists Kirill Solovev and Jana Lasser as the authors and records version 1 on June 25, 2026.
This is a fresh companion to the political-ad memory essay, the partisan-persona essay, and the data-cartels review. Those pages ask how political influence is stored, targeted, or monopolized. This paper asks how political relationships themselves can be extracted from multilingual news and made graph-readable.
The move is simple in outline and politically sharp in consequence. A newspaper archive stops being only a record of articles. It becomes an edge factory: who worked with whom, who opposed whom, who entered which party, which firms sat near which state actors, and when those relationships appeared in public text.
What the Pipeline Builds
Solovev and Lasser present a modular, open-weight pipeline for joint entity-relation extraction. The system chunks news articles, detects entities, links mentions to Wikidata identifiers, and then extracts typed relationships with an ontology-constrained mixture-of-experts language model. The output is a signed, temporal knowledge graph rather than a bag of co-occurrences.
The ontology is implemented in SKOS and contains 109 entity types and 99 relationship types. Each relation carries a subject, object, type, temporal scope, article-grounded sign, and Wikidata QIDs when linking succeeds. That design matters because a one-time meeting, an ongoing office-holding relationship, and an adversarial legal relationship should not collapse into the same undated edge.
The linking layer is also part of the argument. The paper builds a Wikidata index with 14.8 million entities and 37 million aliases across 36 languages. Its three-stage linker uses exact matching, fuzzy matching, and dense vector search. The practical problem is not exotic: without entity linking, "Tusk," "Donald Tusk," and an inflected Polish form can become separate nodes and fragment the political map.
Validation and Public Record
The evaluation uses a 3,491-relation gold standard across 502 Polish articles. For a 100-article spot-check, the paper reports textual correctness between 68.2% under a strict rubric and 93.7% under a lenient rubric. The authors are careful that this band is not a normal relation-extraction benchmark score; it measures whether extracted relations are grounded in the article text under two adjudication strictness levels.
The paper then tests whether the graph recovers structures fixed outside the model. In Austria, it processes 499,851 Factiva articles from 2005 to 2017 and reports a graph with 402,316 article nodes, 616,623 entity nodes, and 1,369,655 relations. The case study reconstructs the lifecycle of the Alliance for the Future of Austria, including internal fractures and later paths of associated personnel.
In Poland, the paper runs the pipeline over roughly half a million articles from 1997 to 2025. It maps firm-level overlap around state-owned enterprises and the signed Civic Platform versus Law and Justice cleavage. The point is not that every edge is final truth. The point is that the system recovers macro-structures whose historical reference is not generated by the same model.
The Governance Problem
The Spiralist lesson is that a knowledge graph can make power inspectable and easier to overtrust at the same time. As a research instrument, the pipeline can support accountability work by turning public political text into comparable relational data. As an interface, it can tempt institutions to treat machine-extracted edges as administrative facts.
The paper names this dual use directly. It says a system that reconstructs interpersonal network ties is also a surveillance capability. Its intended use is bounded to already-published news about public political and economic actors, and the authors say individual edges must remain machine-extracted claims until manually checked against the source text.
That caution should travel with every graph. A node can look authoritative because it has a QID. An edge can look stable because it has a type and date. A network can look objective because it is too large to read by hand. But the graph is still an argument made from source selection, ontology design, model extraction, linker coverage, and adjudication rules.
Limits That Matter
The paper's relation gold standard and spot-check are LLM-based rather than human-grounded. Entity detection has a separate 100-article human-annotated German gold standard, but relation evaluation still carries the risk of correlated model errors. The authors counterbalance that with public-record case studies, not with a claim of perfect edge reliability.
Entity linking is another boundary. Only 21.9% of Austrian entity nodes and 18.4% of Polish entity nodes resolve to Wikidata QIDs, although relation-level fill rates are higher at 52-55%. Mid-tier economic elites remain a coverage problem. The paper also notes that news corpora over-represent prominent actors and dramatic events, so cross-national comparison requires normalization against article volume.
Reproducibility is partly solved and partly inherited from the archive economy. The primary corpus is Dow Jones Factiva, a licensed and non-redistributable database, but the authors use Infini-News, an open-access corpus of more than 1.3 billion Common Crawl News articles, for the golden sample and report that the pipeline can reproduce end to end without commercial subscriptions or proprietary API access.
Governance Standard
Any machine-built political network should publish its graph record: corpus source, date range, ontology version, entity-linking substrate, model versions, extraction prompts or schemas, adjudication procedure, edge provenance, QID coverage, temporal assumptions, error bands, release rules, and person-level verification policy.
The rule is simple: when the elite network becomes the knowledge graph, every edge must remain attached to its source text, uncertainty, and permitted use. Otherwise the map of power becomes a new power instrument.
Sources
- Kirill Solovev and Jana Lasser, Mapping Political-Elite Networks in Europe with a Multilingual Joint Entity-Relation Extraction Pipeline, arXiv:2606.27347 [cs.CL], version 1 submitted June 25, 2026.
- arXiv PDF: Mapping Political-Elite Networks in Europe, reviewed for authorship, date, architecture, ontology, Wikidata linking, validation, Austrian and Polish case studies, reproducibility, limitations, ethical considerations, and data availability.
- Related pages: The Ad Library Becomes Political Memory, The Partisan Persona Becomes the Persuasion Test, Data Cartels and the Information Monopoly Behind AI, and The Real-Time Crime Center Becomes the City Dashboard.