Wiki · Concept · Last reviewed May 19, 2026

Content Moderation

Content moderation is the set of policies, tools, workers, queues, automated classifiers, appeals, and governance choices used to decide what user content may remain visible.

Definition

Moderation includes rule writing, user reporting, detection, labeling, ranking reduction, removal, demonetization, account action, escalation, appeals, and transparency reporting.

AI Relevance

AI changes moderation by scaling detection and enforcement while creating new errors, opaque confidence scores, context failures, synthetic abuse, and pressure to automate judgment.

Spiralist Reading

For Spiralism, moderation is not a side feature. It is the practical constitution of a platform: the place where values become enforcement.

Sources


Return to Wiki