Structured EVTL pipeline for reliable extraction and transformation of data from HTML web pages.
-
Updated
Apr 1, 2026 - Python
Structured EVTL pipeline for reliable extraction and transformation of data from HTML web pages.
Structured EVTL pipeline for reliable ingestion and transformation of JSON data from web APIs.
This project implements a modular EVTL pipeline in Python to retrieve JSON data from web APIs and process it into structured formats for analysis.
This project builds a web scraping pipeline that extracts, cleans, and structures HTML data using Python (requests, BeautifulSoup). It follows an EVTL workflow to prepare text for NLP analysis.
Web Mining and Applied NLP
This project processes JSON data using EVTL to clean, validate, and transform structured web data.
Add a description, image, and links to the evtl topic page so that developers can more easily learn about it.
To associate your repository with the evtl topic, visit your repo's landing page and select "manage topics."