Resiliparse Extraction Utilities

The Extraction Utilities module is a heavily performance-optimized library for extraction of structural or semantic information from noisy raw web data for further processing, such as (main) content extraction / boilerplate removal, schema extraction, general web data cleansing, and more.