.. _extract-manual: Resiliparse Extraction Utilities ================================ The Extraction Utilities module is a heavily performance-optimized library for extraction of structural or semantic information from noisy raw web data for further processing, such as (main) content extraction / boilerplate removal, schema extraction, general web data cleansing, and more. .. toctree:: :maxdepth: 3 :caption: Extraction Utilities extract/html2text