HTML cleaner from lxml project
lxml_html_clean
Motivation
This project was initially a part of lxml. Because the HTML cleaner is blocklist-based by design, many reports of possible security vulnerabilities were filed against lxml, which made the project problematic for security-sensitive environments. Therefore, we decided to extract the problematic part into a separate project.
Installation
You can install this project directly via pip install lxml_html_clean or as an extra of lxml via pip install lxml[html_clean]. Both ways install this project together with lxml itself.
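Since either command should leave both packages available, a quick post-install check can confirm that. The following is only a minimal sketch: the helper name check_installed is hypothetical, and it assumes the import names are lxml and lxml_html_clean.

```python
from importlib.util import find_spec


def check_installed(module_names):
    """Return a mapping of module name -> True if the module can be located."""
    # find_spec returns None for a top-level module that is not installed.
    return {name: find_spec(name) is not None for name in module_names}


if __name__ == "__main__":
    # Assumed import names for the two packages mentioned above.
    for name, present in check_installed(["lxml", "lxml_html_clean"]).items():
        print(f"{name}: {'installed' if present else 'missing'}")
```

If either package is reported as missing, rerunning one of the pip commands above in the same environment is the likely fix.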
Security
For discussions regarding security-related issues or any sensitive reports, please contact us privately. You can reach out to lbalhar(at)redhat.com or frenzy.madness(at)gmail.com to ensure your concerns are addressed confidentially and securely.
Documentation
https://lxml-html-clean.readthedocs.io/
License
BSD-3-Clause
Hashes for lxml_html_clean-0.2.2-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 177ebe822b39d1b68df7c0c34ba005cb087b23d3791dae87efb3a2bb162ef398
MD5 | 610350871bec626aa7795549b9945da7
BLAKE2b-256 | 0f203581a4c3bac717ed6d7e832cc40623b18c9d4ba53fc1e130d7d1a083c9e2
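The digests above can be used to check a downloaded wheel before installing it. The sketch below compares a file's SHA256 digest against the value from the table. The helper names sha256_of and matches_expected are hypothetical, and the file path in the usage section assumes the wheel was saved to the current directory.

```python
import hashlib

# SHA256 digest listed above for lxml_html_clean-0.2.2-py3-none-any.whl.
EXPECTED_SHA256 = "177ebe822b39d1b68df7c0c34ba005cb087b23d3791dae87efb3a2bb162ef398"


def sha256_of(path, chunk_size=65536):
    """Compute the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest equals the expected hex string."""
    return sha256_of(path) == expected.strip().lower()


if __name__ == "__main__":
    # Assumed location of the downloaded wheel; adjust as needed.
    wheel = "lxml_html_clean-0.2.2-py3-none-any.whl"
    print("OK" if matches_expected(wheel) else "MISMATCH")
```

pip can perform a similar check itself when a requirements file pins hashes and is installed with --require-hashes, which may be preferable to a hand-rolled script in automated setups.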