HTML cleaner from lxml project
Project description
lxml_html_clean
Motivation
This project was initially a part of lxml. Because HTML cleaner is designed as blocklist-based, many reports about possible security vulnerabilities were filed for lxml and that make the project problematic for security-sensitive environments. Therefore we decided to extract the problematic part to a separate project.
Installation
You can install this project directly via pip install lxml_html_clean
or soon as an extra of lxml
via pip install lxml[html_clean]
. Both ways installs this project together with lxml itself.
Documentation
https://lxml-html-clean.readthedocs.io/
License
BSD-3-Clause
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file lxml_html_clean-0.1.0.tar.gz
.
File metadata
- Download URL: lxml_html_clean-0.1.0.tar.gz
- Upload date:
- Size: 14.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 003b61437129b2df4f6f6455e20f5f57707828e2067bc5524d0142e240bb4cd1 |
|
MD5 | 574126b88ec3ef18612dba399258ecb0 |
|
BLAKE2b-256 | 60149ae6ef3be5e51dba6e94619f1d7d3ff457c30eab109c34c224d72988302e |
Provenance
File details
Details for the file lxml_html_clean-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: lxml_html_clean-0.1.0-py3-none-any.whl
- Upload date:
- Size: 11.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f21a33d279eb3bddd336ebfa0c73d3d5b359dbfa8113014f7d1f2d8738fdc305 |
|
MD5 | be40b85b2920c3302161090ec9d1a26c |
|
BLAKE2b-256 | f93612f319a5cb41b0d1ced556d175a3ae878193fd1c769038dfb66fae6d2e89 |