HTML parser based on the WHAT-WG Web Applications 1.0("HTML5") specifcation
Project description
HTML parser designed to follow the WHATWG HTML5 specification. The parser is designed to handle all flavours of HTML and parses invalid documents using well-defined error handling rules compatible with the behaviour of major desktop web browsers.
Output is to a tree structure; the current release supports output to DOM, ElementTree, lxml and BeautifulSoup tree formats as well as a simple custom format
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
html5lib-0.11.1.tar.gz
(156.1 kB
view details)
File details
Details for the file html5lib-0.11.1.tar.gz
.
File metadata
- Download URL: html5lib-0.11.1.tar.gz
- Upload date:
- Size: 156.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d2f6af2211cbd643ea7b7337badc001816c75a49d97207b9700da3a7226c1cd6 |
|
MD5 | eb8db849473ac024110136f947dae5e4 |
|
BLAKE2b-256 | 061c8281e3ba695db6d01b5deda2c232fff897094f9d5bdef9eb5b8972d6dc1c |