HTML parser based on the WHATWG Web Applications 1.0 ("HTML5") specification
Project description
HTML parser designed to follow the HTML5 specification. The parser is designed to handle all flavours of HTML and parses invalid documents using well-defined error handling rules compatible with the behaviour of major desktop web browsers.
The parser outputs a tree structure; the current release supports the DOM, ElementTree, lxml and BeautifulSoup tree formats, as well as a simple custom format.
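As an illustration of the tree output described above, here is a minimal sketch that parses deliberately invalid markup into a DOM tree. It assumes an installed html5lib whose `html5lib.parse` function accepts `treebuilder="dom"`; the helper name `paragraph_texts` is hypothetical and not part of the library.

```python
def paragraph_texts(markup):
    """Parse markup with html5lib and return the text of each <p> element."""
    # Third-party import kept inside the function so the sketch only
    # requires html5lib at call time (assumed to be installed).
    import html5lib

    # treebuilder="dom" asks for an xml.dom.minidom document.
    document = html5lib.parse(markup, treebuilder="dom")

    texts = []
    for paragraph in document.getElementsByTagName("p"):
        # Collect only the direct text children of each paragraph.
        parts = [
            child.data
            for child in paragraph.childNodes
            if child.nodeType == child.TEXT_NODE
        ]
        texts.append("".join(parts))
    return texts


if __name__ == "__main__":
    # The <p> elements are never closed; the parser's error handling
    # is expected to close them implicitly, as a browser would.
    print(paragraph_texts("<p>One<p>Two"))
```

Other tree formats would be selected the same way by passing a different `treebuilder` name, subject to what the installed release supports.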
Download files
Source Distribution
html5lib-0.95.tar.gz (222.6 kB)
File details
Details for the file html5lib-0.95.tar.gz.
File metadata
- Download URL: html5lib-0.95.tar.gz
- Upload date:
- Size: 222.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | a6e707d9cb17c8bf1e553713ad14b31274a81d5c0ce0fce21b02936d0efd7dbb
MD5 | fe607f9917d81763e842f818f23464ee
BLAKE2b-256 | 87f01f5a9bff9a082e19d1fd86d2973d9e00ff931032946d100d612ae76b0d5d
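
The digests above can be used to check a downloaded copy of the archive. The following is a minimal sketch using only the Python standard library; the function names and the local file path are assumptions for illustration, and the expected value is the SHA256 digest listed in the table.

```python
import hashlib
import hmac

# SHA256 digest copied from the table above.
EXPECTED_SHA256 = "a6e707d9cb17c8bf1e553713ad14b31274a81d5c0ce0fce21b02936d0efd7dbb"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at path, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare the file's digest with the expected hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected.lower())


if __name__ == "__main__":
    # Hypothetical location of the downloaded archive.
    archive = "html5lib-0.95.tar.gz"
    print("digest matches" if matches_expected(archive) else "digest MISMATCH")
```

SHA256 is used here rather than MD5 because MD5 is no longer considered collision resistant; the same pattern would work with `hashlib.md5` if only that digest were available.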