This blueprint extracts out title, description and body from html either via xpath or by automatic cluster analysis
Project description
Introduction
- transmogrify.htmlcontentextractor
This blueprint extracts out title, description and body from html either via xpath or by automatic cluster analysis
Changelog
1.0b2 (2010-11-09)
Put condition on autofinder so can be turned off
1.0b1 (2010-11-03)
ignore already found items. better debug [“Dylan Jay”]
skip templates if item already parsed [“Dylan Jay”]
print automaticly found XPaths [“Dylan Jay”]
make text fields strip tail text [“Vitaliy Podoba”]
1.0dev (2010-03-22)
split the auto templatefinder out to it’s own blueprint [“Dylan Jay”]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Close
Hashes for transmogrify.htmlcontentextractor-1.0b2.zip
Algorithm | Hash digest | |
---|---|---|
SHA256 | 86d54156d3288be7f4d3ce89afc23268303c1413c839bfc10c09bc18dbe389c0 |
|
MD5 | e25106a641142bd4fd819e08ee7ea063 |
|
BLAKE2b-256 | 0c8f9ee2aa426bb904ca3d41771e6be823a9d17b429c840548aba9b44994c008 |