feedparser but faster and worse
Project description
Speedparser is a black-box reimplementation of the Universal Feed Parser. It uses lxml for feed parsing and for optional HTML cleaning. Its compatibility with feedparser is very good for a strict subset of fields, but poor for fields outside that subset. See tests/speedparsertests.py for more information on which fields are more or less compatible and which are not.
On an Intel(R) Core(TM) i5 750 running on a single core, feedparser managed 2.5 feeds/sec on the test feed set (roughly 4200 “feeds” in tests/feeds.tar.bz2), while speedparser managed around 65 feeds/sec with HTML cleaning on and 200 feeds/sec with cleaning off.
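To put those rates in perspective, the short sketch below works out the implied speedup and the approximate wall-clock time for the whole test set. It uses only the figures quoted above. The constant and function names are illustrative and not part of speedparser, and the feed count is the approximate 4200 given in the text, so the timings are estimates rather than measurements.

```python
# Throughput figures quoted above (feeds per second, single core).
FEEDPARSER_RATE = 2.5
SPEEDPARSER_CLEAN_RATE = 65.0   # HTML cleaning on
SPEEDPARSER_RAW_RATE = 200.0    # HTML cleaning off
FEED_COUNT = 4200               # approximate size of the test set


def speedup(rate, baseline=FEEDPARSER_RATE):
    """Return how many times faster `rate` is than `baseline`."""
    return rate / baseline


def seconds_for(rate, feeds=FEED_COUNT):
    """Return the time in seconds needed to parse `feeds` documents at `rate`."""
    return feeds / rate


for label, rate in [
    ("feedparser", FEEDPARSER_RATE),
    ("speedparser, cleaning on", SPEEDPARSER_CLEAN_RATE),
    ("speedparser, cleaning off", SPEEDPARSER_RAW_RATE),
]:
    print(f"{label}: {speedup(rate):.0f}x baseline, {seconds_for(rate):.0f} s for the set")
```

By this arithmetic speedparser is roughly 26 times faster than feedparser with cleaning on and 80 times faster with cleaning off: about 28 minutes for the full set with feedparser, against about 65 seconds and 21 seconds respectively.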
Download files
Source Distribution
File details
Details for the file speedparser-0.1.tar.gz.
File metadata
- Download URL: speedparser-0.1.tar.gz
- Upload date:
- Size: 14.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | 8fdb3c239a8088ac8a03ae50f34e7865097eb76c0fcf437eb4df3d44042040c0 |
MD5 | 6aed805fa387a304b8b39ace45e07d04 |
BLAKE2b-256 | abd51b66e8d78258bbd1428aff149588bdb12561cc02d7ea5749bc14f370cf37 |