verbaendeliste-bundestag · PyPI

Some features may not work without JavaScript. Please try enabling it if you encounter problems.

Parse PDF-to-XML converted lobby list of German Bundestag

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Environment
- Console
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- OS Independent
Programming Language
- Python :: 2.7

Project description

Use pdftohtml to get an XML file from the pdf.

pdftohtml -xml input.pdf output.xml

Then use the extractor with first and last relevant page number to convert to parsed JSON:

python extract_lobby.py 4 690 < lobbylist.xml > lobbylist.json

Here is [extracted JSON (15th of June 2012)](http://stefanwehrmeyer.com/projects/verbaendeliste/20120615.json).

License: MIT-License

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Environment
- Console
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- OS Independent
Programming Language
- Python :: 2.7

Release history Release notifications | RSS feed

This version

0.1.0

Feb 21, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

verbaendeliste-bundestag-0.1.0.tar.gz (5.6 kB view details)

Uploaded Feb 21, 2014 Source

Built Distribution

verbaendeliste_bundestag-0.1.0-py2.py3-none-any.whl (6.3 kB view details)

Uploaded Feb 21, 2014 Python 2 Python 3

File details

Details for the file verbaendeliste-bundestag-0.1.0.tar.gz.

File metadata

Download URL: verbaendeliste-bundestag-0.1.0.tar.gz
Upload date: Feb 21, 2014
Size: 5.6 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for verbaendeliste-bundestag-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`8be6cd3c9afe8a097e2ec9f42b45163b257cdb01a129fe84efbe7d5642989f3d`
MD5	`c5a8f33a6de4b4718094a42cea20c97e`
BLAKE2b-256	`933b30c36f7c73c7e8e7a6ae9c36fbe771b0f7192173de1323f2faa1fd1a86da`

See more details on using hashes here.

File details

Details for the file verbaendeliste_bundestag-0.1.0-py2.py3-none-any.whl.

File metadata

Download URL: verbaendeliste_bundestag-0.1.0-py2.py3-none-any.whl
Upload date: Feb 21, 2014
Size: 6.3 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No

File hashes

Hashes for verbaendeliste_bundestag-0.1.0-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`7f03ba7d810b2f70759afb85168045a5458814f740b6876bf554af9c4f2570f2`
MD5	`c4cbfbb7be7f58a195a726f64d9e4a50`
BLAKE2b-256	`d06539a2d87598578f6be3c62af800dd0ae97c606ab9a7787632f51f123741d1`

See more details on using hashes here.

Supported by

AWS

AWS Cloud computing and Security Sponsor

Datadog

Datadog Monitoring

Fastly

Google

Google Download Analytics

Microsoft

Microsoft PSF Sponsor

Pingdom

Pingdom Monitoring

Sentry

Sentry Error logging

StatusPage

StatusPage Status page