Scrapy project for feeds into INSPIRE-HEP (http://inspirehep.net).
Project description
HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.
The project is currently in early stage of development.
See full documentation at http://pythonhosted.org/hepcrawl
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
hepcrawl-13.0.70.tar.gz
(1.1 MB
view hashes)
Built Distribution
Close
Hashes for hepcrawl-13.0.70-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9a02a0a428c6d549cecb14f6bee5be6aab4603f4709c92fd37b9a8e1d8a30cd4 |
|
MD5 | 0418df031e3c8bc857782b860ae9833f |
|
BLAKE2b-256 | 837971eb30e71a190d77bba8c2266e88ab7edf92fc562dbfc324b5482203929d |