Scrapy project for feeds into INSPIRE-HEP (http://inspirehep.net).
Project description
HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.
The project is currently in early stage of development.
See full documentation at http://pythonhosted.org/hepcrawl
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
hepcrawl-13.0.40.tar.gz
(965.6 kB
view hashes)
Built Distribution
Close
Hashes for hepcrawl-13.0.40-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 035c52eb6eefdf2019542f1b341d588f16ad5f12b3d645fb961a8445f8c40aab |
|
MD5 | e64c2a84d9b6be6bbdadd6c8a71d5a2a |
|
BLAKE2b-256 | 2fde872e8e627833847d93f513eb323f0bee7e344b337779eda2402c7ea18b9e |