Scrapy project for feeds into INSPIRE-HEP (http://inspirehep.net).
Project description
HEPcrawl is a harvesting library based on Scrapy (http://scrapy.org) for INSPIRE-HEP (http://inspirehep.net) that focuses on automatic and semi-automatic retrieval of new content from all the sources the site aggregates. In particular content from major and minor publishers in the field of High-Energy Physics.
The project is currently in early stage of development.
See full documentation at http://pythonhosted.org/hepcrawl
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
hepcrawl-13.0.78.tar.gz
(1.1 MB
view hashes)
Built Distribution
Close
Hashes for hepcrawl-13.0.78-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7ec17095fa4a7eea3f3c09eeaa3e7e3e568a694ed1e6d277f302252df75a3ff |
|
MD5 | 9d0cf6f092a96cda3c506343280631c4 |
|
BLAKE2b-256 | 4f8b530e10aa8139a205ff2c32c5fe583e60d1b8c55ba6eb022ec38f3314d2cd |