ckanext-resource_indexer
Index content of resources in addition to metadata.
Important
The master branch of this repository is unstable. Always install ckanext-resource-indexer with pip (pip install ckanext-resource-indexer) unless you intend to modify the extension's source code.
Config Settings
# The size threshold (MB) for remote resources
# (optional, default: 4).
ckanext.resource_indexer.max_remote_size = 4

# Make an attempt to index remote files (fetch into tmp folder
# using URL)
# (optional, default: false).
ckanext.resource_indexer.allow_remote = 1

# List of lowercased resource formats that should be
# indexed. Currently only `pdf` and `txt` supported
# (optional, default: None)
ckanext.resource_indexer.indexable_formats = txt pdf

# Field containing data extracted from the file in addition to the
# general `text` field
# (optional, default: None)
ckanext.resoruce_indexer.index_field = extras_res_attachment

# Boost matches by resource's content. Set values greater than 1 in order
# to promote such matches and value between 0 and 1 in order to put such
# matches further in search results. Works only when using custom index
# field (ckanext.resoruce_indexer.index_field)
# (optional, default: 1)
ckanext.resoruce_indexer.search_boost = 0.5
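To illustrate how the first three settings could interact, here is a minimal sketch. It is not the extension's actual code: the helper names `as_bool` and `should_index`, the plain-dict `config`, and the exact comparison (size less than or equal to the threshold, applied to remote files only) are assumptions made for illustration.

```python
MB = 1024 * 1024


def as_bool(value):
    """Interpret common truthy config strings (hypothetical helper)."""
    return str(value).strip().lower() in {"1", "true", "yes", "on"}


def should_index(res_format, size_bytes, is_remote, config):
    """Decide whether a resource's content would be indexed.

    Assumption: the size threshold only applies to remote files,
    as the setting's description suggests.
    """
    # Formats are configured as a whitespace-separated, lowercased list.
    formats = config.get("ckanext.resource_indexer.indexable_formats", "").split()
    if res_format.lower() not in formats:
        return False
    if not is_remote:
        return True
    # Remote files are skipped unless explicitly allowed.
    if not as_bool(config.get("ckanext.resource_indexer.allow_remote", "false")):
        return False
    max_mb = float(config.get("ckanext.resource_indexer.max_remote_size", 4))
    return size_bytes <= max_mb * MB


# Sample values mirroring the settings above.
config = {
    "ckanext.resource_indexer.max_remote_size": "4",
    "ckanext.resource_indexer.allow_remote": "1",
    "ckanext.resource_indexer.indexable_formats": "txt pdf",
}

print(should_index("PDF", 10 * MB, False, config))  # local pdf
print(should_index("pdf", 5 * MB, True, config))    # remote pdf over the threshold
print(should_index("txt", 1 * MB, True, config))    # remote txt under the threshold
```

With the sample values, a remote file is considered only when its format is listed, remote indexing is enabled, and its size does not exceed 4 MB.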
Development Installation
To install ckanext-resource_indexer for development, activate your CKAN virtualenv and do:
git clone https://github.com/DataShades/ckanext-resource_indexer.git
cd ckanext-resource_indexer
python setup.py develop
Add resource_indexer (and optionally pdf_resource_indexer or plain_resource_indexer) to the ckan.plugins setting in your CKAN config file (by default, the config file is located at /etc/ckan/default/production.ini).
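As a rough illustration of the resulting setting, the sketch below reads a config file's text and prints the ckan.plugins line with the extension's plugins appended. The function name `merged_plugins_line` and the sample plugins `stats` and `text_view` are placeholders chosen for this example; the script only prints the line and leaves the file itself untouched, so the edit is still made by hand.

```python
import configparser


def merged_plugins_line(ini_text, extra, section="app:main"):
    """Return a ckan.plugins line with `extra` plugins appended.

    Hypothetical helper: existing plugins keep their order and
    duplicates are skipped.
    """
    # Interpolation is disabled so values such as %(here)s are left alone.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)
    current = parser.get(section, "ckan.plugins", fallback="").split()
    merged = current + [name for name in extra if name not in current]
    return "ckan.plugins = " + " ".join(merged)


# Sample config text; a real file would be read from disk instead.
sample = """
[app:main]
ckan.plugins = stats text_view
"""

print(merged_plugins_line(sample, ["resource_indexer", "pdf_resource_indexer"]))
# prints: ckan.plugins = stats text_view resource_indexer pdf_resource_indexer
```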
Running the Tests
To run the tests, do:
pytest --ckan-ini test.ini
AWS Linux
sudo yum install -y pulseaudio-libs-devel python-devel libxml2-devel libxslt-devel poppler poppler-utils poppler-cpp-devel