ckanext-resource_indexer

Index content of resources in addition to metadata.

Important

The master branch of this repository is unstable. Always install ckanext-resource-indexer with pip (pip install ckanext-resource-indexer) unless you intend to modify the extension’s source code.

Config Settings

# The size threshold (MB) for remote resources
# (optional, default: 4).
ckanext.resource_indexer.max_remote_size = 4

# Attempt to index remote files (fetched into a tmp folder
# using the URL)
# (optional, default: false).
ckanext.resource_indexer.allow_remote = 1

# List of lowercased resource formats that should be
# indexed. Currently only `pdf` and `txt` are supported
# (optional, default: None)
ckanext.resource_indexer.indexable_formats = txt pdf

# Field containing data extracted from the file in addition to the
# general `text` field
# (optional, default: None)
ckanext.resoruce_indexer.index_field = extras_res_attachment

# Boost matches by resource's content. Set a value greater than 1 to
# promote such matches, or a value between 0 and 1 to push such
# matches further down in search results. Works only when using a custom
# index field (ckanext.resoruce_indexer.index_field)
# (optional, default: 1)
ckanext.resoruce_indexer.search_boost = 0.5
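
How these settings could interact is sketched below in Python. This is an illustration only, not the extension's actual code: the `IndexerSettings` class, its methods, and the `MB` constant are hypothetical names. The sketch assumes that the size threshold applies only to remote resources, that it is inclusive, and that it is counted in binary megabytes. The `field^boost` string uses the standard Solr boost syntax; whether the extension builds its query this way is likewise an assumption.

```python
from dataclasses import dataclass
from typing import Mapping, Optional, Tuple

MB = 1024 * 1024  # assumption: the threshold is counted in binary megabytes


def _as_bool(value: str) -> bool:
    """Interpret the truthy spellings commonly used in INI files."""
    return value.strip().lower() in {"1", "true", "yes", "on"}


@dataclass(frozen=True)
class IndexerSettings:
    """Hypothetical container mirroring the documented options and defaults."""

    max_remote_size_mb: float = 4.0
    allow_remote: bool = False
    indexable_formats: Tuple[str, ...] = ()
    index_field: Optional[str] = None
    search_boost: float = 1.0

    @classmethod
    def from_config(cls, config: Mapping[str, str]) -> "IndexerSettings":
        """Build settings from a flat mapping of option names to strings."""
        get = config.get
        return cls(
            max_remote_size_mb=float(
                get("ckanext.resource_indexer.max_remote_size", "4")
            ),
            allow_remote=_as_bool(
                get("ckanext.resource_indexer.allow_remote", "false")
            ),
            indexable_formats=tuple(
                get("ckanext.resource_indexer.indexable_formats", "").lower().split()
            ),
            # Option names below are spelled exactly as documented above.
            index_field=get("ckanext.resoruce_indexer.index_field") or None,
            search_boost=float(get("ckanext.resoruce_indexer.search_boost", "1")),
        )

    def should_index(self, fmt: str, is_remote: bool, size_bytes: int) -> bool:
        """Decide whether a resource's content would be indexed."""
        if fmt.lower() not in self.indexable_formats:
            return False
        if not is_remote:
            return True  # assumption: local files are not size-limited
        return self.allow_remote and size_bytes <= self.max_remote_size_mb * MB

    def boosted_field(self) -> Optional[str]:
        """Return a Solr-style `field^boost` term, or None without a custom field."""
        if self.index_field is None:
            return None
        return f"{self.index_field}^{self.search_boost:g}"


settings = IndexerSettings.from_config(
    {
        "ckanext.resource_indexer.max_remote_size": "4",
        "ckanext.resource_indexer.allow_remote": "1",
        "ckanext.resource_indexer.indexable_formats": "txt pdf",
        "ckanext.resoruce_indexer.index_field": "extras_res_attachment",
        "ckanext.resoruce_indexer.search_boost": "0.5",
    }
)
```

With the example values from this section, `settings.should_index("pdf", is_remote=True, size_bytes=5 * MB)` is false because the file exceeds the 4 MB threshold, and `settings.boosted_field()` produces a term that could be appended to a Solr query-fields string.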

Development Installation

To install ckanext-resource_indexer for development, activate your CKAN virtualenv and do:

git clone https://github.com/DataShades/ckanext-resource_indexer.git
cd ckanext-resource_indexer
python setup.py develop

Add resource_indexer (and optionally pdf_resource_indexer or plain_resource_indexer) to the ckan.plugins setting in your CKAN config file (by default, the config file is located at /etc/ckan/default/production.ini).

Running the Tests

To run the tests, do:

pytest --ckan-ini test.ini

AWS Linux

On AWS Linux, install the following system packages:

sudo yum install -y pulseaudio-libs-devel python-devel libxml2-devel libxslt-devel poppler poppler-utils poppler-cpp-devel
