Skip to main content

Index DBnomics data into Apache Solr for full-text and faceted search.

Project description

DBnomics Solr

Index DBnomics data into Apache Solr for full-text and faceted search.

Requirements:

  • a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.

See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.

Configuration

Environment variables:

Index a provider

Replace wto by the real provider slug in the following command:

dbnomics-solr index-provider /path/to/wto-json-data

Full mode vs incremental mode

When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.

When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.

It is possible to force the full mode with the --full option.

Bare repositories

The script has an option --bare-repo-fallback which tries to add .git at the end of the storage dir name, if not found.

Remove all data from a provider

To remove all the documents related to a provider (type:provider, type:dataset and type:series):

./delete_provider.sh <provider_code>

Example:
./delete_provider.sh WTO

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbnomics-solr-1.0.4.tar.gz (50.2 kB view details)

Uploaded Source

Built Distribution

dbnomics_solr-1.0.4-py2.py3-none-any.whl (21.1 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file dbnomics-solr-1.0.4.tar.gz.

File metadata

  • Download URL: dbnomics-solr-1.0.4.tar.gz
  • Upload date:
  • Size: 50.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.8.6

File hashes

Hashes for dbnomics-solr-1.0.4.tar.gz
Algorithm Hash digest
SHA256 edf8f9d6cf27559bb9997dc62e21a304259bc9c47920f026e9c2b324b457b6f6
MD5 2b96709f6e050fe33271316e9466449c
BLAKE2b-256 8d0068cb90fc0daeaded6e647e67997908616c9c4c04f6be06a1a3b54afa7c7f

See more details on using hashes here.

File details

Details for the file dbnomics_solr-1.0.4-py2.py3-none-any.whl.

File metadata

  • Download URL: dbnomics_solr-1.0.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 21.1 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.8.6

File hashes

Hashes for dbnomics_solr-1.0.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 491ceebbcc203d8bab2edf537bfa7adf14c2e8171a3a158f9c9533ab77fdbf8c
MD5 9ef8a054ccbf6e61ef457aad25d503be
BLAKE2b-256 547c246c5763cedac379b1538c88a1f2a8a12994ecfce49c772be3adec8f84a5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page