Skip to main content

Index DBnomics data with Apache Solr for full-text and faceted search

Project description

DBnomics Solr

Index DBnomics data into Apache Solr for full-text and faceted search.

Requirements:

  • a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.

See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.

Configuration

Environment variables:

Index a provider

Replace wto by the real provider slug in the following command:

dbnomics-solr index-provider /path/to/wto-json-data

Full mode vs incremental mode

When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.

When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.

It is possible to force the full mode with the --full option.

Bare repositories

The script has an option --bare-repo-fallback which tries to add .git at the end of the storage dir name, if not found.

Remove all data from a provider

To remove all the documents related to a provider (type:provider, type:dataset and type:series):

dbnomics-solr --debug delete-provider --code <provider_code>
dbnomics-solr --debug delete-provider --slug <provider_slug>

# Examples:
dbnomics-solr --debug delete-provider --code WTO
dbnomics-solr --debug delete-provider --slug wto

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbnomics_solr-1.1.19.tar.gz (51.6 kB view details)

Uploaded Source

Built Distribution

dbnomics_solr-1.1.19-py3-none-any.whl (45.3 kB view details)

Uploaded Python 3

File details

Details for the file dbnomics_solr-1.1.19.tar.gz.

File metadata

  • Download URL: dbnomics_solr-1.1.19.tar.gz
  • Upload date:
  • Size: 51.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dbnomics_solr-1.1.19.tar.gz
Algorithm Hash digest
SHA256 4c659cb720d5787dc0df71d277052e79d675b3219bf25bb52d893aacb2a455e7
MD5 be518ebf2a7408ea30d7a08f4f5ca7f7
BLAKE2b-256 491813e5c0494273c54241435b8f9dcaa11d3912ab79fe0ac454627b09b532c0

See more details on using hashes here.

File details

Details for the file dbnomics_solr-1.1.19-py3-none-any.whl.

File metadata

File hashes

Hashes for dbnomics_solr-1.1.19-py3-none-any.whl
Algorithm Hash digest
SHA256 e94c606d1f4124a1d658a52cbb82db9eaacbd294468d12f2f75d356d60d85008
MD5 8024dd862da2286593437c31ed9d843f
BLAKE2b-256 be206a53972a61994001271cad4c31d2d8074b6c07370fd0f51321e94d0cac5f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page