Skip to main content

Index DBnomics data with Apache Solr for full-text and faceted search

Project description

DBnomics Solr

Index DBnomics data into Apache Solr for full-text and faceted search.

Requirements:

  • a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.

See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.

Configuration

Environment variables:

Index a provider

Replace wto by the real provider slug in the following command:

dbnomics-solr index-provider /path/to/wto-json-data

Full mode vs incremental mode

When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.

When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.

It is possible to force the full mode with the --full option.

Bare repositories

The script has an option --bare-repo-fallback which tries to add .git at the end of the storage dir name, if not found.

Remove all data from a provider

To remove all the documents related to a provider (type:provider, type:dataset and type:series):

dbnomics-solr --debug delete-provider --code <provider_code>
dbnomics-solr --debug delete-provider --slug <provider_slug>

# Examples:
dbnomics-solr --debug delete-provider --code WTO
dbnomics-solr --debug delete-provider --slug wto

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbnomics_solr-1.1.18.tar.gz (51.2 kB view details)

Uploaded Source

Built Distribution

dbnomics_solr-1.1.18-py3-none-any.whl (45.0 kB view details)

Uploaded Python 3

File details

Details for the file dbnomics_solr-1.1.18.tar.gz.

File metadata

  • Download URL: dbnomics_solr-1.1.18.tar.gz
  • Upload date:
  • Size: 51.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for dbnomics_solr-1.1.18.tar.gz
Algorithm Hash digest
SHA256 217e7c56fa110d6d86b512e28479ca9b3f757280c40b7abe221cc6c8f78f01b1
MD5 edec0ec0f8577f25731be10c5108d040
BLAKE2b-256 9f5478dad348265ca8ed85a7904c434ddd3f5540b0c035822ec33997c4a28c3d

See more details on using hashes here.

File details

Details for the file dbnomics_solr-1.1.18-py3-none-any.whl.

File metadata

File hashes

Hashes for dbnomics_solr-1.1.18-py3-none-any.whl
Algorithm Hash digest
SHA256 b5ddf506d44585a468d21cc66b8a2d0ced6ea8ca0fc24bde4ec009406cba0c61
MD5 6503e1c19258692827fb29e1ab6c4b04
BLAKE2b-256 1023974585f1591997a8a05f11bcdcd8ed6cb148724ccde2c1c3b6c9ca096cde

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page