Index DBnomics data with Apache Solr for full-text and faceted search
Project description
DBnomics Solr
Index DBnomics data into Apache Solr for full-text and faceted search.
Requirements:
- a running instance of Apache Solr; at the time this documentation is written, we use the version 7.3.
See dbnomics-docker to run a local DBnomics instance with Docker that includes a service for Apache Solr.
Configuration
Environment variables:
DEBUG_PYSOLR
: display pysolr DEBUG logging messages (cf https://github.com/django-haystack/pysolr)
Index a provider
Replace wto
by the real provider slug in the following command:
dbnomics-solr index-provider /path/to/wto-json-data
Full mode vs incremental mode
When data is stored in a regular directory, the script always indexes all datasets and series of a provider. This is called full mode.
When data is stored in a Git repository, the script runs by default in incremental mode: it indexes only the datasets modified since the last indexation.
It is possible to force the full mode with the --full
option.
Bare repositories
The script has an option --bare-repo-fallback
which tries to add .git
at the end of the storage dir name, if not found.
Remove all data from a provider
To remove all the documents related to a provider (type:provider
, type:dataset
and type:series
):
dbnomics-solr --debug delete-provider --code <provider_code>
dbnomics-solr --debug delete-provider --slug <provider_slug>
# Examples:
dbnomics-solr --debug delete-provider --code WTO
dbnomics-solr --debug delete-provider --slug wto
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dbnomics_solr-1.1.19.tar.gz
.
File metadata
- Download URL: dbnomics_solr-1.1.19.tar.gz
- Upload date:
- Size: 51.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c659cb720d5787dc0df71d277052e79d675b3219bf25bb52d893aacb2a455e7 |
|
MD5 | be518ebf2a7408ea30d7a08f4f5ca7f7 |
|
BLAKE2b-256 | 491813e5c0494273c54241435b8f9dcaa11d3912ab79fe0ac454627b09b532c0 |
File details
Details for the file dbnomics_solr-1.1.19-py3-none-any.whl
.
File metadata
- Download URL: dbnomics_solr-1.1.19-py3-none-any.whl
- Upload date:
- Size: 45.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e94c606d1f4124a1d658a52cbb82db9eaacbd294468d12f2f75d356d60d85008 |
|
MD5 | 8024dd862da2286593437c31ed9d843f |
|
BLAKE2b-256 | be206a53972a61994001271cad4c31d2d8074b6c07370fd0f51321e94d0cac5f |