A helper to ingest data in SDAP
Project description
SDAP manager for ingestion of datasets
Prerequisites
Python 3
Install Anaconda for Python 3, for example with the graphical installer for macOS:
https://www.anaconda.com/distribution/#macos
Git LFS (for development)
Git LFS is needed when deploying from git, see https://git-lfs.github.com/
If it is not available and you need to run the tests, you have to obtain the NetCDF test files yourself.
Nexus deployed on a Kubernetes cluster
See project https://github.com/apache/incubator-sdap-nexus
$ helm install nexus . --namespace=sdap --dependency-update -f ~/overridden-nexus-values.yml
Install, Configure and run
Install
Stay logged in as a regular user:
$ pip install sdap_ingest_manager
Configure the ingestion system
Note the message printed at the end of the installation output:
--------------------------------------------------------------
Now, create configuration files in
***/<some path>/.sdap_ingest_manager***
Use templates and examples provided there
--------------------------------------------------------------
Use the path shown in the message and create your own configuration files:
$ cd /<some path>/.sdap_ingest_manager
$ cp sdap_ingest_manager.ini.default sdap_ingest_manager.ini
Edit the newly created file, following the instructions in its comments.
Configure the collections
The collections can be configured in a local YAML file referenced in the sdap_ingest_manager.ini
file.
They can also be configured in a Google spreadsheet.
If both are configured, the local YAML file is used.
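The precedence rule above can be sketched in Python as follows. This is only an illustration: the section names `LOCAL_YAML` and `GOOGLE_SPREADSHEET` and the options `path` and `sheet_id` are hypothetical, not the names used by sdap_ingest_manager. The real option names are documented in the comments of sdap_ingest_manager.ini.default.

```python
import configparser


def pick_collection_source(config: configparser.ConfigParser) -> str:
    """Return which collection source wins: 'yaml', 'spreadsheet' or 'none'."""
    # Hypothetical section and option names, for illustration only.
    yaml_path = config.get("LOCAL_YAML", "path", fallback="").strip()
    sheet_id = config.get("GOOGLE_SPREADSHEET", "sheet_id", fallback="").strip()
    if yaml_path:
        # The local YAML file takes precedence when both are configured.
        return "yaml"
    if sheet_id:
        return "spreadsheet"
    return "none"


config = configparser.ConfigParser()
config.read_string("""
[LOCAL_YAML]
path = collections.yml

[GOOGLE_SPREADSHEET]
sheet_id = some-sheet-id
""")
print(pick_collection_source(config))
```

With both sources set, as in this example, the function selects the local YAML file.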
Run the ingestion
To ingest all the configured collections:
$ run_collections
The number of parallel jobs can be updated during the process in the sdap_ingest_manager.ini
file.
If the process is interrupted (killed), it resumes where it stopped the next time it is run.
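One common way to get this kind of resumable behavior is to record each completed item in a history file and skip recorded items on the next run. The sketch below illustrates that idea only; it is an assumption about how resuming can work, not the actual mechanism of sdap_ingest_manager, and the names `run_pending` and `history.json` are hypothetical.

```python
import json
import tempfile
from pathlib import Path


def run_pending(items, history_file: Path, process):
    """Process the items not yet listed in history_file, recording each one once done."""
    done = set(json.loads(history_file.read_text())) if history_file.exists() else set()
    for item in items:
        if item in done:
            continue  # already handled by a previous run
        process(item)
        done.add(item)
        # Persist after every item so that a kill loses at most the item in flight.
        history_file.write_text(json.dumps(sorted(done)))
    return done


with tempfile.TemporaryDirectory() as tmp:
    history = Path(tmp) / "history.json"  # hypothetical history file
    files = ["a.nc", "b.nc", "c.nc"]
    first_run, second_run = [], []
    # First run stops after two files, as if the process had been killed.
    run_pending(files[:2], history, first_run.append)
    # Second run receives the full list and only handles what is left.
    run_pending(files, history, second_run.append)
    print(first_run, second_run)
```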
For developers
Deploy the project
$ bash
$ git clone ...
$ cd sdap_ingest_manager
$ python -m venv venv
$ source ./venv/bin/activate
$ pip install .
Note: the command pip install -e . does not work, because it does not deploy the configuration files.
Update the project
Update the code and the tests with your favorite IDE (e.g. PyCharm).
Test and create the package
Change the version in the file setup.py, then run the tests and tag the release:
$ python setup.py test
$ git tag <version>
$ git push origin <version>
The release is pushed to PyPI automatically through a GitHub Action.
Create a tag and publish the package as a GitHub release.
Download files
File details
Details for the file sdap_ingest_manager-0.2.0rc1.tar.gz.
File metadata
- Download URL: sdap_ingest_manager-0.2.0rc1.tar.gz
- Upload date:
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | 669b1a35d038ed0cb17e83f32978d27afa737a67f782c40b5e6487b1801f8bc8
MD5 | a7604bece463c8f27e2505a2af48d373
BLAKE2b-256 | ef39bc782818b90aa32fc23b1c2aee66c3586569c0004ecec5c35a077023ceba
File details
Details for the file sdap_ingest_manager-0.2.0rc1-py3-none-any.whl.
File metadata
- Download URL: sdap_ingest_manager-0.2.0rc1-py3-none-any.whl
- Upload date:
- Size: 3.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | 5e8985712b47cedf72de6f7e3c22693820c4365e04d404e523f4b37d57c84e03
MD5 | 62aab8026a7b6c9061ecb0d662ac9644
BLAKE2b-256 | c088c6e645b604d65911b5a4b8d2a7a4d65dba50260e9ec8b45790975239f7f5
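A downloaded file can be checked against the SHA256 digests listed above. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `verify` are illustrative and not part of sdap_ingest_manager.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details above.
EXPECTED_SHA256 = {
    "sdap_ingest_manager-0.2.0rc1.tar.gz":
        "669b1a35d038ed0cb17e83f32978d27afa737a67f782c40b5e6487b1801f8bc8",
    "sdap_ingest_manager-0.2.0rc1-py3-none-any.whl":
        "5e8985712b47cedf72de6f7e3c22693820c4365e04d404e523f4b37d57c84e03",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path) -> bool:
    """Return True if the file's digest matches the one published for its name."""
    return sha256_of(path) == EXPECTED_SHA256[path.name]
```

For example, `verify(Path("sdap_ingest_manager-0.2.0rc1.tar.gz"))` returns `True` only if the local copy matches the published digest. A file name that is not in the table raises `KeyError`.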