Skip to main content

a helper to ingest data in sdap

Project description

SDAP manager for ingestion of datasets

Prerequisites

python 3

Install anaconda for python 3. From the graphic install for example for macos:

https://www.anaconda.com/distribution/#macos

git lfs (for development)

Git lfs for the deployment from git, see https://git-lfs.github.com/

If not available you have to get netcdf files for test, if you do need the tests.

Deployed nexus on kubernetes cluster

See project https://github.com/apache/incubator-sdap-nexus

$ helm install nexus .  --namespace=sdap --dependency-update -f ~/overridden-nexus-values.yml 

Install, Configure and run

Install

Stay logged in a user

$ pip install sdap_ingest_manager

Configure the ingestion system

Catch the message at the end of the installation output

--------------------------------------------------------------
Now, create configuration files in
***/<some path>>/.sdap_ingest_manager***
 Use templates and examples provided there
--------------------------------------------------------------

Use the path shown in the message and create your own configuration files:

$ cd /<some path>>/.sdap_ingest_manager
$ cp sdap_ingest_manager.ini.default sdap_ingest_manager.ini

Edit and update the newly created files by following instructions in the comments.

Configure the collections

You can either configure it in a local yaml file referenced in the sdap_ingest_manager.ini file.

It can also be in a google spreadsheet.

If both a configured, the local yaml file will be used.

Run the ingestion

On the list of the configured collections:

$ run_collections

The number of parallel jobs can be updated during the process in the sdap_ingest_manager.ini file.

If interrupted (killed) the process will restart where it was.

For developers

deploy project

$ bash
$ git clone ...
$ cd sdap_ingest_manager
$ python -m venv venv
$ source ./venv/bin/activate
$ pip install .

Note the command pip install -e . does not work as it does not deploy the configuration files.

Update the project

Update the code and the test with your favorite IDE (e.g. pyCharm).

Test and create the package

Change version in file setup.py

$ python setup.py test
$ git tag <version>
$ git push origin <version>

The release will be automatically pushed to pypi though github action.

Create a tag and publish the package as a gitHub release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sdap_ingest_manager-0.2.0rc1.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

sdap_ingest_manager-0.2.0rc1-py3-none-any.whl (3.9 MB view details)

Uploaded Python 3

File details

Details for the file sdap_ingest_manager-0.2.0rc1.tar.gz.

File metadata

  • Download URL: sdap_ingest_manager-0.2.0rc1.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.7

File hashes

Hashes for sdap_ingest_manager-0.2.0rc1.tar.gz
Algorithm Hash digest
SHA256 669b1a35d038ed0cb17e83f32978d27afa737a67f782c40b5e6487b1801f8bc8
MD5 a7604bece463c8f27e2505a2af48d373
BLAKE2b-256 ef39bc782818b90aa32fc23b1c2aee66c3586569c0004ecec5c35a077023ceba

See more details on using hashes here.

File details

Details for the file sdap_ingest_manager-0.2.0rc1-py3-none-any.whl.

File metadata

  • Download URL: sdap_ingest_manager-0.2.0rc1-py3-none-any.whl
  • Upload date:
  • Size: 3.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.7.7

File hashes

Hashes for sdap_ingest_manager-0.2.0rc1-py3-none-any.whl
Algorithm Hash digest
SHA256 5e8985712b47cedf72de6f7e3c22693820c4365e04d404e523f4b37d57c84e03
MD5 62aab8026a7b6c9061ecb0d662ac9644
BLAKE2b-256 c088c6e645b604d65911b5a4b8d2a7a4d65dba50260e9ec8b45790975239f7f5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page