Skip to main content

Amphibious new data transformer to prepare various sources for CGP DSS Data Loader

Project description

newt-transformer

Build Status

Metadata transformer to convert from gen3 to something readable by cgp-dss-data-loader

Common Setup

  1. (optional) We recommend using a Python 3 virtual environment.

  2. Run:

    pip3 install newt-transformer

Setup for Development

  1. Clone the repo:

    git clone https://github.com/jessebrennan/newt-transformer.git

  2. Go to the root directory of the cloned project:

    cd newt-transformer

  3. Run (ideally in a new virtual environment):

    make develop

Running Tests

  1. make sure you followed Setup for Development

  2. Run:

    make test

Transforming data from sheepdog-exporter

  1. The first step is to extract the Gen3 data you want using the sheepdog exporter. The TopMed public data extracted from sheepdog is available on the release page under Assets. Assuming you use this data, you will now have a file called topmed-public.json

  2. Make sure you are running the virtual environment you set up in the Setup instructions.

  3. Now we need to transform the data. From the root of the project run:

    newt new /path/to/topmed-public.json --output-json transformed-topmed-public.json

    This will generate a transformed output file called transformed-topmed-public.json.

    The new argument specifies that we want the most recent version of the transformer output format. It can be replaced with a gen3 argument, but this older format will soon be deprecated.

  4. Likely you want to upload this data to the DSS. Instructions for this can be found at the DSS data loader repo.

Transformer Architecture

see github for details

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

newt-transformer-0.1.7.tar.gz (11.8 kB view details)

Uploaded Source

File details

Details for the file newt-transformer-0.1.7.tar.gz.

File metadata

File hashes

Hashes for newt-transformer-0.1.7.tar.gz
Algorithm Hash digest
SHA256 85e2dad8ace31b747a570311619f5025cbb5daf3b4c93a76967819fa51cf1749
MD5 58764d8448e9addc8736de34e8b096e6
BLAKE2b-256 a4a092d4788855e8edc13ccbfa39fbd22aa84a4fca7c58eca4ebb0aa2b46dbd1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page