A command line client for the Global Pathogen Analysis Service
Project description
A standalone command line and Python API client for interacting with the Global Pathogen Analysis Service. Tested on Linux, MacOS, with Windows support planned. Uses Python 3.10+
Progress
Command line interface | Python API |
---|---|
✅ gpas upload |
✅ lib.Batch(upload_csv, token).upload() |
✅ gpas download |
✅ lib.download_async() |
✅ gpas validate |
✅ validation.validate() |
✅ gpas status |
✅ lib.fetch_status() , lib.fetch_status_async() |
Install
With conda
curl https://raw.githubusercontent.com/GlobalPathogenAnalysisService/gpas-cli/main/environment.yml --output environment.yml
conda env create -f environment.yml
conda activate gpas-cli
pip install gpas==0.1.0 # If you'd like a versioned release
With pip
Install Samtools and read-it-and-keep manually
pip install gpas
# Tell gpas-cli where you installed samtools and read-it-and-keep
export GPAS_SAMTOOLS_PATH=path/to/samtools
export GPAS_READITANDKEEP_PATH=path/to/readItAndKeep
Authentication
Most gpas-cli actions require a valid API token (token.json
). This can be saved using the 'Get API token' button on the Upload Client
page of the GPAS portal. If you can't see this button, please ask the team to enable it for you.
Command line usage
gpas validate
Validates an upload_csv
and checks that the fastq or bam files it references exist.
gpas validate large-nanopore-fastq.csv
# Validate supplied tags
gpas validate --environment dev --token token.json large-nanopore-fastq.csv
% gpas validate -h
usage: gpas validate [-h] [--token TOKEN] [--environment {dev,staging,prod}] [--json-messages] upload_csv
Validate an upload CSV. Validates tags remotely if supplied with an authentication token
positional arguments:
upload_csv Path of upload CSV
options:
-h, --help show this help message and exit
--token TOKEN Path of auth token available from GPAS Portal
(default: None)
--environment {dev,staging,prod}
GPAS environment to use
(default: prod)
--json-messages Emit JSON to stdout
(default: False)
gpas upload
Validates, decontaminates and upload reads specified in upload_csv
to the specified GPAS environment
gpas upload --environment dev --token token.json large-illumina-bam.csv
# Dry run; skip submission
gpas upload --dry-run --environment dev --token token.json large-illumina-bam.csv
# Offline mode; quit after decontamination
gpas upload tests/test-data/large-nanopore-fastq.csv
% gpas upload -h
usage: gpas upload [-h] [--token TOKEN] [--working-dir WORKING_DIR] [--out-dir OUT_DIR] [--processes PROCESSES] [--dry-run]
[--debug] [--environment {dev,staging,prod}] [--json-messages]
upload_csv
Validate, decontaminate and upload reads to the GPAS platform
positional arguments:
upload_csv Path of upload csv
options:
-h, --help show this help message and exit
--token TOKEN Path of auth token available from GPAS Portal
(default: None)
--working-dir WORKING_DIR
Path of directory in which to make intermediate files
(default: /tmp)
--out-dir OUT_DIR Path of directory in which to save mapping CSV
(default: .)
--processes PROCESSES
Number of tasks to execute in parallel. 0 = auto
(default: 0)
--dry-run Exit before submitting files
(default: False)
--debug Emit verbose debug messages
(default: False)
--environment {dev,staging,prod}
GPAS environment to use
(default: prod)
--json-messages Emit JSON to stdout
(default: False)
gpas download
Downloads json
, fasta
, vcf
and bam
outputs from the GPAS platform by passing either a mapping_csv
generated during batch upload, or a comma-separated list of sample guids. By passing both --mapping-csv
and --rename
, output files are saved using local sample names without the platform's knowledge.
# Download and rename BAMs for a previous upload
gpas download --rename --mapping-csv example_mapping.csv --file-types bam token.json
# Download all outputs for a single guid
gpas download --guids 6e024eb1-432c-4b1b-8f57-3911fe87555f --file-types json,vcf,bam,fasta token.json
% gpas download -h
usage: gpas download [-h] [--mapping-csv MAPPING_CSV] [--guids GUIDS] [--file-types FILE_TYPES] [--out-dir OUT_DIR] [--rename]
[--debug] [--environment {dev,staging,prod}]
token
Download analytical outputs from the GPAS platform for given a mapping csv or list of guids
positional arguments:
token Path of auth token (available from GPAS Portal)
options:
-h, --help show this help message and exit
--mapping-csv MAPPING_CSV
Path of mapping CSV generated at upload time
(default: None)
--guids GUIDS Comma-separated list of GPAS sample guids
(default: )
--file-types FILE_TYPES
Comma separated list of outputs to download (json,fasta,bam,vcf)
(default: fasta)
--out-dir OUT_DIR Path of output directory
(default: /Users/bede/Research/Git/gpas-cli)
--rename Rename outputs using local sample names (requires --mapping-csv)
(default: False)
--debug Emit verbose debug messages
(default: False)
--environment {dev,staging,prod}
GPAS environment to use
(default: prod)
gpas status
Check the processing status of an uploaded batch by passing either a mapping_csv
generated at upload time, or a comma-separated list of sample guids.
gpas status --mapping-csv example_mapping.csv --environment dev token.json
gpas status --guids 6e024eb1-432c-4b1b-8f57-3911fe87555f --format json token.json
% gpas status -h
usage: gpas status [-h] [--mapping-csv MAPPING_CSV] [--guids GUIDS] [--format {table,csv,json}] [--rename] [--raw]
[--environment {dev,staging,prod}]
token
Check the status of samples submitted to the GPAS platform
positional arguments:
token Path of auth token available from GPAS Portal
options:
-h, --help show this help message and exit
--mapping-csv MAPPING_CSV
Path of mapping CSV generated at upload time
(default: None)
--guids GUIDS Comma-separated list of GPAS sample guids
(default: )
--format {table,csv,json}
Output format
(default: table)
--rename Use local sample names (requires --mapping-csv)
(default: False)
--raw Emit raw response
(default: False)
--environment {dev,staging,prod}
GPAS environment to use
(default: prod)
Development and testing
Use pre-commit to apply black style at commit time (should happen automatically)
conda create -n gpas-cli-dev python=3.10 read-it-and-keep=0.3.0 samtools=1.15.1 pytest pytest-cov black pre-commit mypy
conda activate gpas-cli-dev
git clone https://github.com/GlobalPathogenAnalysisService/gpas-cli
cd gpas-cli
pip install -e ./
# Offline unit tests
pytest tests/test_gpas.py
# Online and upload tests require a valid token
pytest --cov=gpas
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file gpas-0.0.5.tar.gz
.
File metadata
- Download URL: gpas-0.0.5.tar.gz
- Upload date:
- Size: 450.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.8.2 readme-renderer/27.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.4.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 71b63ce6dba2ab34e6414f9b5c12dacbcd935fa681d74cfda5b9292015c53dce |
|
MD5 | 9dfe4e38fec40f8df44aaa66552f9200 |
|
BLAKE2b-256 | e4e53950131426597b19c18963bb7017847c24a8a6e6095b4283026a34a76c6b |
File details
Details for the file gpas-0.0.5-py3-none-any.whl
.
File metadata
- Download URL: gpas-0.0.5-py3-none-any.whl
- Upload date:
- Size: 1.7 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.8.2 readme-renderer/27.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.2 keyring/23.4.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 094cce451bec205dff89365b988d77a3481eac022870b5bbddd4eac23db508a6 |
|
MD5 | 51b88c71b987219ad2df907001a6f1ff |
|
BLAKE2b-256 | 64fef014c709eac387c5e02e543dff1972aa208b788b5b2f06d9f0d79dbc5f08 |