Skip to main content

Utilities for working with the Stanford Genome Sequencing Service Center (GSSC) application logic on DNAnexus.

Project description

seqcenter_dnanexus

Utilities for working with the SCGPM Sequencing Center application logic on DNAnexus

API documentation is on Read the Docs.

Provides high level methods and scripts for working with sequencing results that are stored in DNAnexus projects. This repository is geared towards sequencing result projects that the Stanford Genome Sequencing Center creates in DNAnexus, since there are many project properties that are unique to their workflow which are utilized and queried here.

The heart of this API rests in the DxSeqResults() class in the dnanexus_utils.py module. Given a DNAnexus project of interest, a user can use high level methods around that project to do things such as:

  • Download QC reports and JSON stats for one or more barcoded samples,
  • Download FASTQ files or fetch them as DNAnexus DXFile objects,
  • Retrieve the properties that are set on specific FASTQ files,
  • accept project transfers in DNAnexus,
  • and more.

The scripts are many, and include tools such as:

  • Clean up projects to save space,
  • List projects and their properties for projects billed to a specific org,
  • Download fastqs of interest,
  • Add properties to a project,
  • Accept project transfers

The first point above has been heavily used to save space and costs. The script is called scgpm_clean_raw_data.py and works by removing unneccessary extras in the raw_data folder of a project. It works by running an app on DNAnexus by the same name and cleans out all projects that have been created within the last N days.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scgpm-seqresults-dnanexus-0.4.0.tar.gz (39.3 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file scgpm-seqresults-dnanexus-0.4.0.tar.gz.

File metadata

  • Download URL: scgpm-seqresults-dnanexus-0.4.0.tar.gz
  • Upload date:
  • Size: 39.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for scgpm-seqresults-dnanexus-0.4.0.tar.gz
Algorithm Hash digest
SHA256 6e040526467f143a97041142f043b60d2fe90abee1aa021f6c06752e464e449c
MD5 8b9189da266a61cdaa2e8dc6cf03fdc6
BLAKE2b-256 2ad73e78408b5804f884549e7150dbf3a50a52ad0e56608711bad84316d6bf4a

See more details on using hashes here.

File details

Details for the file scgpm_seqresults_dnanexus-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: scgpm_seqresults_dnanexus-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 74.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for scgpm_seqresults_dnanexus-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5a9b09a81c6dbdde6c9bb07a5394d9bf89dc7e1e38aff8f247e38df4f30c80ba
MD5 5ea924dfd9f97f817b7837473e05aae6
BLAKE2b-256 c3f93a1eeb166dc2e1d01ea134ac5220fa32c2ca6f6d63aed135225959955b54

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page