Utilities for working with the Stanford Genome Sequencing Service Center (GSSC) application logic on DNAnexus.

seqcenter_dnanexus

Utilities for working with the SCGPM Sequencing Center application logic on DNAnexus

API documentation is on Read the Docs.

Provides high-level methods and scripts for working with sequencing results that are stored in DNAnexus projects. This repository is geared towards the sequencing result projects that the Stanford Genome Sequencing Center creates in DNAnexus: those projects carry many properties that are unique to the center's workflow, and the code here uses and queries them.

The heart of this API is the DxSeqResults() class in the dnanexus_utils.py module. Given a DNAnexus project of interest, a user can call high-level methods on that project to do things such as:

  • Download QC reports and JSON stats for one or more barcoded samples,
  • Download FASTQ files or fetch them as DNAnexus DXFile objects,
  • Retrieve the properties that are set on specific FASTQ files,
  • Accept project transfers in DNAnexus,
  • and more.
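
As a rough illustration of the kind of property lookup listed above, the sketch below picks out FASTQ files for one barcoded sample. It deliberately does not call DxSeqResults (its method signatures are not shown here). Instead it works on plain dictionaries shaped like DNAnexus file descriptions, with a `name` and a `properties` mapping. The function name `fastqs_for_barcode` and the property key `barcode` are assumptions made for the example, not names taken from this package.

```python
from typing import Dict, Iterable, List


def fastqs_for_barcode(file_descriptions: Iterable[Dict], barcode: str) -> List[Dict]:
    """Return the descriptions of FASTQ files tagged with the given barcode.

    Each description is assumed to be a dict with a "name" string and a
    "properties" dict; the "barcode" property key is hypothetical.
    """
    matches = []
    for desc in file_descriptions:
        name = desc.get("name", "")
        props = desc.get("properties", {})
        # Skip anything that is not a FASTQ file.
        if not name.endswith((".fastq", ".fastq.gz")):
            continue
        if props.get("barcode") == barcode:
            matches.append(desc)
    return matches


# Made-up sample data, for illustration only.
example_files = [
    {"name": "sample_A_R1.fastq.gz", "properties": {"barcode": "ACGTAC"}},
    {"name": "sample_B_R1.fastq.gz", "properties": {"barcode": "TTGCAA"}},
    {"name": "sample_A_stats.json", "properties": {"barcode": "ACGTAC"}},
]

for desc in fastqs_for_barcode(example_files, "ACGTAC"):
    print(desc["name"])
```

In real use the descriptions would come from DNAnexus rather than from a hand-written list; the filtering step stays the same.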

The package also includes many scripts, with tools to:

  • Clean up projects to save space,
  • List projects and their properties for projects billed to a specific org,
  • Download FASTQ files of interest,
  • Add properties to a project,
  • Accept project transfers.

The first point above has been used heavily to save space and cost. The script is called scgpm_clean_raw_data.py; it removes unnecessary extras from the raw_data folder of a project. It does this by running a DNAnexus app of the same name, and it cleans out all projects that have been created within the last N days.
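
The "created within the last N days" selection can be sketched as a simple timestamp filter. This is a minimal illustration, not the logic of scgpm_clean_raw_data.py itself: it assumes each project is a dictionary with a `created` field in milliseconds since the Unix epoch (the convention DNAnexus uses in project descriptions), and the function name `projects_created_within` is hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, Iterable, List, Optional


def projects_created_within(
    projects: Iterable[Dict], days: int, now: Optional[datetime] = None
) -> List[Dict]:
    """Return projects whose "created" timestamp falls in the last `days` days.

    "created" is assumed to be milliseconds since the Unix epoch.
    `now` can be passed in explicitly to make the result reproducible.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    cutoff_ms = int((now - timedelta(days=days)).timestamp() * 1000)
    return [p for p in projects if p["created"] >= cutoff_ms]


def to_ms(moment: datetime) -> int:
    """Convert a timezone-aware datetime to milliseconds since the epoch."""
    return int(moment.timestamp() * 1000)


# Made-up sample data, for illustration only.
reference_time = datetime(2024, 1, 31, tzinfo=timezone.utc)
example_projects = [
    {"name": "run_recent", "created": to_ms(datetime(2024, 1, 30, tzinfo=timezone.utc))},
    {"name": "run_old", "created": to_ms(datetime(2024, 1, 1, tzinfo=timezone.utc))},
]

recent = projects_created_within(example_projects, days=7, now=reference_time)
print([p["name"] for p in recent])
```

Passing `now` explicitly keeps the cutoff fixed, which makes a clean-up run easier to test and to repeat.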

