Skip to main content

Utilities for working with the Stanford Genome Sequencing Service Center (GSSC) application logic on DNAnexus.

Project description

seqcenter_dnanexus

Utilities for working with the SCGPM Sequencing Center application logic on DNAnexus

API documentation is on Read the Docs.

Provides high level methods and scripts for working with sequencing results that are stored in DNAnexus projects. This repository is geared towards sequencing result projects that the Stanford Genome Sequencing Center creates in DNAnexus, since there are many project properties that are unique to their workflow which are utilized and queried here.

The heart of this API rests in the DxSeqResults() class in the dnanexus_utils.py module. Given a DNAnexus project of interest, a user can use high level methods around that project to do things such as:

  • Download QC reports and JSON stats for one or more barcoded samples,
  • Download FASTQ files or fetch them as DNAnexus DXFile objects,
  • Retrieve the properties that are set on specific FASTQ files,
  • accept project transfers in DNAnexus,
  • and more.

The scripts are many, and include tools such as:

  • Clean up projects to save space,
  • List projects and their properties for projects billed to a specific org,
  • Download fastqs of interest,
  • Add properties to a project,
  • Accept project transfers

The first point above has been heavily used to save space and costs. The script is called scgpm_clean_raw_data.py and works by removing unneccessary extras in the raw_data folder of a project. It works by running an app on DNAnexus by the same name and cleans out all projects that have been created within the last N days.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scgpm-seqresults-dnanexus-0.3.1.tar.gz (38.9 kB view details)

Uploaded Source

Built Distribution

scgpm_seqresults_dnanexus-0.3.1-py3-none-any.whl (74.3 kB view details)

Uploaded Python 3

File details

Details for the file scgpm-seqresults-dnanexus-0.3.1.tar.gz.

File metadata

  • Download URL: scgpm-seqresults-dnanexus-0.3.1.tar.gz
  • Upload date:
  • Size: 38.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for scgpm-seqresults-dnanexus-0.3.1.tar.gz
Algorithm Hash digest
SHA256 e50c54894b63c92db88044065a31df72d94091a90df5b312ab7358b2202ff9b9
MD5 c5d561029150f947b41debec7978f657
BLAKE2b-256 ca33232abee3cdf023d05214c21abf6cef7255977708e06afcd0fdc32b95fc4d

See more details on using hashes here.

File details

Details for the file scgpm_seqresults_dnanexus-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: scgpm_seqresults_dnanexus-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 74.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.6.2 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.7.1

File hashes

Hashes for scgpm_seqresults_dnanexus-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6e5dbd024152e7646160a839e8300c35b8dcdd95a0542c38dc1e0d6f7179d508
MD5 8b609492526eb6ad9bee900320f6504a
BLAKE2b-256 ebb433e7b8ec1bd3926363302e1c17bf9a352665839a868916aea3dc73871fdb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page