Skip to main content

DNase-seq analysis library

Project description

Introduction

Many people currently analyzing DNase-seq data are using tools designed for ChIP-seq work, but may be inappropriate for DNase-seq data where one is less interested in the overlaps of sequenced fragments, but the site at which the cut occurs (the 5’ most end of the aligned sequence fragment).

We have developed pyDNase to interface with a sorted and indexed BAM file from a DNase-seq experiment, allowing efficient and easy random access of DNase-seq cut data from any genomic location, e.g.

>>> import pyDNase
>>> reads = pyDNase.BAMHandler(pyDNase.example_reads())
>>> reads["chr6,170863500,170863532,+"]
{'+': array([0,0,0,1,0,0,1,1,2,0,0,0,0,1,0,0,0,0,1,0,0,0,0,1,0,0,1,1,0,0,0,1]),
 '-': array([0,10,1,0,1,0,4,9,0,1,0,2,1,0,0,0,0,0,3,0,6,3,0,0,0,1,1,1,3,0,3,6])}

Querying the BAMHandler object returns a dictionary containing numpy arrays with DNase cut counts on the positive reference strand (+), and cuts on the negative reference strand (-). pyDNase efficiently caches the cut data queried, so that multiple requests from the same genomic locations do not require repeated lookups from the BAM file (this can be disabled).

pyDNase comes with several analysis scripts covering several common use cases of DNase-seq analysis, and also an implementation of the Wellington and Wellington 1D footprinting algorithms.

to install pyDNase, ensure NumPy is installed, and run:

$ pip install pyDNase

for full documentation go to: http://pythonhosted.org/pyDNase/

Support

If you’re having any troubles, please send an email to j.piper@warwick.ac.uk and I’ll do my best to help you out. If you notice any bugs, then please raise an issue over at the github repo.

Contributions

I highly encourage contributions! This is my first software development project - send any pull requests this way. I’m particularly interested in cool analysis scripts that anyone has written.

Reference

License

Copyright (C) 2013 Jason Piper. This work is licensed under the GNU GPLv3 license, see LICENCE.TXT for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyDNase-0.1.2.tar.gz (201.3 kB view details)

Uploaded Source

File details

Details for the file pyDNase-0.1.2.tar.gz.

File metadata

  • Download URL: pyDNase-0.1.2.tar.gz
  • Upload date:
  • Size: 201.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pyDNase-0.1.2.tar.gz
Algorithm Hash digest
SHA256 8ae1b58d2c3e664932a0486ec638a0e3ce8d29df5441d196bddc864f9994b9ae
MD5 bb1706c616aee07801b761cd3493f7a9
BLAKE2b-256 51e8d11179040ac0f37b3886fe4f4e7e531158e67d1bbbf9382e3676e3afb9f6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page