Skip to main content

Library for applying haplotypes to reference DNA sequences

Project description

happer

Daniel Standage, 2018
https://github.com/bioforensics/happer

happer is a minimal Python library for generating complete haplotype sequences. Given a reference sequence and haplotype alleles annotated in BED format, happer will mutate the reference and produce sequences to match the specified haplotypes.

Installation

To install:

pip3 install happer

To make sure the package installed correctly:

pip3 install pytest
py.test --pyargs happer

happer requires Python version 3.

Usage

Reference sequences must be provided in Fasta format, and haplotype alleles must be specified in BED format as follows. Alleles corresponding to different haplotypes at the locus are separated by a | character, so for example a diploid individual should have 2 |-separated alleles annotated at each locus, while a tetraploid would have 4 alleles. In the example below, the CCGA alleles are phased and represent one haplotype, while the TATG alleles are phased and represent another haplotype.

#SeqID    Start  End     Alleles
chr1     38827  38828   C|T
chr1     59288  59289   C|A
chr2     24771  24772   G|T
chr4     201191 201192  A|G

To invoke happer from the command line:

[standage@lappy ~]$ happer --out haploseqs.fasta refr.fasta alleles.bed

To invoke happer directly in Python:

>>> import happer
>>> seqfile = open('refr.fasta', 'r')
>>> alleles = open('alleles.bed', 'r')
>>> for label, haploseq in happer.mutate.mutate(seqfile, alleles):
...     # do whatever you'd like with the sequences

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

happer-0.1.1.tar.gz (23.9 kB view details)

Uploaded Source

File details

Details for the file happer-0.1.1.tar.gz.

File metadata

  • Download URL: happer-0.1.1.tar.gz
  • Upload date:
  • Size: 23.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.7

File hashes

Hashes for happer-0.1.1.tar.gz
Algorithm Hash digest
SHA256 95490f3e14f992f849813cd44ac08d7deacbe6cf794483b31a5a7507ac1e9561
MD5 fec2bae39f7bb9851ba0be8d28ff339d
BLAKE2b-256 c9e77c8b98491c3a63bc4766e6656e9d1b4641daa2ccad7b04a638dd23896d6e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page