Lightly modified versions of samtools, htslib, and bcftools.
Project description
xsamtools
xsamtools makes the samtools tooling from htslib and bcftools available through pypi packaging. These tools have been lightly modified to allow merges on VCF streams without an index.
Installation
pip install xsamtools
Installation requires a C toolchain. Typically Ubuntu/Debian systems should have the following packages installed:
libbz2-dev
liblzma-dev
libcurl4-openssl-dev
libncurses5-dev
libcurl4-openssl-dev
may be omitted at the cost of some cloud support features in htslib.
Usage
After successful installation, the following executables are available:
samtools:
- htsfile
- bgzip
- tabix
- bcftools
xsamtools:
- merge_vcfs.py
xsamtools also provides Python tooling to create named (FIFO) pipes to Google Storage objects:
from xsamtools import pipes
reader = pipes.BlobReaderProcess("bucket-name", "read-key")
print("reader path", reader.filepath) # local FIFO filepath
writer_key = pipes.BlobWriterProcess("bucket-name", "writ-key")
print("writer path", writer.filepath) # local FIFO filepath
These streams appear as either readable or writable files on the filesystem. Such objects are not seekable.
A utility method is also provided to merge VCFs from GS objects:
from xsamtools import vcf
vcf.combine("src-bucket-name", ["first-src-vcf-key", "second-src-vcf-key"], "dst-bucket-name", "dst-vcf-key")
There is no formal limit on the number of VCF keys. Care should be taken that the VCF objects provided are aligned by chromosome or the merge will fail.
Docker
A Docker image with xsamtools installed is published at https://hub.docker.com/r/xbrianh/xsamtools
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file xsamtools-0.5.3.tar.gz
.
File metadata
- Download URL: xsamtools-0.5.3.tar.gz
- Upload date:
- Size: 3.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8e4f4466a1104c61efc6f7af10ac5cb125f1c2a29cb9192cd66e51260eefd8a2 |
|
MD5 | 942a147ef578944b7fd7ff3816e39f75 |
|
BLAKE2b-256 | e6604c223d76b4eaa180b4eadf11941cbd3a32b316d3c2465cb94ebb60136ca7 |