Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Documentation Build Status codecov DOI

Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances

We have demo notebooks on binder that you can interact with:

Binder

Sourmash is published on JOSS.


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We currently recommend installing the 2.0 pre-release series. You can use pip to do that like so:

pip install --pre sourmash

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a C++ development environment and the CPython development headers and libraries (for the C++ extension).

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda you will need to add the bioconda channel as well as the other channels bioconda depends on. Once you have setup bioconda, you can install sourmash by running:

$ conda create -n sourmash_env sourmash python=3.6.4
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the library code.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB June 2018

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-2.0.0a9.tar.gz (159.0 kB view details)

Uploaded Source

Built Distributions

sourmash-2.0.0a9-cp37-cp37m-manylinux1_x86_64.whl (561.4 kB view details)

Uploaded CPython 3.7m

sourmash-2.0.0a9-cp36-cp36m-manylinux1_x86_64.whl (491.9 kB view details)

Uploaded CPython 3.6m

sourmash-2.0.0a9-cp35-cp35m-manylinux1_x86_64.whl (479.6 kB view details)

Uploaded CPython 3.5m

sourmash-2.0.0a9-cp27-cp27mu-manylinux1_x86_64.whl (460.3 kB view details)

Uploaded CPython 2.7mu

sourmash-2.0.0a9-cp27-cp27m-manylinux1_x86_64.whl (460.3 kB view details)

Uploaded CPython 2.7m

File details

Details for the file sourmash-2.0.0a9.tar.gz.

File metadata

  • Download URL: sourmash-2.0.0a9.tar.gz
  • Upload date:
  • Size: 159.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for sourmash-2.0.0a9.tar.gz
Algorithm Hash digest
SHA256 7709a71dd7036cbfc062b9967426a0fb5ec6d5395c54bd7435ebe942ae6d71b4
MD5 c0a51adf04a079751965a5c59974cdf9
BLAKE2b-256 f3c8a921e25e90cc8e9bdf75931ac74d8460eb2791155f90f109923908015b13

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a9-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-2.0.0a9-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 6ebc39aab7f07f6e8f5ba8dfd0d8e3215cf83950ec9f246c96af4c76c649da60
MD5 9d803bf4cb6caa6b5e9470e5ffc00460
BLAKE2b-256 44a9f0ab56587c37c93ea5d46f79b2f997a345aad881eca929375a9cba303fc1

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a9-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-2.0.0a9-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 3dfe154065037b79fdf2f382c1d80fafbc73edb0f44b2ca6104cece978ea6f7e
MD5 3c3c5fa2c24998deb84b6cd27b846bd8
BLAKE2b-256 3be023f6dad39cc1f0b140bc0b4f59d4c4c4bb575742bebd03708f324447795c

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a9-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-2.0.0a9-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 698b9166430d6b74a5aa50afb2c5b6717b3ebf864588f6e5a39665ba8a18a822
MD5 bbd343c3b319dd1081e58554dd6025a8
BLAKE2b-256 698c3e9689c44e558bae86ae39593c9d002f377751eb1cbb7ed29ca42414c58b

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a9-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-2.0.0a9-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 5400c03b51391122fc00defa93093a6e862a3cd59d739f453a383de1f276b961
MD5 7f56ca17a708d12865b140646422120a
BLAKE2b-256 dcd47a55ff6b9cc08b61c53c1d4abab76a33d00dfff9559f68156d41a22c66fd

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a9-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-2.0.0a9-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 7e155ebd5c0a7e0ed279c5bd0ccf52f17bf0a7f09d580fcc38c5f2a8cfe53a7f
MD5 c0bba470d332eed12943ce895a52275b
BLAKE2b-256 df8615d214b729dc678aa74c7e98eadf37dcac76d8e1055ee7d1b5b0b3f4f97c

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page