Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Documentation Build Status codecov DOI

Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances

We have demo notebooks on binder that you can interact with:

Binder

Sourmash is published on JOSS.


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We currently recommend installing the 2.0 pre-release series. You can use pip to do that like so:

pip install --pre sourmash

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a C++ development environment and the CPython development headers and libraries (for the C++ extension).

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda you will need to add the bioconda channel as well as the other channels bioconda depends on. Once you have setup bioconda, you can install sourmash by running:

$ conda create -n sourmash_env sourmash python=3.6.4
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the library code.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB June 2018

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-2.0.0a10.tar.gz (88.5 kB view details)

Uploaded Source

Built Distributions

sourmash-2.0.0a10-cp37-cp37m-manylinux1_x86_64.whl (483.4 kB view details)

Uploaded CPython 3.7m

sourmash-2.0.0a10-cp36-cp36m-manylinux1_x86_64.whl (488.7 kB view details)

Uploaded CPython 3.6m

sourmash-2.0.0a10-cp35-cp35m-manylinux1_x86_64.whl (474.6 kB view details)

Uploaded CPython 3.5m

sourmash-2.0.0a10-cp27-cp27mu-manylinux1_x86_64.whl (459.1 kB view details)

Uploaded CPython 2.7mu

sourmash-2.0.0a10-cp27-cp27m-manylinux1_x86_64.whl (459.1 kB view details)

Uploaded CPython 2.7m

File details

Details for the file sourmash-2.0.0a10.tar.gz.

File metadata

  • Download URL: sourmash-2.0.0a10.tar.gz
  • Upload date:
  • Size: 88.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.3.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a10.tar.gz
Algorithm Hash digest
SHA256 3be376b17441a2ce65e3846f5566605b5957ea5c16429b4c83398b819a7f6b40
MD5 0898bd3a5c305275af1164ba35605aab
BLAKE2b-256 affe79f19dae075d71fea8a61145a725f799db8199dacf1dde29392c3f389043

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a10-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a10-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 483.4 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.3.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a10-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 3b956a35b46e818e44434cbc9577c75e3779beb7dacb22a03af7a45412e361f1
MD5 c6b20fba50d7d7f772e01d22ba039d76
BLAKE2b-256 5cdde68b71c1df3ef09ac2a87f541162f1751f29df50638c08ee1d2529ac8c6f

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a10-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a10-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 488.7 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.3.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a10-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 e96b95ef4f47a2a8733d303ef1087b635b81668137375929e8aa88362370cf37
MD5 62c3a512cdb66b882a87118172ba4e3e
BLAKE2b-256 a48d80b0ea420724f938ddce25a4bb7e17c6821ce30211bc1870b03948f7e686

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a10-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a10-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 474.6 kB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.3.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a10-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 1c0a7a7e5e4d6de88366c593a60875c9b2ef425e2f605199f1d9f861af505ccc
MD5 f24072fd6efefd7fc7c74650d6cc1dc9
BLAKE2b-256 648464d9cdbde4f04d6b42fb4034c8585027be33ec32434f16b24e04f896e056

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a10-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a10-cp27-cp27mu-manylinux1_x86_64.whl
  • Upload date:
  • Size: 459.1 kB
  • Tags: CPython 2.7mu
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.3.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a10-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 b6233c9ca248a751964299e774245e024cf1f67e00aed2dc0599b091f81646a7
MD5 b98986a69ca6c2f01c43b250af5bc4b5
BLAKE2b-256 a9c27cb99b95f7dab347f3f67c9ef4048783e436411877113b4d436bcb9c32d4

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0a10-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0a10-cp27-cp27m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 459.1 kB
  • Tags: CPython 2.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.19.1 setuptools/40.3.0 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6

File hashes

Hashes for sourmash-2.0.0a10-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 d3dc535ce3ff36a57073f5e8a68973dc4deef4846cd8f933f69472859637b781
MD5 0eb3f3960cb0b95305ffae228c8f5d13
BLAKE2b-256 c3da8a0e5ce95bda946b9a56fb48763293e0935a1d2690a0d92dc0fbb4aa5bb8

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page