Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Documentation Build Status codecov DOI

Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances

Sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We recommend using bioconda to install sourmash:

conda install sourmash

This will install the 2.0 pre-release series.

You can also use pip to install the pre-release like so:

pip install --pre sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a C++ development environment and the CPython development headers and libraries (for the C++ extension).

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda you will need to add the bioconda channel as well as the other channels bioconda depends on. Once you have setup bioconda, you can install sourmash by running:

$ conda create -n sourmash_env sourmash python=3.6.4
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the library code.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB Dec 2018

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-2.0.0.tar.gz (7.3 MB view details)

Uploaded Source

Built Distributions

sourmash-2.0.0-cp37-cp37m-manylinux1_x86_64.whl (458.3 kB view details)

Uploaded CPython 3.7m

sourmash-2.0.0-cp37-cp37m-macosx_10_6_intel.whl (155.7 kB view details)

Uploaded CPython 3.7m macOS 10.6+ intel

sourmash-2.0.0-cp36-cp36m-manylinux1_x86_64.whl (459.5 kB view details)

Uploaded CPython 3.6m

sourmash-2.0.0-cp36-cp36m-macosx_10_6_intel.whl (159.5 kB view details)

Uploaded CPython 3.6m macOS 10.6+ intel

sourmash-2.0.0-cp35-cp35m-manylinux1_x86_64.whl (450.0 kB view details)

Uploaded CPython 3.5m

sourmash-2.0.0-cp35-cp35m-macosx_10_6_intel.whl (156.5 kB view details)

Uploaded CPython 3.5m macOS 10.6+ intel

sourmash-2.0.0-cp34-cp34m-manylinux1_x86_64.whl (449.6 kB view details)

Uploaded CPython 3.4m

sourmash-2.0.0-cp34-cp34m-macosx_10_6_intel.whl (154.7 kB view details)

Uploaded CPython 3.4m macOS 10.6+ intel

sourmash-2.0.0-cp27-cp27mu-manylinux1_x86_64.whl (430.6 kB view details)

Uploaded CPython 2.7mu

sourmash-2.0.0-cp27-cp27m-manylinux1_x86_64.whl (430.5 kB view details)

Uploaded CPython 2.7m

sourmash-2.0.0-cp27-cp27m-macosx_10_6_intel.whl (155.9 kB view details)

Uploaded CPython 2.7m macOS 10.6+ intel

File details

Details for the file sourmash-2.0.0.tar.gz.

File metadata

  • Download URL: sourmash-2.0.0.tar.gz
  • Upload date:
  • Size: 7.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0.tar.gz
Algorithm Hash digest
SHA256 44c3d520360eab34fefb6a99eec58fb37ed9de078f5171b28d4918442aa075f7
MD5 700d8b36f9e8cda36ba9287b8edcc128
BLAKE2b-256 ae799659c5d7352643420511bdb2d2cd5302d3d9ed1bb1f02824586979b0eb70

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 458.3 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 c3899baffacccb4d68c8452c4a9926d2cc786c153587a99073099889a7235feb
MD5 6568454185ed529f09387681ed94415c
BLAKE2b-256 1ef298174ce66afb7e637c38e825a27eb5a8658f7376fb7e25df023f67d3fad5

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp37-cp37m-macosx_10_6_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp37-cp37m-macosx_10_6_intel.whl
  • Upload date:
  • Size: 155.7 kB
  • Tags: CPython 3.7m, macOS 10.6+ intel
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp37-cp37m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 7c4e0530783e6f34f32e5263a4b2f927048918c438db4457aaac1617e20695d6
MD5 1264800e6c7757a5e87c6c1b7fdef1c4
BLAKE2b-256 94030a4f9c008874daa6afd0f9e601e804744ef67fefd3a2705d002b8f0bf183

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 459.5 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 09bcbed6c970209bd38d34c575f2352330b7f1956c6e73a1411ee070ba796e29
MD5 1ec297af1108368dc7c9e919ac1d8670
BLAKE2b-256 5f81762e6e26684129aceb363d038f354a5a9a30d411a8299fe79c4cc8e12b28

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp36-cp36m-macosx_10_6_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp36-cp36m-macosx_10_6_intel.whl
  • Upload date:
  • Size: 159.5 kB
  • Tags: CPython 3.6m, macOS 10.6+ intel
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp36-cp36m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 bea63ac59d7d1d33e761f6f801a76ddf588db1892dd4581a331b8e36cbad3f7e
MD5 b7a80c47f846eeea15c3d10bad4e7e0c
BLAKE2b-256 3783bf2fcf27d017549bb5da156706ea86e37d6d446a9d4101c00f3c93ba8ac4

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 450.0 kB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 23a9ee385c2c16be14fa12ab02872dd7f7bd0d242cc35bdfbefd5966c31a7fc0
MD5 c06d7ea98c9b47749bf70f8b969da560
BLAKE2b-256 44db8b084aa8e2af7d34efaaee2837d4a488694e53a7c77f16d52a8b91dca4a0

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp35-cp35m-macosx_10_6_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp35-cp35m-macosx_10_6_intel.whl
  • Upload date:
  • Size: 156.5 kB
  • Tags: CPython 3.5m, macOS 10.6+ intel
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp35-cp35m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 b7ac8c6f04690c53c6bb6b9c6553806b667b005a42475636578e675bf44ed9b9
MD5 a948a714d0237a0ef5804a31796ba911
BLAKE2b-256 2179dbdc51da0eda9eb410054ec8550962a2948fef220130d0b5c3cf9fdeb8f1

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp34-cp34m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 449.6 kB
  • Tags: CPython 3.4m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 e0ab9becd15abae75675500c499ae55dfb4e622c72d6468d114e758b9ed84792
MD5 1a4ddb396db549bcda450eb44cad6df9
BLAKE2b-256 fdd4b59bb1b496fbbb6425321816f8c4f7732e5f046785f85184579024e36a0e

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp34-cp34m-macosx_10_6_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp34-cp34m-macosx_10_6_intel.whl
  • Upload date:
  • Size: 154.7 kB
  • Tags: CPython 3.4m, macOS 10.6+ intel
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp34-cp34m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 38fd9123f5a4b3020d233d0e31962cef225436159746fb1f5395f0c3b5974ed3
MD5 4fe0615e0598b9d202ce7a8111307746
BLAKE2b-256 9f59a23bb18e0c7803eb59f7dfa538825407d7d58660848750054abd176b6105

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp27-cp27mu-manylinux1_x86_64.whl
  • Upload date:
  • Size: 430.6 kB
  • Tags: CPython 2.7mu
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 7d0277b1a2c57f89c72edefb933172eb6c8b9a95516d7885971bba21d2bfc259
MD5 d8a404a775a33b476198343a9d94acfb
BLAKE2b-256 7a9dd198dabbd8af0be17d59855e1715e2e0fed362640aac8b03291ae2608880

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp27-cp27m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 430.5 kB
  • Tags: CPython 2.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 81a5572c8f7dda5fab292449060b8aa84c98da9925f931fe528fc2355f94d4f3
MD5 29f1e9c68fa50be3785d2fa80d900e60
BLAKE2b-256 d6a835ee1ccef921027d59b8a6a385db8b7c6b2ee043e5d34619411d19cc77e7

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-2.0.0-cp27-cp27m-macosx_10_6_intel.whl.

File metadata

  • Download URL: sourmash-2.0.0-cp27-cp27m-macosx_10_6_intel.whl
  • Upload date:
  • Size: 155.9 kB
  • Tags: CPython 2.7m, macOS 10.6+ intel
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.0

File hashes

Hashes for sourmash-2.0.0-cp27-cp27m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 e8125f4c7ca75c9eda9c4fd18b41d20865ddee53fe5d3bae3cab9e6d444da579
MD5 3d6f2559a368943b0898fa823c7d2a00
BLAKE2b-256 ec085801f7228bfa30dd8d60fe9467ea00361d95a1131f3c9b606b13196497f3

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page