Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Documentation Build Status PyPI codecov DOI License: 3-Clause BSD


Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 3.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a Rust environment (for the extension code). We suggest using rustup to install the Rust environment:

curl https://sh.rustup.rs -sSf | sh

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda you will need to add the bioconda channel as well as the other channels bioconda depends on. Once you have setup bioconda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB Jan 2020

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-3.3.1.tar.gz (7.3 MB view details)

Uploaded Source

Built Distributions

sourmash-3.3.1-py2.py3-none-manylinux2014_ppc64le.whl (1.3 MB view details)

Uploaded Python 2 Python 3

sourmash-3.3.1-py2.py3-none-manylinux2014_aarch64.whl (1.1 MB view details)

Uploaded Python 2 Python 3

sourmash-3.3.1-py2.py3-none-manylinux2010_x86_64.whl (1.2 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ x86-64

sourmash-3.3.1-py2.py3-none-manylinux1_x86_64.whl (1.2 MB view details)

Uploaded Python 2 Python 3

sourmash-3.3.1-py2.py3-none-macosx_10_11_x86_64.whl (537.1 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-3.3.1.tar.gz.

File metadata

  • Download URL: sourmash-3.3.1.tar.gz
  • Upload date:
  • Size: 7.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0.post20200518 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for sourmash-3.3.1.tar.gz
Algorithm Hash digest
SHA256 8d3404f095a90ed3365c429104f3a9526ef3a3b4db8c9bd944f4b0cde8b59f28
MD5 f6bc5ea283f2c0a980c46c6bf1a8b4a3
BLAKE2b-256 75c39331f7a4d02b3fbb24297f478156e0d8811283658cf4158d9ece311b9ca5

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.3.1-py2.py3-none-manylinux2014_ppc64le.whl.

File metadata

  • Download URL: sourmash-3.3.1-py2.py3-none-manylinux2014_ppc64le.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0.post20200518 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for sourmash-3.3.1-py2.py3-none-manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 3e84ac9a01231cafd8b60d5d9bfd0afcd9eaf8836cd433ffc7f5e15e3b116790
MD5 24dbde97c51099b129ee10b7272c1855
BLAKE2b-256 c7327da2a5f4aa61f865a2b4a2102f3e0efdb4b9bc890dd2085b2ad5059b1754

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.3.1-py2.py3-none-manylinux2014_aarch64.whl.

File metadata

  • Download URL: sourmash-3.3.1-py2.py3-none-manylinux2014_aarch64.whl
  • Upload date:
  • Size: 1.1 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0.post20200518 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for sourmash-3.3.1-py2.py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 827c79385ebf83f57aae1156637f32da769a78ffd9727528432175e9c53ee10e
MD5 5a7d6b103556b3a779c9f4c9f5c59d92
BLAKE2b-256 ab2e752ac6dd93707b562819d39ec827557195053f4ea09d7e5e9e9817717282

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.3.1-py2.py3-none-manylinux2010_x86_64.whl.

File metadata

  • Download URL: sourmash-3.3.1-py2.py3-none-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0.post20200518 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for sourmash-3.3.1-py2.py3-none-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 5d184bfb080833e70b0103f1ec977d3123f387982a1996a615955f356163f6e0
MD5 c214818e4c769d64157744f37b3eab2b
BLAKE2b-256 664c75f46674badda560d7d72d580879c3adc49d35aac02a8107b001de3ae435

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.3.1-py2.py3-none-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-3.3.1-py2.py3-none-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0.post20200518 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for sourmash-3.3.1-py2.py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 894749957b0b0e5b40ac0f5ed9bba94f93ae6da26e6cea2ea7e93ed4ae664205
MD5 c815a54ea747c87a913204d8e03ea910
BLAKE2b-256 c4a4ccca79dbd707f96cd4f8a52f92ea5d92b8472efd98df13292c16b768f947

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.3.1-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

  • Download URL: sourmash-3.3.1-py2.py3-none-macosx_10_11_x86_64.whl
  • Upload date:
  • Size: 537.1 kB
  • Tags: Python 2, Python 3, macOS 10.11+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0.post20200518 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.6

File hashes

Hashes for sourmash-3.3.1-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 bd7320cc72984834136f409e2a23f7f42d4caca8bf3ff21b194925f8810588bf
MD5 a95e13fad72a94d89b120a51cd77b057
BLAKE2b-256 d913df16036a581631e1cae0a7a6aea596a6860bd9fbce939351d98799df8364

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page