Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.


CTB Feb 2021

Release history Release notifications | RSS feed

This version

4.0.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.0.0.tar.gz (7.9 MB view details)

Uploaded Source

Built Distributions

sourmash-4.0.0-py2.py3-none-manylinux2014_s390x.whl (1.6 MB view details)

Uploaded Python 2 Python 3

sourmash-4.0.0-py2.py3-none-manylinux2014_ppc64le.whl (1.4 MB view details)

Uploaded Python 2 Python 3

sourmash-4.0.0-py2.py3-none-manylinux2014_aarch64.whl (1.2 MB view details)

Uploaded Python 2 Python 3

sourmash-4.0.0-py2.py3-none-manylinux2010_x86_64.whl (1.2 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ x86-64

sourmash-4.0.0-py2.py3-none-manylinux1_x86_64.whl (1.2 MB view details)

Uploaded Python 2 Python 3

sourmash-4.0.0-py2.py3-none-macosx_10_11_x86_64.whl (541.6 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-4.0.0.tar.gz.

File metadata

  • Download URL: sourmash-4.0.0.tar.gz
  • Upload date:
  • Size: 7.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/54.0.0 requests-toolbelt/0.9.1 tqdm/4.58.0 CPython/3.8.8

File hashes

Hashes for sourmash-4.0.0.tar.gz
Algorithm Hash digest
SHA256 72533b69ebe5ba592679377537426173a75aebbc5d2572b925115aa6769a689c
MD5 7b7d2080864bbd9285795703f03a8528
BLAKE2b-256 d882ee1d4f9acccac94174fc2e44f6120af0aa7cba5554c06604605fa635dbc6

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0-py2.py3-none-manylinux2014_s390x.whl.

File metadata

  • Download URL: sourmash-4.0.0-py2.py3-none-manylinux2014_s390x.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0-py2.py3-none-manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 b0fab42dc1222eba53db24e25086b9a0df12b82647656bbd537e73db1e57c538
MD5 3e86cc1c3e59caf0a62cf2d599a3ded0
BLAKE2b-256 39ab37bcd43008f101e30a0b7b1de85d310ac4d6878f46bd08df2135378d10d3

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0-py2.py3-none-manylinux2014_ppc64le.whl.

File metadata

  • Download URL: sourmash-4.0.0-py2.py3-none-manylinux2014_ppc64le.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0-py2.py3-none-manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 eaa6fd82768408a71681281e54963b4c48dfd19dddf16876df525baec7418e4b
MD5 0dfbf615f42eb19e9d221dfb9004e895
BLAKE2b-256 fc63ac16e535550315cdcb0b4a50dde6212cb593d244f30a0dcab24a092234c8

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0-py2.py3-none-manylinux2014_aarch64.whl.

File metadata

  • Download URL: sourmash-4.0.0-py2.py3-none-manylinux2014_aarch64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0-py2.py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 ce76839df5f01c7f8106d16e217b58fdd25edc5112fe5194cfe16880b142c404
MD5 611412ce5d7238251c4fd45c8c1bec82
BLAKE2b-256 6d86d2515de10a44bab83d5a17bfd2f9bb894bd0cb698483fbed0c63d6a48b8b

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0-py2.py3-none-manylinux2010_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0-py2.py3-none-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0-py2.py3-none-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 55b0551cf64c98ff591d4d8729ccb6e1e89a655a33366e1f52013163a66e7094
MD5 d9c0dd3f80bc2e71e479a13d50c194cc
BLAKE2b-256 08c62d4a2377173483c9f4f65c18d6f9af14256724bc4b5ac3cc42e987a162cc

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0-py2.py3-none-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0-py2.py3-none-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0-py2.py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 432a56bdf9d27da69d190f7094429cdd65b35dabaf3879dfe2828d763bc4aa8b
MD5 eb4a83bb07f80bdd3fa75f4fda686904
BLAKE2b-256 01d12ca6fd454e3d9f65813640425ebf19281a86f36f64913ad4ccf0d9d5d049

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0-py2.py3-none-macosx_10_11_x86_64.whl
  • Upload date:
  • Size: 541.6 kB
  • Tags: Python 2, Python 3, macOS 10.11+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 909dad93fd4a2098ccf4da77b6e6e8c8ffc035a098f1064e372e4a0655660254
MD5 8808474c23d9cd38097384a846848f7c
BLAKE2b-256 275362c3e6daae21f97a02b96f2e351c237dd0957613d3ff5cc9029bba4b680c

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page