Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Gitter Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on Github, or ask in our chat.

Development

Development happens on github at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021

Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.2.2.tar.gz (8.0 MB view details)

Uploaded Source

Built Distributions

sourmash-4.2.2-py2.py3-none-manylinux2014_s390x.whl (1.5 MB view details)

Uploaded Python 2 Python 3

sourmash-4.2.2-py2.py3-none-manylinux2014_ppc64le.whl (1.4 MB view details)

Uploaded Python 2 Python 3

sourmash-4.2.2-py2.py3-none-manylinux2014_aarch64.whl (1.2 MB view details)

Uploaded Python 2 Python 3

sourmash-4.2.2-py2.py3-none-manylinux2010_x86_64.whl (1.3 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ x86-64

sourmash-4.2.2-py2.py3-none-manylinux1_x86_64.whl (1.3 MB view details)

Uploaded Python 2 Python 3

sourmash-4.2.2-py2.py3-none-macosx_10_11_x86_64.whl (590.1 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-4.2.2.tar.gz.

File metadata

  • Download URL: sourmash-4.2.2.tar.gz
  • Upload date:
  • Size: 8.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2.tar.gz
Algorithm Hash digest
SHA256 b1b1650c15b57691890a44b756f743a5b5602891e92407839d46ba085c2c0dbc
MD5 99d497ddfdea6b2e3d7ef0772e03a498
BLAKE2b-256 d1974d401ba2fbd124d8c2f5ecdc5b4fbd1f51cc5ba4930aac0a7ea9afb5466a

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.2-py2.py3-none-manylinux2014_s390x.whl.

File metadata

  • Download URL: sourmash-4.2.2-py2.py3-none-manylinux2014_s390x.whl
  • Upload date:
  • Size: 1.5 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2-py2.py3-none-manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 075a199563c17e17b36a4e7cd4c6642debec2bedd85615fe163af701418cf760
MD5 db61d1efc9b2ceabcd067fbad9d72e61
BLAKE2b-256 63b4e2b0decfe363512dbd990b04dc5e02d70c3d0a8cf9f04f480bcdd5903b99

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.2-py2.py3-none-manylinux2014_ppc64le.whl.

File metadata

  • Download URL: sourmash-4.2.2-py2.py3-none-manylinux2014_ppc64le.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2-py2.py3-none-manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 23c87e7a17250d541d584d29c4cee6a42f46425bd414a625cc9e51d6058f8fc9
MD5 2345bd164941e03bc2888b2bf66cdefc
BLAKE2b-256 f5a4f05679727dd1357d855ab517122e75ab79762275f11bf6c8b04c33ba5ff9

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.2-py2.py3-none-manylinux2014_aarch64.whl.

File metadata

  • Download URL: sourmash-4.2.2-py2.py3-none-manylinux2014_aarch64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2-py2.py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 00aba354ddd41b9e6166a0bcc5f4998d55e3d883e496087907c25e19a4c4b4d5
MD5 1588142ebcd03f5d606729c511d8130a
BLAKE2b-256 6d981a183064ce110219ed11c8a7cc5b972361107402f40d44bf00a1dc975ed9

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.2-py2.py3-none-manylinux2010_x86_64.whl.

File metadata

  • Download URL: sourmash-4.2.2-py2.py3-none-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2-py2.py3-none-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 be074fda540ac014449a5e845f38caaa644eeb6cea43af985b5d43e6a3e1e57a
MD5 59ce692cb1774d18126fbb1b0e192ac8
BLAKE2b-256 45a8e55466ba0a166df3d799df20c728c98f18861e1aa52a088db286550b0c7e

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.2-py2.py3-none-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-4.2.2-py2.py3-none-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2-py2.py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 4dc50ff7e19c6fc71a6f9d60be8d2dbcefb82986685026b7fc37f8847059549b
MD5 086e7ab10d607f733fbc2ed78338e013
BLAKE2b-256 b1f537f7779438c65d747bcded5a6db5114ea7c57ede810e903b94f0eab52bda

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.2-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

  • Download URL: sourmash-4.2.2-py2.py3-none-macosx_10_11_x86_64.whl
  • Upload date:
  • Size: 590.1 kB
  • Tags: Python 2, Python 3, macOS 10.11+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.2-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 390d0ef417a6521d06d84425c3e06e99a3abb3906823315344879206c920fd6c
MD5 18ea054aa11a1a362308d39cd3de741b
BLAKE2b-256 5e8372785a3fa5a05de6816cf3e03853834bef173479747ef509d027e15b5556

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page