Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Gitter Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on Github, or ask in our chat.

Development

Development happens on github at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021

Release history Release notifications | RSS feed

This version

4.2.4

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.2.4.tar.gz (8.0 MB view details)

Uploaded Source

Built Distributions

sourmash-4.2.4-py2.py3-none-musllinux_1_1_x86_64.whl (1.4 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ x86-64

sourmash-4.2.4-py2.py3-none-musllinux_1_1_aarch64.whl (1.4 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ ARM64

sourmash-4.2.4-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl (1.7 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ s390x

sourmash-4.2.4-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (1.5 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ppc64le

sourmash-4.2.4-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (1.4 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ARM64

sourmash-4.2.4-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_12_x86_64.manylinux2010_x86_64.whl (1.4 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ x86-64 manylinux: glibc 2.5+ x86-64

sourmash-4.2.4-py2.py3-none-macosx_10_11_x86_64.whl (615.0 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-4.2.4.tar.gz.

File metadata

  • Download URL: sourmash-4.2.4.tar.gz
  • Upload date:
  • Size: 8.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.4.tar.gz
Algorithm Hash digest
SHA256 d50f9e666c05d6c9b39c804ccc2e5c9d322f51bd7d4f8b53b4ca28d129fb133c
MD5 e0614f46b928cb9f344bd5d7dc53a4b9
BLAKE2b-256 15f368c124a4f88e694db68ea00297a2c287eb856517eb5d7b103463685449a2

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-musllinux_1_1_x86_64.whl.

File metadata

  • Download URL: sourmash-4.2.4-py2.py3-none-musllinux_1_1_x86_64.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: Python 2, Python 3, musllinux: musl 1.1+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-musllinux_1_1_x86_64.whl
Algorithm Hash digest
SHA256 b7c30a9c82432b82775bb99dd4b5c4a874cd3b702c31297be6cf4d227cd16d1b
MD5 d222df2c5ce8d22f3d934d2bfec69ff1
BLAKE2b-256 15d8094f9d5ebabee6e1cb0b011bb140fe7cdd40793e3bef7293b6e983a3f83a

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-musllinux_1_1_aarch64.whl.

File metadata

  • Download URL: sourmash-4.2.4-py2.py3-none-musllinux_1_1_aarch64.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: Python 2, Python 3, musllinux: musl 1.1+ ARM64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-musllinux_1_1_aarch64.whl
Algorithm Hash digest
SHA256 b21abdb5bcbb29320aa693adc45196c7fd269cf7c2b4c951a0362a57c0ff42d8
MD5 9cc4ef94f8654cbdc13474aed7499aab
BLAKE2b-256 30a4659585e95d8a8c818dd0ddb2a0ac84788861b8ef1b2c3e42a840672927ec

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl.

File metadata

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 e84a2940f80f06ce1313a2ae8de85dc60b323d0af9e608ef115df30031287d60
MD5 8d659b4697e091611881ccc86d7742ad
BLAKE2b-256 d9ab7f42a0e2990642319a35ce34b0d496846e87054f8b42f5dfc8f417d9bd40

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl.

File metadata

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 101f6cd6a002a425d5b4ca4805bb15488ae3045f2f51c4fce8b73055e22b20c6
MD5 90539d69a52bf3277f17989a1b3b4f48
BLAKE2b-256 f51ee5374c07324557ada4d77d64c58479e2d062564494162ec84cb1a24f6a68

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 f04e984734e3bb48fde127eb5bb18743fac74cf253fd0498321daf11cc444abd
MD5 a4d03297819d48e86ca8de1b0717e122
BLAKE2b-256 fc7b77db53c6b057155a0ae4f463bc4e049d9424be6d5ecf3f128b353caea8e3

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_12_x86_64.manylinux2010_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_12_x86_64.manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 470fe6d3bafb7bcadffba84792ddd25e45aabc500cb9ffda277d24b283e34057
MD5 6b6417b5572146e61674b621157b3103
BLAKE2b-256 dd89635b0bbde4a3070731f695fc995e2fef217fb55ca11ca3bf04a5c2ef8fb2

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.2.4-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

  • Download URL: sourmash-4.2.4-py2.py3-none-macosx_10_11_x86_64.whl
  • Upload date:
  • Size: 615.0 kB
  • Tags: Python 2, Python 3, macOS 10.11+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/54.1.1 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.2.4-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 a3fdfdf1f5272223d6ab0607e8c186e8320e837cd23dede22321a2fd9fd50e61
MD5 0dbab8d30ff946b0f5db20f320de7f5f
BLAKE2b-256 4f171035508c758cc3351e31c4a387eadd15f88b428b2925cf0267c86530eae4

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page