tools for comparing DNA sequences with MinHash sketches

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp
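
To give a feel for what sketching and comparing involve, here is a minimal pure-Python illustration of the general bottom-k MinHash idea. It is a sketch under stated assumptions, not sourmash's implementation: the function names, the SHA-256 stand-in hash, and the sketch size of 500 are all hypothetical choices made for this example, and sourmash's own hashing and sketch parameters differ. Only the k-mer size of 31 is taken from the usage above.

```python
import hashlib

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def kmers(seq, k):
    """Yield every overlapping k-mer of seq, uppercased."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def canonical(kmer):
    """Return the smaller of a k-mer and its reverse complement,
    so that both strands of the DNA give the same value."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer.
    SHA-256 is an illustrative stand-in, not the hash sourmash uses."""
    digest = hashlib.sha256(kmer.encode("ascii")).digest()
    return int.from_bytes(digest[:8], "big")


def sketch(seq, k=31, num=500):
    """Bottom-k MinHash sketch: the `num` smallest distinct k-mer hashes."""
    hashes = {hash_kmer(canonical(km)) for km in kmers(seq, k)}
    return sorted(hashes)[:num]


def jaccard_estimate(a, b, num=500):
    """Estimate Jaccard similarity from two bottom-k sketches by looking
    at the `num` smallest hashes of their union."""
    merged = sorted(set(a) | set(b))[:num]
    if not merged:
        return 0.0
    shared = set(a) & set(b)
    return sum(1 for h in merged if h in shared) / len(merged)


if __name__ == "__main__":
    seq = "ACGTTGCAAGGCTTAACCGGTTAGCATCGATCGGATCCA"
    sk = sketch(seq)
    print(jaccard_estimate(sk, sk))  # identical sketches give 1.0
```

Because only the smallest hashes are kept, two sketches can be compared without revisiting the original sequences, which is what makes this family of methods fast on large data sets.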

sourmash 1.0 is published in JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027).

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff on Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

Please ask questions and file issues on GitHub.

Development

Development happens on GitHub at dib-lab/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point. You can also run it with python -m sourmash, or run pip install -e /path/to/repo for a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require pytest and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.


CTB Feb 2021


Download files


Source Distribution

  • sourmash-4.0.0rc1.tar.gz (7.9 MB): Source

Built Distributions

  • sourmash-4.0.0rc1-py2.py3-none-manylinux2014_s390x.whl (1.6 MB): Python 2, Python 3
  • sourmash-4.0.0rc1-py2.py3-none-manylinux2014_ppc64le.whl (1.4 MB): Python 2, Python 3
  • sourmash-4.0.0rc1-py2.py3-none-manylinux2014_aarch64.whl (1.2 MB): Python 2, Python 3
  • sourmash-4.0.0rc1-py2.py3-none-manylinux2010_x86_64.whl (1.2 MB): Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • sourmash-4.0.0rc1-py2.py3-none-manylinux1_x86_64.whl (1.2 MB): Python 2, Python 3
  • sourmash-4.0.0rc1-py2.py3-none-macosx_10_11_x86_64.whl (541.9 kB): Python 2, Python 3, macOS 10.11+ x86-64

File details

Details for the file sourmash-4.0.0rc1.tar.gz.

File metadata

  • Download URL: sourmash-4.0.0rc1.tar.gz
  • Upload date:
  • Size: 7.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1.tar.gz
Algorithm Hash digest
SHA256 d9416268d9bc2c7585163beac08f9b16f451e770219a83d0b63cab021993124f
MD5 6739128b93e41a750d246a9c0810493d
BLAKE2b-256 301bdab8a04bebe5e86cb288d8c845f3c52251ba74054f082d0df2b339c62e1e
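
A published digest like the SHA256 value above can be checked against a downloaded file before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical and not part of sourmash or pip.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks
    so that large archives do not need to fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest,
    ignoring surrounding whitespace and letter case."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, assuming the source distribution was saved in the current directory, `matches_published_digest("sourmash-4.0.0rc1.tar.gz", "d9416268d9bc2c7585163beac08f9b16f451e770219a83d0b63cab021993124f")` should return True for an intact download.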


File details

Details for the file sourmash-4.0.0rc1-py2.py3-none-manylinux2014_s390x.whl.

File metadata

  • Download URL: sourmash-4.0.0rc1-py2.py3-none-manylinux2014_s390x.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1-py2.py3-none-manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 d1a65ccb558eda1c1af289010e98f89b5c0c7b8b7769d99114700a5705728539
MD5 f7f2b6c16b99e932f741466db36144db
BLAKE2b-256 fa008a5cf36d2182652859b09c17d2a4e8f94c7ef282798437fd6a2a5d190592


File details

Details for the file sourmash-4.0.0rc1-py2.py3-none-manylinux2014_ppc64le.whl.

File metadata

  • Download URL: sourmash-4.0.0rc1-py2.py3-none-manylinux2014_ppc64le.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1-py2.py3-none-manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 fcd3ddeeb06cc4182935103a568052a380e57729876fd0ff0b5fb6072b8fc662
MD5 e94dd9c6dbb87c55054bf5192005b24b
BLAKE2b-256 e8138f2d896e0d6d3611ebafcb3e816c4169fb143b1826d5bdccede500889bf1


File details

Details for the file sourmash-4.0.0rc1-py2.py3-none-manylinux2014_aarch64.whl.

File metadata

  • Download URL: sourmash-4.0.0rc1-py2.py3-none-manylinux2014_aarch64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1-py2.py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 7540c31df2d789a957cabfb64264acbf241e920ed922ab2834091fff4de9a654
MD5 135253ef8bf17a567ea10aacbce4d749
BLAKE2b-256 625efb17ec3a7905f69621dc880a75a9ab6b9db7d5448205d8dd976ac7a29194


File details

Details for the file sourmash-4.0.0rc1-py2.py3-none-manylinux2010_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0rc1-py2.py3-none-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1-py2.py3-none-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 25f83d2e7a9083ca496aa6e8d7f4502f964ea8d1f0567068c905712e9dfc031f
MD5 2052d21313580302d276827a6cae6e31
BLAKE2b-256 4798eb9aebb7c5fee86a0c9a94f696702be88d8b720adf47f0d4d22fffea38d4


File details

Details for the file sourmash-4.0.0rc1-py2.py3-none-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0rc1-py2.py3-none-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1-py2.py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 9c8ea2119e0e67cce882b69e1a29e778958b340791bb19104e82e5da46bb6536
MD5 e9127e0f5982e1885c4572e5403f9199
BLAKE2b-256 36c12db2bb748c19da88a81a0eef37105b713ebb24a8b445d1a25df1e17d6c27


File details

Details for the file sourmash-4.0.0rc1-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0rc1-py2.py3-none-macosx_10_11_x86_64.whl
  • Upload date:
  • Size: 541.9 kB
  • Tags: Python 2, Python 3, macOS 10.11+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/51.3.3 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0rc1-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 5b6b1205485898eeef845b38dd6d5b71880f37d38fc465dc1c2e003b9425cd63
MD5 9cf2e69b020fb537687b1c460b4ba45c
BLAKE2b-256 03de4607eb5f93e9e01b9da7385be6b96c462286570e4278bc70158b4e3bc1ea
