tools for comparing DNA sequences with MinHash sketches

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp
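
To give a feel for what the sketch and compare steps estimate, here is a minimal pure-Python illustration of bottom-k MinHash over canonical k-mers. It is not sourmash's implementation, which lives in the Rust core and may differ in hashing and defaults; the function names, the BLAKE2b hash, and the small `k` and `num` values are assumptions chosen for the example.

```python
import hashlib

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical_kmers(seq, k):
    """Yield each k-mer in its canonical form (the smaller of the k-mer
    and its reverse complement), so both strands give the same k-mers."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        yield min(kmer, reverse_complement(kmer))


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (BLAKE2b is an illustrative choice)."""
    digest = hashlib.blake2b(kmer.encode("ascii"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def minhash_sketch(seq, k=5, num=50):
    """Keep the `num` smallest distinct k-mer hashes as the sketch."""
    hashes = {hash_kmer(kmer) for kmer in canonical_kmers(seq, k)}
    return sorted(hashes)[:num]


def estimate_jaccard(sketch_a, sketch_b, num=50):
    """Estimate Jaccard similarity from two bottom-k sketches: take the
    `num` smallest hashes of the merged sketches and count how many of
    them occur in both."""
    merged = sorted(set(sketch_a) | set(sketch_b))[:num]
    if not merged:
        return 0.0
    shared = set(sketch_a) & set(sketch_b)
    return sum(1 for h in merged if h in shared) / len(merged)
```

A sequence and its reverse complement produce the same sketch under this scheme, so their estimated similarity is 1.0; sequences with no k-mers in common score 0.0.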

sourmash 1.0 is published in JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027).

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff on Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.
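
If you want to check an existing environment against these requirements before installing, a small script along the following lines may help. It is only a sketch: the helper names are hypothetical, and it assumes each package's import name matches the name listed above.

```python
import importlib.util
import sys

# Base requirements and minimum Python version as listed above.
BASE_REQUIREMENTS = ("screed", "cffi", "numpy", "matplotlib", "scipy")
MIN_PYTHON = (3, 7)


def python_ok(version_info=sys.version_info):
    """Return True if the interpreter version is at least MIN_PYTHON."""
    return tuple(version_info[:2]) >= MIN_PYTHON


def missing_requirements(names=BASE_REQUIREMENTS):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    print("Missing packages:", missing_requirements() or "none")
```

This only checks that the modules can be located; it does not verify package versions.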

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ conda activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

Please ask questions and file issues on GitHub.

Development

Development happens on GitHub at dib-lab/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; you can also run it with python -m sourmash. For a developer install in a virtual environment, run pip install -e /path/to/repo.
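
When scripting against a developer install, it can be convenient to invoke the module form from Python so that the same interpreter (and virtual environment) is used. The following is a minimal sketch; `sourmash_command` and `run_sourmash` are hypothetical helper names, not part of sourmash.

```python
import subprocess
import sys


def sourmash_command(*args):
    """Build the argument list for `python -m sourmash ...` using the
    interpreter that is running this script."""
    return [sys.executable, "-m", "sourmash", *args]


def run_sourmash(*args):
    """Run sourmash as a subprocess and return the CompletedProcess.
    Requires sourmash to be installed in the current environment."""
    return subprocess.run(
        sourmash_command(*args), capture_output=True, text=True, check=False
    )
```

For example, `run_sourmash("--help").stdout` would hold the help text if sourmash is installed.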

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require pytest and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.


CTB Feb 2021
