tools for comparing DNA sequences with MinHash sketches

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp
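
To give a feel for what sketching and comparing mean, here is a minimal pure-Python illustration of a bottom-k MinHash sketch and the Jaccard estimate derived from it. This is not sourmash's implementation or API: the function names, the SHA-1 based hash, and the small `k` and `num` defaults are all assumptions chosen to keep the example self-contained.

```python
import hashlib


def kmers(seq, k):
    """Yield every k-mer (substring of length k) of seq, uppercased."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def sketch(seq, k=5, num=20):
    """Return the `num` smallest 64-bit hashes of the distinct k-mers in seq.

    Illustrative only: the hash here is the first 8 bytes of SHA-1,
    picked because it ships with Python.
    """
    hashes = {
        int.from_bytes(hashlib.sha1(km.encode()).digest()[:8], "big")
        for km in kmers(seq, k)
    }
    return sorted(hashes)[:num]


def jaccard_estimate(a, b, num=20):
    """Estimate Jaccard similarity from two bottom-`num` sketches.

    Take the `num` smallest hashes of the combined sketches and count
    how many of them appear in both inputs.
    """
    union = sorted(set(a) | set(b))[:num]
    if not union:
        return 0.0
    shared = set(a) & set(b)
    return sum(1 for h in union if h in shared) / len(union)


if __name__ == "__main__":
    x = sketch("ACGTACGTTAGCCGATAGGCTTAACGGATCCA")
    y = sketch("ACGTACGTTAGCCGATAGGCTTAACGGTTCCA")
    print(f"estimated similarity: {jaccard_estimate(x, y):.2f}")
```

The sketches stay small no matter how long the input sequences are, which is what makes comparing many data sets at once practical.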

sourmash 1.0 is published in JOSS; if you use sourmash, please cite that paper (doi: 10.21105/joss.00027).

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff on Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.
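
If you are installing by hand rather than with conda, a quick pre-flight check along these lines can confirm that the interpreter is new enough and that the base requirements are importable. It is only a sketch: the helper names `python_ok` and `missing_requirements` are made up for this example, and it assumes each package's import name matches the name listed above.

```python
import sys
from importlib.util import find_spec

# Base requirements as listed above; import names are assumed to match.
BASE_REQUIREMENTS = ("screed", "cffi", "numpy", "matplotlib", "scipy")


def python_ok(version_info=sys.version_info):
    """Return True if the given version is Python 3.7 or later."""
    return tuple(version_info[:2]) >= (3, 7)


def missing_requirements(names=BASE_REQUIREMENTS):
    """Return the names that cannot be found on the import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.7 or later is required")
    missing = missing_requirements()
    if missing:
        print("missing packages:", ", ".join(missing))
    else:
        print("all base requirements found")
```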

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on GitHub, or ask in our chat.

Development

Development happens on GitHub at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point. You can also run it with python -m sourmash, or run pip install -e /path/to/repo for a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require pytest and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021
