Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Gitter Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on Github, or ask in our chat.

Development

Development happens on github at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021

Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.4.0.tar.gz (8.1 MB view details)

Uploaded Source

Built Distributions

sourmash-4.4.0-py2.py3-none-musllinux_1_1_x86_64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ x86-64

sourmash-4.4.0-py2.py3-none-musllinux_1_1_aarch64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ ARM64

sourmash-4.4.0-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.5 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ x86-64

sourmash-4.4.0-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl (1.7 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ s390x

sourmash-4.4.0-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (1.7 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ppc64le

sourmash-4.4.0-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (1.5 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ARM64

sourmash-4.4.0-py2.py3-none-macosx_11_0_arm64.whl (727.6 kB view details)

Uploaded Python 2 Python 3 macOS 11.0+ ARM64

sourmash-4.4.0-py2.py3-none-macosx_10_11_x86_64.whl (757.2 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-4.4.0.tar.gz.

File metadata

  • Download URL: sourmash-4.4.0.tar.gz
  • Upload date:
  • Size: 8.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.13

File hashes

Hashes for sourmash-4.4.0.tar.gz
Algorithm Hash digest
SHA256 00a9a0dd26e538c2518db3e2aa5fd4b4bd8be9650f7427d3c93a8022798412de
MD5 cb98792445025e5a7446067b48af40a3
BLAKE2b-256 fbf371e1380e27c433542c84734d332940a6e0aedbcd6d2b67fb840fd0680ad1

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-musllinux_1_1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-musllinux_1_1_x86_64.whl
Algorithm Hash digest
SHA256 b81970f4dccdddafad2534ab4b0382a7c75d9c05ea7a30cfde5349a23887168b
MD5 fd08d64da8bc1b84bb58fbc2177d25f2
BLAKE2b-256 efe969585bae6561b9790e5468b6e81fb0aa4189b64abc3cc528a651f44444eb

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-musllinux_1_1_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-musllinux_1_1_aarch64.whl
Algorithm Hash digest
SHA256 bc255693484685bd85d8634af69b26383a97a6b76c252b45bbe74eb8031c9425
MD5 2fcfb1db41ff5a7502c7ef2c18dcf89e
BLAKE2b-256 7fa6b754beb615c49671a3cff2209684c4dde270229fd3750b4dfdc4bd739f4d

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 22367c600862ba29207ce045855a933f770a16734b005089bba3e32348e33cda
MD5 72078d3585bd889aef0138e3a6cbae7e
BLAKE2b-256 e73e47efea6f6b1bcd32dc63e284ebc5646b68ed13cc60bc154885e9cd70b60a

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 72f93af3d0af5bd7e1323e7c436db80e2b3621bb53b2f240b63c116ac0acdcc5
MD5 a0388b8d28f94301bac265479d3ac85e
BLAKE2b-256 4b1826248839c9acb27d1e7c604a73d3509f372f44b7397cef6921ab7606c9c3

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 ea4d1cc5b1255fdd448e3317afe0f3ca3315678f82aeabf57e521d6b6a0ae9f2
MD5 2f2a93844d361c12fc417e8305b055fd
BLAKE2b-256 97b938ad5b2ee0c2513e64c08886e561db16f174770bd63738b5ae48235d2c2b

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 9089f7f16b626b239588bd9fd897efadf4d7fff59b0c69acf8cdf3c848e458d9
MD5 b830f98db2bbe1e18bf1d53b08f554f6
BLAKE2b-256 c8fd904120b849c929dba9b0d75d6f8fa01829e5cfd828a67025f714d686e0f5

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0c0076870ed0f7e95b2689cfc87db46c52ef29dfa714d284e68eb2461dc72128
MD5 5601d21aaf7d3392028953a6357f2a13
BLAKE2b-256 ed4dd0506e242532333c86ed61b599d8e22f5cb33b3f7dc9dde63e1fb97a9f01

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.0-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.0-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 40e2c974dbe84c41f4cd709a892eb89a1b2ec6422c83260acdb58570ea16ff68
MD5 94ffc0dae80838c12e3cf3c1359c34fe
BLAKE2b-256 72f5282b46a277a3ffdbf44befb54fa430cc3487aeddcb151869ed23d77b9d19

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page