Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Gitter Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on Github, or ask in our chat.

Development

Development happens on github at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.4.2.tar.gz (8.2 MB view details)

Uploaded Source

Built Distributions

sourmash-4.4.2-py2.py3-none-musllinux_1_1_x86_64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ x86-64

sourmash-4.4.2-py2.py3-none-musllinux_1_1_aarch64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ ARM64

sourmash-4.4.2-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ x86-64

sourmash-4.4.2-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl (1.8 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ s390x

sourmash-4.4.2-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (1.7 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ppc64le

sourmash-4.4.2-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ARM64

sourmash-4.4.2-py2.py3-none-macosx_11_0_arm64.whl (743.5 kB view details)

Uploaded Python 2 Python 3 macOS 11.0+ ARM64

sourmash-4.4.2-py2.py3-none-macosx_10_11_x86_64.whl (771.9 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-4.4.2.tar.gz.

File metadata

  • Download URL: sourmash-4.4.2.tar.gz
  • Upload date:
  • Size: 8.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.13

File hashes

Hashes for sourmash-4.4.2.tar.gz
Algorithm Hash digest
SHA256 6eb6beb70954b73340db7bb296441cfc4515979d39b589e3df058a46e78533fe
MD5 b2fea8707c852124a9f81db384f042a9
BLAKE2b-256 8cf5c102b494713e75066cf72460312c4e6200c813e59b3fe17bc1d94d0858da

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-musllinux_1_1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-musllinux_1_1_x86_64.whl
Algorithm Hash digest
SHA256 e6a0b021802f4c567fe672afd3d821b6b86190ae24cb652260322c4404d44522
MD5 56ccc0a3f30323b2bba2a34d64af43a6
BLAKE2b-256 38e394b6ca452e74214bac3f416476f82b50c96a65481fe8680dee14f6cb1b4e

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-musllinux_1_1_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-musllinux_1_1_aarch64.whl
Algorithm Hash digest
SHA256 3d5c204818e14db82b52cac1f85f84d968364a0090875e8f29122b6f91da06be
MD5 eb1301cd6282c8e889ed56e44a48d2a5
BLAKE2b-256 43e5484dfe188e66a217de5ad43be5fd44683953bc2f6ec2ec65ef9bb1513a1a

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 1097234cd1b6a0ba74694cf52462ed447256de5fa1aa4108fbed48943684a73f
MD5 ac8f5086c9ff2185dcaff8c960b03bca
BLAKE2b-256 b884f4c17d7fe6342072827d0dfd7f2a23207aa8f455dd8b07edd49c5d8055e8

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 c4b6ba6c0968dbc21b48fd8bdd80687c10c3a5df995574bfaba0969b2d2c02e8
MD5 2bb2a34bafb69071ebfce130ddbec567
BLAKE2b-256 e75f25d0e253481d5a4974cd2df769f5662e583c742e2cdab304e75f775c533b

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 3894d1a9fcb6c8c68ceadeccc60e41f9e5ca9aa749ab3d2b3fc05b72fd16a523
MD5 9f5101881711763bcd9027d19abfb68e
BLAKE2b-256 e93d83849aa17cea659646b3a9227dd5b4399ce241caed764b0d35d56e938aa1

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 bf10010fb52805eb4095091a689c3d4697d886389753cee4eb836eb752d9ef6b
MD5 2137423670f2b80ac351dc450fd2ce60
BLAKE2b-256 82fa932f25ed76d675bf79624c5728c9e0dc69bef12776cd846b6d031ff9a2b8

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 89c98530b2c0c0c6bc2f58cf5e73f7a990720c75c16c91a06748f0b707c704f0
MD5 a259d01e1ee38c46d77a21faf90f9c0f
BLAKE2b-256 8c99a4430e6f926d6e6f91de6064e960703fc385a83e7833df3f19a2e29342ce

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.2-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.2-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 708b52bcf0363841fe660044f85dd42c571926e783e53228c7058b7507897036
MD5 43ac74c257c3d35247352ed4701f56af
BLAKE2b-256 e3f094abf05563fb964244964bbac94838b8b1ef5973a580e52f79abab9ae3eb

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page