Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Gitter Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on Github, or ask in our chat.

Development

Development happens on github at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021

Project details


Release history Release notifications | RSS feed

This version

4.4.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-4.4.1.tar.gz (8.2 MB view details)

Uploaded Source

Built Distributions

sourmash-4.4.1-py2.py3-none-musllinux_1_1_x86_64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ x86-64

sourmash-4.4.1-py2.py3-none-musllinux_1_1_aarch64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 musllinux: musl 1.1+ ARM64

sourmash-4.4.1-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ x86-64

sourmash-4.4.1-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl (1.8 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ s390x

sourmash-4.4.1-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (1.7 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ppc64le

sourmash-4.4.1-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (1.6 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.17+ ARM64

sourmash-4.4.1-py2.py3-none-macosx_11_0_arm64.whl (743.7 kB view details)

Uploaded Python 2 Python 3 macOS 11.0+ ARM64

sourmash-4.4.1-py2.py3-none-macosx_10_11_x86_64.whl (772.2 kB view details)

Uploaded Python 2 Python 3 macOS 10.11+ x86-64

File details

Details for the file sourmash-4.4.1.tar.gz.

File metadata

  • Download URL: sourmash-4.4.1.tar.gz
  • Upload date:
  • Size: 8.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.13

File hashes

Hashes for sourmash-4.4.1.tar.gz
Algorithm Hash digest
SHA256 f8afe23da1e1509f9eeec41bc82414caceb11c8c3cb87d883583e2f21a270763
MD5 047356828f8bcc6fe27c32f8cca873bd
BLAKE2b-256 1051673ae359ef177f31c2f8c4e48d564726098f9114deaae654c8f29eb45db8

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-musllinux_1_1_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-musllinux_1_1_x86_64.whl
Algorithm Hash digest
SHA256 ba07ab5dc222f1b3db3be999f8bf56756a258ea01935b0d1e7d66579d27c5ea7
MD5 4d3f0246f6cba8dfb61a75ebc3642c2c
BLAKE2b-256 4dc5247ce17502d89859127c4707ba85481a9612f7f85b3fbed6b9342fd0f76c

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-musllinux_1_1_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-musllinux_1_1_aarch64.whl
Algorithm Hash digest
SHA256 af9715f4d2b34fecfdd3104fc1d72cd43b983fd0ca47f542ac6420cb7feb6db8
MD5 0447632cb28ee404cfc8634744836f18
BLAKE2b-256 b27849c310f72a4eaff9fdc34d5d5263d8aa0e63b1227e6a0d8f0ef02b9d067d

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0aa1a8ccf24b5bf18ccbebd069e376fd47882927b04c60a681a234f9976ecf8b
MD5 01940f646fac6bb0f884b91f4af73017
BLAKE2b-256 b84de99938567e5bcb2959f6ebcbefa54ef5d2c7e2cbc617a3984fe91a2f7c9b

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-manylinux_2_17_s390x.manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 6d8a0103c90c0b4fa47ea287ce682ff1e139217a1adf4f33b98f3a0ad87cf6da
MD5 8bb0c1dfefdbdf861e64095272bb4792
BLAKE2b-256 2e7c85dc33d3b0034b256d1e8013e1db5e35b1117a33710f72901a205da9a3ef

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 3a87a709e52f768d7288881234803b2dc60f2a5b9953f7b321bddd7649d24a3c
MD5 ee2c946d3f530dc35bb10a647f693ac5
BLAKE2b-256 935e8182eac6055c29164565c9f4d9653e8884c39196baca01886af86b06e838

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 3168c5bbb13674036c25827920dab3813508ab495feaa3a3872d40388862bcfa
MD5 41a5976c5e76bb4382f07687c61fe28b
BLAKE2b-256 f11a0f28ad557c706fd8419ce0058f0d96583f8e5d5b74420a44196f5765d750

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 b5c297644dbae078d8e4207cc6669cbefeb6798806291d42f4e0e75e5f32e96a
MD5 fadb46c6f32c946abb489f783dacf5cb
BLAKE2b-256 99e8724ed9898e8dd261edd377f6559d9056536fb72473048f9f1c8df9ce62ae

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.4.1-py2.py3-none-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for sourmash-4.4.1-py2.py3-none-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 41287165b9c14ce85df6d9c5efb039dcc7908d0c22954e9281d52c06c5f2d94b
MD5 f4ed1cad926a85cbed578b52f143eaf6
BLAKE2b-256 7b5c1cbf5bec74e7de437ecec7cee82ee60117ef3ba0fe4a7908c531601c10cb

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page