
tools for comparing biological sequences with k-mer sketches


sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

License: 3-Clause BSD

Usage:

sourmash sketch dna *.fq.gz
sourmash compare *.sig -o distances.cmp -k 31
sourmash plot distances.cmp
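To give a feel for what "comparing sequences with k-mer sketches" means, here is a minimal pure-Python sketch of a bottom-k MinHash comparison. It is an illustration only and is not sourmash's implementation: the function names, the SHA-256-based hash, and the default sketch size are assumptions made for this example, and it skips steps that real tools typically perform, such as treating a k-mer and its reverse complement as the same k-mer.

```python
import hashlib


def kmers(sequence, k):
    """Yield every overlapping k-mer of a sequence, uppercased."""
    sequence = sequence.upper()
    for i in range(len(sequence) - k + 1):
        yield sequence[i:i + k]


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (illustrative hash choice)."""
    digest = hashlib.sha256(kmer.encode("ascii")).digest()
    return int.from_bytes(digest[:8], "big")


def sketch(sequence, k=31, num=500):
    """Keep the `num` smallest distinct k-mer hashes (a bottom-k sketch)."""
    hashes = {hash_kmer(kmer) for kmer in kmers(sequence, k)}
    return set(sorted(hashes)[:num])


def jaccard(sketch_a, sketch_b, num=500):
    """Estimate Jaccard similarity from two bottom-k sketches.

    Take the `num` smallest hashes of the merged sketches and count
    how many of them are present in both inputs.
    """
    merged = sorted(sketch_a | sketch_b)[:num]
    if not merged:
        return 0.0
    shared = sum(1 for h in merged if h in sketch_a and h in sketch_b)
    return shared / len(merged)


if __name__ == "__main__":
    a = sketch("AAAAAC", k=5)
    b = sketch("AAAAAG", k=5)
    print(jaccard(a, b))
```

When both sequences contain fewer than `num` distinct k-mers, the sketches hold every hash and the estimate equals the exact Jaccard similarity of the two k-mer sets.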

sourmash 1.0 is published in JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027).

The latest major release is sourmash v4, which has several command-line and Python incompatibilities with previous versions. Please visit our migration guide to upgrade!


The name is a riff on Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 4.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.10 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.
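If you want to check an existing environment against these requirements before installing, a small script along the following lines may help. This is a sketch under assumptions: the helper names `python_ok` and `installed_versions` are made up for this example, and the package list is copied from the paragraph above rather than read from sourmash's own metadata.

```python
import sys
from importlib import metadata

# Values taken from the requirements paragraph above.
MIN_PYTHON = (3, 10)
BASE_REQUIREMENTS = ("screed", "cffi", "numpy", "matplotlib", "scipy")


def python_ok(version_info=None):
    """Return True if the given (or running) Python is at least MIN_PYTHON."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= MIN_PYTHON


def installed_versions(packages=BASE_REQUIREMENTS):
    """Map each package name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    print("Python version OK:", python_ok())
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

A missing package is reported as "not installed" rather than raising, so the script can be run safely in a fresh environment.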

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash
$ conda activate sourmash_env
$ sourmash --help

which will install the latest released version.

Support

For questions, please open an issue on GitHub, or ask in our chat.

Development

Development happens on GitHub at sourmash-bio/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; you can also run it with python -m sourmash. For a developer install in a virtual environment, run pip install -e /path/to/repo.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require pytest and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.

Research notice

Please note that this repository is participating in a study into sustainability of open source projects. Data will be gathered about this repository for approximately the next 12 months, starting from 2021-06-11.

Data collected will include number of contributors, number of PRs, time taken to close/merge these PRs, and issues closed.

For more information, please visit our informational page or download our participant information sheet.


CTB Feb 2021


Download files


Source Distribution

sourmash-4.8.5.tar.gz (13.3 MB)

Built Distributions

sourmash-4.8.5-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.7 MB): Python 3, manylinux (glibc 2.17+), x86-64

sourmash-4.8.5-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (3.6 MB): Python 3, manylinux (glibc 2.17+), ARM64

sourmash-4.8.5-py3-none-macosx_11_0_x86_64.whl (2.6 MB): Python 3, macOS 11.0+, x86-64

sourmash-4.8.5-py3-none-macosx_11_0_arm64.whl (2.4 MB): Python 3, macOS 11.0+, ARM64

File details

Details for the file sourmash-4.8.5.tar.gz.

File metadata

  • Download URL: sourmash-4.8.5.tar.gz
  • Upload date:
  • Size: 13.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for sourmash-4.8.5.tar.gz
Algorithm Hash digest
SHA256 9016c338d6b01f152d2c7becf5631000b058ff657f4fc50daa6ef15373d8a02a
MD5 4dbe8908443fe10707605f3a54758004
BLAKE2b-256 5f4bbeed3386e538115221e0e361da35125cd562acb04978ad8eea5903cd84e6
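A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only Python's standard library; the helper names `sha256_of` and `matches` and the local file path in the usage note are assumptions for illustration.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, assuming the source distribution was saved in the current directory, `matches("sourmash-4.8.5.tar.gz", "9016c338d6b01f152d2c7becf5631000b058ff657f4fc50daa6ef15373d8a02a")` should return True for an intact download.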


File details

Details for the file sourmash-4.8.5-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File hashes

Hashes for sourmash-4.8.5-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 75e52771030deb1d95458ad179bd58eb08d8383031de433b30155f7dae978e30
MD5 674d341b1ae8b3e2eb67c896bcafcc52
BLAKE2b-256 34bbb82651fb694c2d31898fcaefb441e26f99f56f394ae08cacfa953e1833f0


File details

Details for the file sourmash-4.8.5-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File hashes

Hashes for sourmash-4.8.5-py3-none-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 9216036e0288b575830695702f9f521ade2becf170235e8fa02f7650db966335
MD5 4c30ba5e9ffd292193efcdca160ccd9c
BLAKE2b-256 ebbe1a76073107569a5f8fccde834ff9fdca547e6a080ddecc04202827c829c4


File details

Details for the file sourmash-4.8.5-py3-none-macosx_11_0_x86_64.whl.

File hashes

Hashes for sourmash-4.8.5-py3-none-macosx_11_0_x86_64.whl
Algorithm Hash digest
SHA256 7f2b969b68e40abd46b0efe8c3db219d663d2f00c322bffd57d7f084664381a7
MD5 ca351ccd8aa5e3a256c900977df48a7b
BLAKE2b-256 b7eac9075d7096480a77ae4ace5e5f0cd3d7c9fcdce77958d0b3fbb7d31c905a


File details

Details for the file sourmash-4.8.5-py3-none-macosx_11_0_arm64.whl.

File hashes

Hashes for sourmash-4.8.5-py3-none-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 e51f18838815db2ce80672800aa53003dd9ea3c82ab983ebe6f73e88e1af6624
MD5 84ada55de88abc4c21550f52ba82c70a
BLAKE2b-256 926aee1623ab885461022c2a868f3b9e846f3d28a178537ba4834c09009c1c22

