Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Documentation Build Status PyPI codecov DOI License: 3-Clause BSD

🦀 Rust API Documentation on docs.rs


Compute MinHash signatures for nucleotide (DNA/RNA) and protein sequences.

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash is a product of the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 2.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under both Python 2.7.x and Python 3.5+. The base requirements are screed and ijson, together with a Rust environment (for the extension code). We suggest using rustup to install the Rust environment:

curl https://sh.rustup.rs -sSf | sh

The comparison code (sourmash compare) uses numpy, and the plotting code uses matplotlib and scipy, but most of the code is usable without these.

For search and gather you also need khmer version 2.1+.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda you will need to add the bioconda channel as well as the other channels bioconda depends on. Once you have setup bioconda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest alpha release.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the library code.

Tests require py.test and can be run with make test.

Please see the developer notes for more information.


CTB Dec 2018

Project details


Release history Release notifications | RSS feed

This version

3.0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourmash-3.0.1.tar.gz (7.3 MB view details)

Uploaded Source

Built Distributions

sourmash-3.0.1-py2.py3-none-manylinux2010_x86_64.whl (1.0 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ x86-64

sourmash-3.0.1-py2.py3-none-manylinux1_x86_64.whl (1.0 MB view details)

Uploaded Python 2 Python 3

sourmash-3.0.1-py2.py3-none-macosx_10_6_intel.whl (431.1 kB view details)

Uploaded Python 2 Python 3 macOS 10.6+ intel

File details

Details for the file sourmash-3.0.1.tar.gz.

File metadata

  • Download URL: sourmash-3.0.1.tar.gz
  • Upload date:
  • Size: 7.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for sourmash-3.0.1.tar.gz
Algorithm Hash digest
SHA256 0299903121f642abe38484cfae5a90461e95f5692a0aa0da249129eea076bf22
MD5 f99b8217a7451db0b4f4667a1ae2c522
BLAKE2b-256 1b8475a0108b40e66e6265d544ceda908b0d0b6b0536d2f6dfa694ba1e0549ff

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.0.1-py2.py3-none-manylinux2010_x86_64.whl.

File metadata

  • Download URL: sourmash-3.0.1-py2.py3-none-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 1.0 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for sourmash-3.0.1-py2.py3-none-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 f38895971076f7248abad25edad8890697119f17d66eeee52cf6a3379e391faf
MD5 7e61a014807ea931d4ffaf920acbca39
BLAKE2b-256 e16b371a342093c42fe877cd1229eb9d3d8aa05ead0769b967142e33b5e611dc

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.0.1-py2.py3-none-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-3.0.1-py2.py3-none-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.0 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for sourmash-3.0.1-py2.py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 9dd4ea51bbbde4ef6788991cfc2a105f7645cf0b0ba559a223b38270c826682e
MD5 19191cc0054d86316fbd5bd8d343191f
BLAKE2b-256 b677569dea831c3faacda914d4f91a006d4f24d60396264efea7aa164bbdecbc

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-3.0.1-py2.py3-none-macosx_10_6_intel.whl.

File metadata

  • Download URL: sourmash-3.0.1-py2.py3-none-macosx_10_6_intel.whl
  • Upload date:
  • Size: 431.1 kB
  • Tags: Python 2, Python 3, macOS 10.6+ intel
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.7

File hashes

Hashes for sourmash-3.0.1-py2.py3-none-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 b6b51b21f1c9da68900537e261842bb5b5b9c1ba2f5e2572ea6155aa63a6e3e1
MD5 7d9620f5fbb586351c437843ba7c6856
BLAKE2b-256 c53d86fef5abdcd6409e72d3fb701b45e43df2d093fb3620c15904f468742137

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page