Skip to main content

tools for comparing DNA sequences with MinHash sketches

Project description

sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Documentation Build Status Bioconda install PyPI codecov DOI License: 3-Clause BSD

Usage:

sourmash compute *.fq.gz
sourmash compare *.sig -o distances
sourmash plot distances

sourmash 1.0 is published on JOSS; please cite that paper if you use sourmash (doi: 10.21105/joss.00027):.


The name is a riff off of Mash, combined with @ctb's love of whiskey. (Sour mash is used in making whiskey.)

Primary authors: C. Titus Brown (@ctb) and Luiz C. Irber, Jr (@luizirber).

sourmash was initially developed by the Lab for Data-Intensive Biology at the UC Davis School of Veterinary Medicine, and now includes contributions from the global research and developer community.

Installation

We recommend using bioconda to install sourmash:

conda install -c conda-forge -c bioconda sourmash

This will install the latest stable version of sourmash 3.

You can also use pip to install sourmash:

pip install sourmash

A quickstart tutorial is available.

Requirements

sourmash runs under Python 3.7 and later. The base requirements are screed, cffi, numpy, matplotlib, and scipy. Conda (see below) will install everything necessary, and is our recommended installation method.

Installation with conda

Bioconda is a channel for the conda package manager with a focus on bioinformatics software. After installing conda, you can install sourmash by running:

$ conda create -n sourmash_env -c conda-forge -c bioconda sourmash python=3.7
$ source activate sourmash_env
$ sourmash compute -h

which will install the latest released version.

Support

Please ask questions and files issues on Github.

Development

Development happens on github at dib-lab/sourmash.

sourmash is developed in Python and Rust, and you will need a Rust environment to build it; see the developer notes for our suggested development setup.

After installation, sourmash is the main command-line entry point; run it with python -m sourmash, or do pip install -e /path/to/repo to do a developer install in a virtual environment.

The sourmash/ directory contains the Python library and command-line interface code.

The src/core/ directory contains the Rust library implementing core functionality.

Tests require py.test and can be run with make test.

Please see the developer notes for more information on getting set up with a development environment.


CTB July 2020

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

sourmash-4.0.0a2-py2.py3-none-manylinux2010_x86_64.whl (1.2 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ x86-64

sourmash-4.0.0a2-py2.py3-none-manylinux2010_i686.whl (1.3 MB view details)

Uploaded Python 2 Python 3 manylinux: glibc 2.12+ i686

sourmash-4.0.0a2-py2.py3-none-manylinux1_x86_64.whl (1.2 MB view details)

Uploaded Python 2 Python 3

sourmash-4.0.0a2-py2.py3-none-manylinux1_i686.whl (1.3 MB view details)

Uploaded Python 2 Python 3

File details

Details for the file sourmash-4.0.0a2-py2.py3-none-manylinux2010_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0a2-py2.py3-none-manylinux2010_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0a2-py2.py3-none-manylinux2010_x86_64.whl
Algorithm Hash digest
SHA256 49a0e7c4fdee5f5ec86378ca224e0ed19e83f7dd5edac9ec48a680524b044434
MD5 802ad88ea8dafeccb894ba3c4e3d358b
BLAKE2b-256 98ebc670e7a610971912f67178816f91217b69aab2931978d365978814054c39

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0a2-py2.py3-none-manylinux2010_i686.whl.

File metadata

  • Download URL: sourmash-4.0.0a2-py2.py3-none-manylinux2010_i686.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 2, Python 3, manylinux: glibc 2.12+ i686
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0a2-py2.py3-none-manylinux2010_i686.whl
Algorithm Hash digest
SHA256 042ea0666cff01d7b04f6168d807970c7830bf045b0a3e78109296c794645513
MD5 ae9420a0893d9f4d455a92ff0eff127b
BLAKE2b-256 e4aba5c7baf60bc8a3031b0363f45cf068035865dc9264d1ab648f2f76bbb6f7

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0a2-py2.py3-none-manylinux1_x86_64.whl.

File metadata

  • Download URL: sourmash-4.0.0a2-py2.py3-none-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0a2-py2.py3-none-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 efed0f6163349f7529d3f5c20ad51198b86c9e9c7d009af669812debc7b32fca
MD5 f8fb4c20b0b9fe2ff202a92b824d296e
BLAKE2b-256 2f8b8f3328e2208916c51e93f11e9d540c6e4aff9bf621ef6445b2a075d583e9

See more details on using hashes here.

Provenance

File details

Details for the file sourmash-4.0.0a2-py2.py3-none-manylinux1_i686.whl.

File metadata

  • Download URL: sourmash-4.0.0a2-py2.py3-none-manylinux1_i686.whl
  • Upload date:
  • Size: 1.3 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.6

File hashes

Hashes for sourmash-4.0.0a2-py2.py3-none-manylinux1_i686.whl
Algorithm Hash digest
SHA256 f4c8768e1413c3b4a1e99364e3fc0b7b0978a34551fa51d21f654b9b442ecff3
MD5 2b3c951e5cb7a52dd53057d4dcf28682
BLAKE2b-256 1bb8101ea135b593712ab65d0c6a1df500c83956420cf713bde1ab9582ff0bd1

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page