Hash-based phonemic sequence identifiers
Konstel(lations)
Not yet stable, proceed with caution
An extensible command line tool and library for generating memorable and pronounceable hash-based identifier schemes for sequences, biological or otherwise. For further details and my SARS-CoV-2 naming proposal, please read my blog post.
Install
# Python >= 3.6
pip install konstel
# Latest
pip install git+https://github.com/bede/konstel
Usage
Command line
$ konstel gen sars-cov-2-s.genome konstel/tests/data/spike2.genome.fa --output table
scheme sars-cov-2-s
hash S:w80qgz2k1fdds6x4mknxazm7psed5knd
hash-4 S:w80q
id S:gofabil
$ echo "ACGT" | konstel gen generic.nucl - --output table
scheme generic
hash 4449jkgqyv6akzs3aaptjav527dger1m
id bodafanoja
Python
>>> from konstel import konstel
>>> konstel.generate('sars-cov-2-s.protein', 'konstel/tests/data/spike.prot.fa')
{"scheme": "sars-cov-2-s", "hash": "S:c52gdyc7v1dfznny48hfkn1g8ax2yx8f", "hash-4": "S:c52g", "id": "S:dodidib"}
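To illustrate the general idea behind the `hash` and `id` fields shown above, here is a minimal sketch of turning a sequence hash into a pronounceable identifier. It is not konstel's actual scheme: the use of SHA-256, the consonant and vowel alphabets, the normalisation step, and the function name `phonemic_id` are all assumptions made for illustration, so its output will not match the IDs that konstel produces.

```python
import hashlib

# Hypothetical alphabets chosen for this sketch, not taken from konstel
CONSONANTS = "bdfghjklmnprstvz"  # 16 symbols, 4 bits per consonant
VOWELS = "aiou"                  # 4 symbols, 2 bits per vowel


def phonemic_id(sequence: str, syllables: int = 4) -> str:
    """Map a sequence to an ID made of consonant-vowel syllables."""
    # Normalise so that case and surrounding whitespace do not change the ID
    normalised = sequence.strip().upper().encode("ascii")
    # Hash the normalised sequence and treat the digest as one large integer
    value = int.from_bytes(hashlib.sha256(normalised).digest(), "big")
    parts = []
    for _ in range(syllables):
        # Consume 6 bits of the hash per syllable: 4 for the consonant, 2 for the vowel
        value, c = divmod(value, len(CONSONANTS))
        value, v = divmod(value, len(VOWELS))
        parts.append(CONSONANTS[c] + VOWELS[v])
    return "".join(parts)


if __name__ == "__main__":
    print(phonemic_id("ACGT"))
```

Because each syllable encodes only 6 bits in this sketch, a short ID is easy to remember but can collide between different sequences, which is one reason to report the full hash alongside it.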