a library for doing approximate and phonetic matching of strings.
Project description
Jellyfish is a python library for doing approximate and phonetic matching of strings.
Written by James Turk <james.p.turk@gmail.com> and Michael Stephens.
See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.
Source is available at http://github.com/jamesturk/jellyfish.
Included Algorithms
String comparison:
Levenshtein Distance
Damerau-Levenshtein Distance
Jaro Distance
Jaro-Winkler Distance
Match Rating Approach Comparison
Hamming Distance
Phonetic encoding:
American Soundex
Metaphone
NYSIIS (New York State Identification and Intelligence System)
Match Rating Codex
Example Usage
>>> import jellyfish >>> jellyfish.levenshtein_distance(u'jellyfish', u'smellyfish') 2 >>> jellyfish.jaro_distance(u'jellyfish', u'smellyfish') 0.89629629629629637 >>> jellyfish.damerau_levenshtein_distance(u'jellyfish', u'jellyfihs') 1
>>> jellyfish.metaphone(u'Jellyfish') 'JLFX' >>> jellyfish.soundex(u'Jellyfish') 'J412' >>> jellyfish.nysiis(u'Jellyfish') 'JALYF' >>> jellyfish.match_rating_codex(u'Jellyfish') 'JLLFSH'
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file jellyfish-0.5.4.tar.gz
.
File metadata
- Download URL: jellyfish-0.5.4.tar.gz
- Upload date:
- Size: 132.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6fc6463bf44a53dd8aa422750353413f266553d0bd46a001043bb823aae444d4 |
|
MD5 | 3fc5b35caed6fdaf3e71a050a4d11b19 |
|
BLAKE2b-256 | cca8fc7d71ccba20e1553b06713601cc0b83908cf4d6f5aa5a4f2c804f8b3000 |
File details
Details for the file jellyfish-0.5.4-cp27-none-macosx_10_10_x86_64.whl
.
File metadata
- Download URL: jellyfish-0.5.4-cp27-none-macosx_10_10_x86_64.whl
- Upload date:
- Size: 23.7 kB
- Tags: CPython 2.7, macOS 10.10+ x86-64
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b5d7923abb3315e243bf30d0537a659a8371555862f19139f55f2f87ac577d3c |
|
MD5 | b6852ac50ae65054a2a367f5aa649031 |
|
BLAKE2b-256 | 8be4402af5acc7473c5a7da1ac0a55a51c2d27547707ad0a6dd4e20a044d228b |