Project description
Jellyfish is a Python library for approximate and phonetic matching of strings.
Jellyfish is a project of Sunlight Labs (c) 2014. All code is released under a BSD-style license; see LICENSE for details.
Written by James Turk <jturk@sunlightfoundation.com> and Michael Stephens.
See https://github.com/sunlightlabs/jellyfish/graphs/contributors for contributors.
Source is available at http://github.com/sunlightlabs/jellyfish.
Included Algorithms
String comparison:
Levenshtein Distance
Damerau-Levenshtein Distance
Jaro Distance
Jaro-Winkler Distance
Match Rating Approach Comparison
Hamming Distance
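To make the first entry in this list concrete, here is a minimal pure-Python sketch of Levenshtein distance using the usual row-by-row dynamic programming table. The name `levenshtein_sketch` is hypothetical and used only for illustration. This is not jellyfish's own implementation, just one common way to compute the same quantity.

```python
def levenshtein_sketch(a, b):
    """Count the insertions, deletions and substitutions needed to turn a into b.

    Illustrative sketch only; not taken from the jellyfish source.
    """
    # previous[j] is the distance between the already processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        # Turning a[:i] into the empty string takes i deletions.
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute, or keep a matching character
            ))
        previous = current
    return previous[-1]


print(levenshtein_sketch('jellyfish', 'smellyfish'))
```

Only two rows of the table are kept at a time, so memory use grows with the length of `b` rather than with the product of both lengths.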
Phonetic encoding:
American Soundex
Metaphone
NYSIIS (New York State Identification and Intelligence System)
Match Rating Codex
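As an illustration of what a phonetic encoding does, the following is a minimal sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters that share a digit, and pad the result to four characters. The names `soundex_sketch` and `_CODES` are hypothetical. This is not jellyfish's implementation, and the library may differ in edge cases such as non-letter input.

```python
# Consonant groups that share a Soundex digit; vowels, y, h and w have no digit.
_CODES = {}
for _digit, _group in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for _letter in _group:
        _CODES[_letter] = str(_digit)


def soundex_sketch(name):
    """Return a four-character Soundex-style code (illustrative sketch only)."""
    letters = [c for c in name.lower() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            # h and w are skipped and do not break a run of same-coded letters.
            continue
        code = _CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        # A vowel sets last to "", so a repeated digit after a vowel is kept.
        last = code
    return result.ljust(4, "0")


print(soundex_sketch('Jellyfish'))
```

Names that sound alike, such as 'Robert' and 'Rupert', map to the same code under these rules, which is what makes the encoding useful for fuzzy lookup.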
Example Usage
>>> import jellyfish
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
2
>>> jellyfish.jaro_distance('jellyfish', 'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
1

>>> jellyfish.metaphone('Jellyfish')
'JLFX'
>>> jellyfish.soundex('Jellyfish')
'J412'
>>> jellyfish.nysiis('Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex('Jellyfish')
'JLLFSH'
Source Distribution
File details
Details for the file jellyfish-0.3.4.tar.gz.
File metadata
- Download URL: jellyfish-0.3.4.tar.gz
- Upload date:
- Size: 20.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 57d40ec3a53c5ba7a83b746262823eef737105d2d4d6e0ce48873fbb9f18dab8
MD5 | 42a541f93cccbb300ae59062620f4e08
BLAKE2b-256 | 36b732abfcba4f9c91ec93f0505fbc87fce1528a832d622edb58e4734063bcab