Skip to main content

a library for doing approximate and phonetic matching of strings.

Project description

https://travis-ci.org/jamesturk/jellyfish.svg?branch=master https://coveralls.io/repos/jamesturk/jellyfish/badge.png?branch=master https://img.shields.io/pypi/v/jellyfish.svg Documentation Status https://ci.appveyor.com/api/projects/status/t5o03rqcusxhhe41/branch/master?svg=true

Jellyfish is a python library for doing approximate and phonetic matching of strings.

Written by James Turk <james.p.turk@gmail.com> and Michael Stephens.

See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.

Source is available at http://github.com/jamesturk/jellyfish.

Included Algorithms

String comparison:

  • Levenshtein Distance

  • Damerau-Levenshtein Distance

  • Jaro Distance

  • Jaro-Winkler Distance

  • Match Rating Approach Comparison

  • Hamming Distance

Phonetic encoding:

  • American Soundex

  • Metaphone

  • NYSIIS (New York State Identification and Intelligence System)

  • Match Rating Codex

Example Usage

>>> import jellyfish
>>> jellyfish.levenshtein_distance(u'jellyfish', u'smellyfish')
2
>>> jellyfish.jaro_distance(u'jellyfish', u'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance(u'jellyfish', u'jellyfihs')
1
>>> jellyfish.metaphone(u'Jellyfish')
'JLFX'
>>> jellyfish.soundex(u'Jellyfish')
'J412'
>>> jellyfish.nysiis(u'Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex(u'Jellyfish')
'JLLFSH'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jellyfish-0.5.4.tar.gz (132.3 kB view details)

Uploaded Source

Built Distribution

jellyfish-0.5.4-cp27-none-macosx_10_10_x86_64.whl (23.7 kB view details)

Uploaded CPython 2.7 macOS 10.10+ x86-64

File details

Details for the file jellyfish-0.5.4.tar.gz.

File metadata

  • Download URL: jellyfish-0.5.4.tar.gz
  • Upload date:
  • Size: 132.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for jellyfish-0.5.4.tar.gz
Algorithm Hash digest
SHA256 6fc6463bf44a53dd8aa422750353413f266553d0bd46a001043bb823aae444d4
MD5 3fc5b35caed6fdaf3e71a050a4d11b19
BLAKE2b-256 cca8fc7d71ccba20e1553b06713601cc0b83908cf4d6f5aa5a4f2c804f8b3000

See more details on using hashes here.

File details

Details for the file jellyfish-0.5.4-cp27-none-macosx_10_10_x86_64.whl.

File metadata

File hashes

Hashes for jellyfish-0.5.4-cp27-none-macosx_10_10_x86_64.whl
Algorithm Hash digest
SHA256 b5d7923abb3315e243bf30d0537a659a8371555862f19139f55f2f87ac577d3c
MD5 b6852ac50ae65054a2a367f5aa649031
BLAKE2b-256 8be4402af5acc7473c5a7da1ac0a55a51c2d27547707ad0a6dd4e20a044d228b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page