A library for doing approximate and phonetic matching of strings.

Reason this release was yanked:

broken C extension

Project description

Jellyfish is a Python library for doing approximate and phonetic matching of strings.

Written by James Turk <dev@jamesturk.net> and Michael Stephens.

See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.

See http://jellyfish.readthedocs.io for documentation.

Source is available at http://github.com/jamesturk/jellyfish.

Jellyfish >= 0.7 supports only Python 3. If you need Python 2, use the 0.6.x series.

Included Algorithms

String comparison:

  • Levenshtein Distance

  • Damerau-Levenshtein Distance

  • Jaro Distance

  • Jaro-Winkler Distance

  • Match Rating Approach Comparison

  • Hamming Distance
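
To make the first entry in this list concrete, here is a minimal pure-Python sketch of Levenshtein distance using the usual two-row dynamic-programming table. It is an illustration only, not Jellyfish's own implementation, and the function name `levenshtein` is an assumption chosen for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions, and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    # previous[j] holds the distance from the current prefix of a to b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


print(levenshtein("jellyfish", "smellyfish"))  # 2
```

Damerau-Levenshtein distance extends the same table with one more case that treats a swap of two adjacent characters as a single edit.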

Phonetic encoding:

  • American Soundex

  • Metaphone

  • NYSIIS (New York State Identification and Intelligence System)

  • Match Rating Codex
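
As an illustration of what a phonetic encoding does, the sketch below implements the commonly described American Soundex rules in plain Python: keep the first letter, map the remaining consonants to digits, collapse runs of the same digit, and pad to four characters. It is a hedged example rather than Jellyfish's code; the names `CODES` and `soundex` are assumptions, and edge cases such as non-ASCII input may be handled differently by the library.

```python
# Digit assigned to each consonant group; vowels, Y, H, and W have no digit.
CODES = {}
for letters, digit in (
    ("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
    ("L", "4"), ("MN", "5"), ("R", "6"),
):
    for letter in letters:
        CODES[letter] = digit


def soundex(name: str) -> str:
    """Return a four-character American Soundex code for an ASCII name."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    last = CODES.get(letters[0], "")  # the first letter's digit still suppresses repeats
    for c in letters[1:]:
        if c in "HW":
            continue  # H and W do not break a run of equal digits
        code = CODES.get(c, "")  # vowels and Y yield "" and reset the run
        if code and code != last:
            result += code
        last = code
    return (result + "000")[:4]  # pad with zeros, then truncate to four characters


print(soundex("Jellyfish"))  # J412
```

Names that sound alike, such as "Robert" and "Rupert", receive the same code under these rules.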

Example Usage

>>> import jellyfish
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
2
>>> jellyfish.jaro_distance('jellyfish', 'smellyfish')
0.8962962962962964
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
1
>>> jellyfish.metaphone('Jellyfish')
'JLFX'
>>> jellyfish.soundex('Jellyfish')
'J412'
>>> jellyfish.nysiis('Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex('Jellyfish')
'JLLFSH'
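
A common use of these functions is ranking candidate strings by how close they are to a query. The sketch below shows one way this might look. `rank_matches` is a hypothetical helper, not part of Jellyfish, and the small `hamming` function is a stand-in so the example runs without the library installed; counting each surplus character as one difference is a choice made for this sketch.

```python
from typing import Callable, Iterable, List, Tuple


def hamming(a: str, b: str) -> int:
    """Count differing positions; each surplus character also counts as one."""
    differing = sum(x != y for x, y in zip(a, b))
    return differing + abs(len(a) - len(b))


def rank_matches(
    query: str,
    candidates: Iterable[str],
    distance: Callable[[str, str], int],
) -> List[Tuple[int, str]]:
    """Return (distance, candidate) pairs sorted from closest to farthest."""
    return sorted((distance(query, c), c) for c in candidates)


ranked = rank_matches("jellyfish", ["smellyfish", "starfish", "jellyfihs"], hamming)
print(ranked[0])  # (2, 'jellyfihs')
```

With Jellyfish installed, any of its two-argument distance functions, for example `jellyfish.levenshtein_distance`, can be passed as `distance` in place of the stand-in.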

Running Tests

If you are interested in contributing to Jellyfish, you may want to run the tests locally. Jellyfish uses tox to run its tests, which you can set up and run as follows:

pip install tox
# cd jellyfish/
tox

Download files

Source Distribution

jellyfish-0.8.1.tar.gz (134.2 kB)

Uploaded Source

Built Distributions

jellyfish-0.8.1-cp38-cp38-manylinux2014_x86_64.whl (94.0 kB)

Uploaded CPython 3.8

jellyfish-0.8.1-cp37-cp37m-manylinux2014_x86_64.whl (90.4 kB)

Uploaded CPython 3.7m

jellyfish-0.8.1-cp36-cp36m-manylinux2014_x86_64.whl (93.8 kB)

Uploaded CPython 3.6m

jellyfish-0.8.1-cp35-cp35m-manylinux2014_x86_64.whl (93.1 kB)

Uploaded CPython 3.5m

File details

Details for the file jellyfish-0.8.1.tar.gz.

File metadata

  • Download URL: jellyfish-0.8.1.tar.gz
  • Upload date:
  • Size: 134.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.1.tar.gz
Algorithm Hash digest
SHA256 4d3d0937e1e56f055cc23705d90a3a27df156b340f726f36a11356b97521102b
MD5 f6508276177bb23ca92a730a4c9d4c06
BLAKE2b-256 d4053ecf73e5e77153c4b849c0e722cc5351233a2cfc3ca8c389049114f946cc

File details

Details for the file jellyfish-0.8.1-cp38-cp38-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.1-cp38-cp38-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 94.0 kB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.1-cp38-cp38-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 6e466013b23af78778b70a12620262461fa0ff71c56fd58ed944c6af7236cf37
MD5 f811cc79515d95e8b878091cbb5f3c45
BLAKE2b-256 3d833bd7d1fe8d08b33779c9a53fff6e00ea0bda0d59ad59f9b480599e0e80db

File details

Details for the file jellyfish-0.8.1-cp37-cp37m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.1-cp37-cp37m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 90.4 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.1-cp37-cp37m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 03d58b508b018557808e4068f85ab7d6d6f7e230b780307df2fa47a181efa50f
MD5 862ae398351ea4aae8118ef7a9747069
BLAKE2b-256 7e57609a7dfda4b0876f6d7fb436b028cbde3bc12e0f76549e0fd4b005e1e537

File details

Details for the file jellyfish-0.8.1-cp36-cp36m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.1-cp36-cp36m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 93.8 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.1-cp36-cp36m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 88633f9e6a0167a2d8818ce634535b5ef0bc7fa638eddb6e611d7b7dcad96962
MD5 e682295e26057ead1fd6ca363fccaeda
BLAKE2b-256 3a0e3898c268a6b9dac0eecdd8a0dc70169dfbabc9aa428fe5302e3ff9b9246a

File details

Details for the file jellyfish-0.8.1-cp35-cp35m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.1-cp35-cp35m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 93.1 kB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.1-cp35-cp35m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 c8602ab7ac85d1e0933361f9b0a0f8cefc33fe7fbfd1a41fcb778d4e6c8abeac
MD5 e613860c546cc6902165e654607f2019
BLAKE2b-256 4addd5b2f69b308590f42c97f607e22e75b7766e370590ad409eb54baca3858f
