Skip to main content

a library for doing approximate and phonetic matching of strings.

Project description

https://travis-ci.com/jamesturk/jellyfish.svg?branch=master https://coveralls.io/repos/jamesturk/jellyfish/badge.png?branch=master https://img.shields.io/pypi/v/jellyfish.svg Documentation Status https://ci.appveyor.com/api/projects/status/9xeyl1f5sd5pl40h?svg=true

Jellyfish is a python library for doing approximate and phonetic matching of strings.

Written by James Turk <dev@jamesturk.net> and Michael Stephens.

See https://github.com/jamesturk/jellyfish/graphs/contributors for contributors.

See http://jellyfish.readthedocs.io for documentation.

Source is available at http://github.com/jamesturk/jellyfish.

Jellyfish >= 0.7 only supports Python 3, if you need Python 2 please use 0.6.x.

Included Algorithms

String comparison:

  • Levenshtein Distance

  • Damerau-Levenshtein Distance

  • Jaro Distance

  • Jaro-Winkler Distance

  • Match Rating Approach Comparison

  • Hamming Distance

Phonetic encoding:

  • American Soundex

  • Metaphone

  • NYSIIS (New York State Identification and Intelligence System)

  • Match Rating Codex

Example Usage

>>> import jellyfish
>>> jellyfish.levenshtein_distance(u'jellyfish', u'smellyfish')
2
>>> jellyfish.jaro_distance(u'jellyfish', u'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance(u'jellyfish', u'jellyfihs')
1
>>> jellyfish.metaphone(u'Jellyfish')
'JLFX'
>>> jellyfish.soundex(u'Jellyfish')
'J412'
>>> jellyfish.nysiis(u'Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex(u'Jellyfish')
'JLLFSH'

Running Tests

If you are interested in contributing to Jellyfish, you may want to run tests locally. Jellyfish uses tox to run tests, which you can setup and run as follows:

pip install tox
# cd jellyfish/
tox

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jellyfish-0.8.2.tar.gz (134.2 kB view details)

Uploaded Source

Built Distributions

jellyfish-0.8.2-cp38-cp38-manylinux2014_x86_64.whl (94.0 kB view details)

Uploaded CPython 3.8

jellyfish-0.8.2-cp37-cp37m-manylinux2014_x86_64.whl (90.4 kB view details)

Uploaded CPython 3.7m

jellyfish-0.8.2-cp36-cp36m-manylinux2014_x86_64.whl (93.8 kB view details)

Uploaded CPython 3.6m

jellyfish-0.8.2-cp35-cp35m-manylinux2014_x86_64.whl (93.1 kB view details)

Uploaded CPython 3.5m

File details

Details for the file jellyfish-0.8.2.tar.gz.

File metadata

  • Download URL: jellyfish-0.8.2.tar.gz
  • Upload date:
  • Size: 134.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.2.tar.gz
Algorithm Hash digest
SHA256 a499741401512d05bbd3556e448e960bc908eba3879fb73d450e8e91566a030b
MD5 6e5a8778cd798d8328b8e5d1288b9ec7
BLAKE2b-256 043fd03cb056f407ef181a45569255348457b1a0915fc4eb23daeceb930a68a4

See more details on using hashes here.

File details

Details for the file jellyfish-0.8.2-cp38-cp38-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.2-cp38-cp38-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 94.0 kB
  • Tags: CPython 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.2-cp38-cp38-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 288727bac0cef5817640d1d0d3655dabb9906d1f7b3e31eb1bd0b005bdf0eaf2
MD5 2e5745b06e7db27bc672935e09df37cf
BLAKE2b-256 115052dc08e011620516462e3c23d9c0d174d8bb56dbdc2c9b623946f6c125ea

See more details on using hashes here.

File details

Details for the file jellyfish-0.8.2-cp37-cp37m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.2-cp37-cp37m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 90.4 kB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.2-cp37-cp37m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 4213dda17b75e09b4a5bac48b7dd2602cf54304af49fd6236d3539f49f614e8a
MD5 5ba76c8e2cc43d245813996eb510bed5
BLAKE2b-256 30a64d039bc827a102f62ce7a7910713e38fdfd7c7a40aa39c72fb14938a1473

See more details on using hashes here.

File details

Details for the file jellyfish-0.8.2-cp36-cp36m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.2-cp36-cp36m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 93.8 kB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.2-cp36-cp36m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2490726dda6603fad0090f7afa2b33718ac03011b6807658b8cdbf512a798e89
MD5 46011ae3a8364c7eff65275400ace219
BLAKE2b-256 6c09927ae35fc5a9f70abb6cc2c27ee88fc48549f7bc4786c1d4b177c22e997d

See more details on using hashes here.

File details

Details for the file jellyfish-0.8.2-cp35-cp35m-manylinux2014_x86_64.whl.

File metadata

  • Download URL: jellyfish-0.8.2-cp35-cp35m-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 93.1 kB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.7

File hashes

Hashes for jellyfish-0.8.2-cp35-cp35m-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f5c7fd1b02edf86135ed005ba8715ed9496d268e3d77e6428445689a441c4c64
MD5 85d148d5dd820c62bdac9db57986026d
BLAKE2b-256 4ef8f2f58540044ce01904e3dd4f3660a82b03c41debedcfda4ed344fdc660f5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page