Skip to main content

A python library for accurate and scaleable data deduplication and entity-resolution

Project description

dedupe is a library that uses machine learning to perform de-duplication and entity resolution quickly on structured data. dedupe is the open source engine for dedupe.io

dedupe will help you:

  • remove duplicate entries from a spreadsheet of names and addresses

  • link a list with customer information to another with order history, even without unique customer id’s

  • take a database of campaign contributions and figure out which ones were made by the same person, even if the names were entered slightly differently for each record

dedupe takes in human training data and comes up with the best rules for your dataset to quickly and automatically find similar records, even with very large databases.

Important links:

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dedupe-1.6.12.tar.gz (48.6 kB view details)

Uploaded Source

Built Distributions

dedupe-1.6.12-cp36-cp36m-manylinux1_x86_64.whl (74.4 kB view details)

Uploaded CPython 3.6m

dedupe-1.6.12-cp36-cp36m-manylinux1_i686.whl (71.2 kB view details)

Uploaded CPython 3.6m

dedupe-1.6.12-cp36-cp36m-macosx_10_11_x86_64.whl (50.1 kB view details)

Uploaded CPython 3.6m macOS 10.11+ x86-64

dedupe-1.6.12-cp35-cp35m-manylinux1_x86_64.whl (74.2 kB view details)

Uploaded CPython 3.5m

dedupe-1.6.12-cp35-cp35m-manylinux1_i686.whl (70.9 kB view details)

Uploaded CPython 3.5m

dedupe-1.6.12-cp34-cp34m-win_amd64.whl (50.8 kB view details)

Uploaded CPython 3.4m Windows x86-64

dedupe-1.6.12-cp34-cp34m-win32.whl (50.1 kB view details)

Uploaded CPython 3.4m Windows x86

dedupe-1.6.12-cp34-cp34m-manylinux1_x86_64.whl (74.4 kB view details)

Uploaded CPython 3.4m

dedupe-1.6.12-cp34-cp34m-manylinux1_i686.whl (71.1 kB view details)

Uploaded CPython 3.4m

dedupe-1.6.12-cp27-cp27mu-manylinux1_x86_64.whl (72.1 kB view details)

Uploaded CPython 2.7mu

dedupe-1.6.12-cp27-cp27mu-manylinux1_i686.whl (69.4 kB view details)

Uploaded CPython 2.7mu

dedupe-1.6.12-cp27-cp27m-win_amd64.whl (50.9 kB view details)

Uploaded CPython 2.7m Windows x86-64

dedupe-1.6.12-cp27-cp27m-win32.whl (50.1 kB view details)

Uploaded CPython 2.7m Windows x86

dedupe-1.6.12-cp27-cp27m-manylinux1_x86_64.whl (72.1 kB view details)

Uploaded CPython 2.7m

dedupe-1.6.12-cp27-cp27m-manylinux1_i686.whl (69.5 kB view details)

Uploaded CPython 2.7m

dedupe-1.6.12-cp27-cp27m-macosx_10_11_x86_64.whl (49.7 kB view details)

Uploaded CPython 2.7m macOS 10.11+ x86-64

File details

Details for the file dedupe-1.6.12.tar.gz.

File metadata

  • Download URL: dedupe-1.6.12.tar.gz
  • Upload date:
  • Size: 48.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dedupe-1.6.12.tar.gz
Algorithm Hash digest
SHA256 f8f663d196b578ef1d5f7272f2d98686c4680439925a86bcc2d9ae2dfd5e1bdf
MD5 c6c96b3d901112047638ea2f5cf1f79f
BLAKE2b-256 3fe111f0e3f2eb214ad3cc216f9d30d249598fef4af032b68f799b55afc4b3bf

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 22484a34aad7615e685439de5b37022200dad68c1fe18e3eea57e2419c7e4ecf
MD5 a544b683ef2283d26c67867651750d34
BLAKE2b-256 61b6f7c8dbb638a2a6a3db9cf64e5a173e919c764a6b3cc9c6a6f8b0903f598f

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp36-cp36m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp36-cp36m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 f8f5209b6c1701c6d2f056eb3679b1d23691f5f1056e60cbbfdbeb0bde6f3244
MD5 e2965ed8e48c0080f090d9962e89486b
BLAKE2b-256 9b2a3e18c92009bc265c65ed69b517dadccd53d28df19b64896da9b051bd141d

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp36-cp36m-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp36-cp36m-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 3963e063bc977654d90b38e224f186356ca9955b766f992f86ddfaa7bb37b236
MD5 7b0ce989b7f52b77025254f715a6ca5f
BLAKE2b-256 474245cb872766299dc71f4752d43f979ed05d0bfcd7a12fded24b93646dd7cd

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 dc9122f93d40bdb269a449ec60c48dd239bba90c8cb6c29b049cbb48441147a1
MD5 600566ef822b24e9751dc21bf16f4f64
BLAKE2b-256 2543c1dce9c45f1137b97429a31480bc9595aea67e8d3a6e2203645b046e2e8e

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp35-cp35m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp35-cp35m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 b3b71f8ed50bc12f113dcdbf68636fdb03e6825e4d22541ddf02dbee16712359
MD5 39be82005445d7ac581e5214f9feb7c0
BLAKE2b-256 4f7497da3089f2bd0073f8c03353fc7e71730d32673f11a7d0319f1b5ab1927b

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp34-cp34m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp34-cp34m-win_amd64.whl
Algorithm Hash digest
SHA256 ec5f1b91e6a4290400a4e1c210bf3e9e97dd86d1ceb69213798217359fc2d060
MD5 f407b37721acc9d6b572d3f4209ad02d
BLAKE2b-256 c91edf963b5c3d6f184de429a1d04b113222d435beeb0417d3d5ca7c7dd0292a

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp34-cp34m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp34-cp34m-win32.whl
Algorithm Hash digest
SHA256 71db7a89bea05808a939fafb203db0e89737cda49f8861add829d7a6c8038913
MD5 b5ed1a0a6b0835e66394016d226c5e16
BLAKE2b-256 cabccfcd0bbfd918d123329a389f674448ba40576ae0b6bdbe42e3e8fe4645d8

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 8585aa9351cd411e80adad31a3425f6a7b46d5a02d40c4e914d9983c4bdc75e4
MD5 13fe225383504b9ebc62ddade83f64d0
BLAKE2b-256 12ae0ced12a4b4d24392dc3c1f86b2ac75f44f32ab03c3a7b721d4a789c94622

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp34-cp34m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp34-cp34m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 db59c655a294958f8e22b43e9b545a3f2dc0f6319b1af8d356c3f0521836f769
MD5 07bde237c729aec4a45f55c29f163163
BLAKE2b-256 7d3a41ef14e67ee85d8368c058408e9d267f42ee3f68cd54de4a3b50391238e9

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 f6550d2d1202be49bb1dcb9e3debfb6a73effdb7d4a017c5c15d3173ea75ff0d
MD5 6d91ddf87112ba48b51f10d0ed32086c
BLAKE2b-256 8b519fb591cf9f6b77323a767c9465f240708248219a996d00831aafa93a0fdd

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27mu-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27mu-manylinux1_i686.whl
Algorithm Hash digest
SHA256 ddae394c3caab0e0a94f8efa0c9dbaf36e3b9b4d6de605d4afe2850d6de88b2d
MD5 62b8875beb28bad9d6c6e721b15933c7
BLAKE2b-256 357cfcfbf055f6528bf76a2e6e7826e0933573a1d21c88141deb7fb879e26fae

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27m-win_amd64.whl
Algorithm Hash digest
SHA256 6aaae0a87df4bd6817971ecdfe16f3ebf55b6fff4b73b53ab901798faa8e809d
MD5 0bf042e5de92ffc4289ed1fc765ed85f
BLAKE2b-256 c086ed880114b9ddef30a9352407adc5ab4b7e994cd66b67f1fe5d37be2fad12

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27m-win32.whl
Algorithm Hash digest
SHA256 178de9952e4cf72c639453676162ff1b703443b6e695eff6d09d8be9a920bf46
MD5 8fdf6468db11fd6af64c2ce884d27a28
BLAKE2b-256 a06ca0b7f157a6223fdfa0b39722be672152456585173dbad046f0f71e54446e

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 ed79e71a4d99f880a9d039c3efd14a4d696766fed3dd02930f500061922232b1
MD5 b7c9936759224f0d1aab0f7d44374a24
BLAKE2b-256 045ffb229dd99c04d87ff4a228df7fc3ae0ff161e5bcbbc990fc26c91f10fd9d

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 fcf209c4cae4701f0f90bd03de12c90b8a5624f618f7bc39d963b7fdb17d76fb
MD5 1309daed30e205f8006947842cb36c86
BLAKE2b-256 5bf601d74d37eb0b7e42184b4d58c31f77877946369aeb3cd38a874baea91501

See more details on using hashes here.

File details

Details for the file dedupe-1.6.12-cp27-cp27m-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.12-cp27-cp27m-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 96d6fd6c11e194fab5812524fd6ea15675c8fe28be80fa737ca568e2736fd21d
MD5 06e56166b921a5a1e8ffc154a538b170
BLAKE2b-256 d014b37ab23be8035171a22001b5f81b8468382c22002a47a9973077df593e13

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page