Skip to main content

A python library for accurate and scaleable data deduplication and entity-resolution

Project description

dedupe is a library that uses machine learning to perform de-duplication and entity resolution quickly on structured data. dedupe is the open source engine for dedupe.io

dedupe will help you:

  • remove duplicate entries from a spreadsheet of names and addresses

  • link a list with customer information to another with order history, even without unique customer id’s

  • take a database of campaign contributions and figure out which ones were made by the same person, even if the names were entered slightly differently for each record

dedupe takes in human training data and comes up with the best rules for your dataset to quickly and automatically find similar records, even with very large databases.

Important links:

Project details


Release history Release notifications | RSS feed

This version

1.8.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dedupe-1.8.1.tar.gz (54.6 kB view details)

Uploaded Source

Built Distributions

dedupe-1.8.1-cp36-cp36m-manylinux1_x86_64.whl (78.6 kB view details)

Uploaded CPython 3.6m

dedupe-1.8.1-cp36-cp36m-manylinux1_i686.whl (74.9 kB view details)

Uploaded CPython 3.6m

dedupe-1.8.1-cp36-cp36m-macosx_10_11_x86_64.whl (52.3 kB view details)

Uploaded CPython 3.6m macOS 10.11+ x86-64

dedupe-1.8.1-cp35-cp35m-manylinux1_x86_64.whl (78.4 kB view details)

Uploaded CPython 3.5m

dedupe-1.8.1-cp35-cp35m-manylinux1_i686.whl (74.7 kB view details)

Uploaded CPython 3.5m

dedupe-1.8.1-cp34-cp34m-win_amd64.whl (53.0 kB view details)

Uploaded CPython 3.4m Windows x86-64

dedupe-1.8.1-cp34-cp34m-win32.whl (52.3 kB view details)

Uploaded CPython 3.4m Windows x86

dedupe-1.8.1-cp34-cp34m-manylinux1_x86_64.whl (78.5 kB view details)

Uploaded CPython 3.4m

dedupe-1.8.1-cp34-cp34m-manylinux1_i686.whl (74.9 kB view details)

Uploaded CPython 3.4m

dedupe-1.8.1-cp27-cp27mu-manylinux1_x86_64.whl (75.7 kB view details)

Uploaded CPython 2.7mu

dedupe-1.8.1-cp27-cp27mu-manylinux1_i686.whl (72.8 kB view details)

Uploaded CPython 2.7mu

dedupe-1.8.1-cp27-cp27m-win_amd64.whl (52.9 kB view details)

Uploaded CPython 2.7m Windows x86-64

dedupe-1.8.1-cp27-cp27m-win32.whl (52.1 kB view details)

Uploaded CPython 2.7m Windows x86

dedupe-1.8.1-cp27-cp27m-manylinux1_x86_64.whl (75.7 kB view details)

Uploaded CPython 2.7m

dedupe-1.8.1-cp27-cp27m-manylinux1_i686.whl (72.8 kB view details)

Uploaded CPython 2.7m

dedupe-1.8.1-cp27-cp27m-macosx_10_11_x86_64.whl (51.7 kB view details)

Uploaded CPython 2.7m macOS 10.11+ x86-64

File details

Details for the file dedupe-1.8.1.tar.gz.

File metadata

  • Download URL: dedupe-1.8.1.tar.gz
  • Upload date:
  • Size: 54.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dedupe-1.8.1.tar.gz
Algorithm Hash digest
SHA256 b6e2592489467a578982e4dc6883fd0bb2271fa5e8b897be91a268cda9bb7493
MD5 afffbd7dcb7cb70f5a4f1af854e568c7
BLAKE2b-256 e24917bd20cdd7ff5da5e7b89004a67e8e56990c3ba45323b922f076bd04621d

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 68f8d214bfe13c74381fb39641a60ec23a7cff3313907d940f1bcdbbfec41aa7
MD5 aeffbe702db4a78845c2d04cf790d16f
BLAKE2b-256 64091c23b2f8e2ffad6f9cdab7fe1dcb4459e25d7dd667acc7971b2abe6cf45d

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp36-cp36m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp36-cp36m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 13b32a818c2300a063ef3b274aedf40133431f8fe70a4b69c2b3dce72086942f
MD5 dd1e28368558b5d97610f6e34669befc
BLAKE2b-256 782f03b48cd591ef401091ae34e0aa0c129b2d4ec80eb21da837e5e43be8bb48

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp36-cp36m-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp36-cp36m-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 96685fb00827feeda4c3b8346ccad6c9bc897b633e1ad8705ad448ecadb1b2a7
MD5 c5daa2b882c4dd7aaf00e4dceb900ea2
BLAKE2b-256 16d29ea36c6ba36f48307ae94aade7c08e5c03a478566c26caf13c59fdbd614c

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 20dcd5b90cc595f17beb32118b0f5fc7be220806694078181a45a779ce9e303c
MD5 70a9fed165c79267a80a803a60b94604
BLAKE2b-256 bfde2850e2abcfc70d4e0673b1b6c48a739e3e63ac76ab39bc801234561439d7

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp35-cp35m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp35-cp35m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 1bda5b421243a6fd184ee1263a1687060d85763ebfda98dd91cfc75e7b1c2e68
MD5 fb4c3695ddada626c60148f922ec352a
BLAKE2b-256 866c406548f55c07957be44cceb3eea7c115303286f88b1b6061c60ad6db2c0c

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp34-cp34m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp34-cp34m-win_amd64.whl
Algorithm Hash digest
SHA256 dd3f3c56e826c0b82f26485efed8e8c7a43f931acac291f1b3994ab2bd3c193e
MD5 c80da324dbc6d72e45e7568580807db2
BLAKE2b-256 f9c5d45f4164304d2731d215f52e2346801983d09f21c48ab13fc76748223828

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp34-cp34m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp34-cp34m-win32.whl
Algorithm Hash digest
SHA256 ef69bbc2f3e811f6a1e36163fd58049d03669a8b8b5527c022bf8d5ad7fcebe8
MD5 d39a1795c7d6ec979d2971081b3547e4
BLAKE2b-256 44a357f20bbda67d8aac3d9af210de240f3b9f579eb8d64b8fafce8e5f16b3cd

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 ddd5f49e97d3b5fcdbc2bfe7ac2d4c3df88d9eae336663c5f2a0ceb7760cd627
MD5 46097d9b1af064732cae6809cd0d056f
BLAKE2b-256 def9b270fbf8cbf47bdda2b0f143d359afcdeb93599fa1bd92aa7cf99283f080

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp34-cp34m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp34-cp34m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 fcb1bd720ac09abe60586a8309de1a3df8892913003ea23deca802fd3bbdc6b2
MD5 0641b74f132e5b7f04abdb29f53845d3
BLAKE2b-256 187784988668d98a837fe1bb092c88b401de475e22fdc94a4c58846d07f7a955

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 42fad8feda5ecec61c81e57d74b49e395e7e7d21877ae4aa25eaaee9fd9d114d
MD5 10a559ab01c68b6c6b038b6ff78d4931
BLAKE2b-256 04c75c3f0e0fd089bffaf0a1fc0e1aea04df36477d52f0fd97c68c1b4e2e4c14

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27mu-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27mu-manylinux1_i686.whl
Algorithm Hash digest
SHA256 fe002e49d26331bfc36eea7195ef624332fbbf024ea77aa771f158ac10379eb6
MD5 c1800c4010d514abde333e60accb5a4c
BLAKE2b-256 a51e66e42b95c847e4098362221a9f896c0f946d33c01a3ba77c813fef90a1ea

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27m-win_amd64.whl
Algorithm Hash digest
SHA256 ff89ede224ef6ead4c53df20b6131680d443320c302f9c21e61aa5a68eedcbd4
MD5 d713fcf1ed41570c577a01b7ff272442
BLAKE2b-256 dfb9cbcef6fbf9e46482368112a1ce43446817656bf898f8a442736d20ed48c8

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27m-win32.whl
Algorithm Hash digest
SHA256 508b9ba9aa4921a3df523eeb74ca31702a2ef1c25683b15ffdef9c4d866fec7a
MD5 0f5f581e0d84e9f74380a9bb71476993
BLAKE2b-256 5c8b2918bcdb3718e6a4f2ff8fc3ae69ca2980d8b567ab4f56224e6e20f1628e

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 a6aab0c969bb87a557e840d15a4e2de98eaa37601da75802d993081f42beb030
MD5 94913fe0687d9708401af96cd6afa86f
BLAKE2b-256 5a74e7013a4e5b55567da9297f50dd3da3236e6e9d75461ccaf12fab93dd8594

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 621ee623e3e8d1879dda6c45b31dbb8af5af33fd979afdf57a9f796d3be884f2
MD5 333c7d9650ff37c57fb81cdcf3f51639
BLAKE2b-256 647e24f6dfac2d3b95896a28714e9646effbc286acfd9a34c415a8a0b4df3159

See more details on using hashes here.

File details

Details for the file dedupe-1.8.1-cp27-cp27m-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.8.1-cp27-cp27m-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 0729e9b6c7e0ded09b9589ca554833efa9d5d3933d053b1149925fe6b36cf933
MD5 3d452c717caf4ff902e865f14a1ca8e4
BLAKE2b-256 1a2781c7bffbed11092b916e0a94fbcf8fd55c59131ea5c0e047400d033d7919

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page