Skip to main content

A python library for accurate and scaleable data deduplication and entity-resolution

Project description

dedupe is a library that uses machine learning to perform de-duplication and entity resolution quickly on structured data. dedupe is the open source engine for dedupe.io

dedupe will help you:

  • remove duplicate entries from a spreadsheet of names and addresses

  • link a list with customer information to another with order history, even without unique customer id’s

  • take a database of campaign contributions and figure out which ones were made by the same person, even if the names were entered slightly differently for each record

dedupe takes in human training data and comes up with the best rules for your dataset to quickly and automatically find similar records, even with very large databases.

Important links:

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dedupe-1.6.13.tar.gz (48.6 kB view details)

Uploaded Source

Built Distributions

dedupe-1.6.13-cp36-cp36m-manylinux1_x86_64.whl (74.4 kB view details)

Uploaded CPython 3.6m

dedupe-1.6.13-cp36-cp36m-manylinux1_i686.whl (71.2 kB view details)

Uploaded CPython 3.6m

dedupe-1.6.13-cp36-cp36m-macosx_10_11_x86_64.whl (50.1 kB view details)

Uploaded CPython 3.6m macOS 10.11+ x86-64

dedupe-1.6.13-cp35-cp35m-manylinux1_x86_64.whl (74.2 kB view details)

Uploaded CPython 3.5m

dedupe-1.6.13-cp35-cp35m-manylinux1_i686.whl (70.9 kB view details)

Uploaded CPython 3.5m

dedupe-1.6.13-cp34-cp34m-win_amd64.whl (50.8 kB view details)

Uploaded CPython 3.4m Windows x86-64

dedupe-1.6.13-cp34-cp34m-win32.whl (50.1 kB view details)

Uploaded CPython 3.4m Windows x86

dedupe-1.6.13-cp34-cp34m-manylinux1_x86_64.whl (74.4 kB view details)

Uploaded CPython 3.4m

dedupe-1.6.13-cp34-cp34m-manylinux1_i686.whl (71.1 kB view details)

Uploaded CPython 3.4m

dedupe-1.6.13-cp27-cp27mu-manylinux1_x86_64.whl (72.1 kB view details)

Uploaded CPython 2.7mu

dedupe-1.6.13-cp27-cp27mu-manylinux1_i686.whl (69.4 kB view details)

Uploaded CPython 2.7mu

dedupe-1.6.13-cp27-cp27m-win_amd64.whl (50.9 kB view details)

Uploaded CPython 2.7m Windows x86-64

dedupe-1.6.13-cp27-cp27m-win32.whl (50.1 kB view details)

Uploaded CPython 2.7m Windows x86

dedupe-1.6.13-cp27-cp27m-manylinux1_x86_64.whl (72.1 kB view details)

Uploaded CPython 2.7m

dedupe-1.6.13-cp27-cp27m-manylinux1_i686.whl (69.5 kB view details)

Uploaded CPython 2.7m

dedupe-1.6.13-cp27-cp27m-macosx_10_11_x86_64.whl (49.7 kB view details)

Uploaded CPython 2.7m macOS 10.11+ x86-64

File details

Details for the file dedupe-1.6.13.tar.gz.

File metadata

  • Download URL: dedupe-1.6.13.tar.gz
  • Upload date:
  • Size: 48.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dedupe-1.6.13.tar.gz
Algorithm Hash digest
SHA256 e588e115da5178354e2ad8e51e509e79c9c5e0c90ba80643f7bbc4e543f1b55d
MD5 de3724697ce8ca5114b1225cc2977a34
BLAKE2b-256 46348c3d3c3138a8f5ded739d2f483509253013b4f2a2b02a3e8a29b6234417f

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 5b99f09df1e1e57063cac0b16c24d2cea178f12810ca5854dafe89d7e3006bd9
MD5 d716c79b6c09bfd7c6fb52c595e7b971
BLAKE2b-256 ac6e24ec2348b12912dcb5d8b74819a3e5569b40974e0020f1230eaab548f0d3

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp36-cp36m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp36-cp36m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 a494d093560344582d3580f48305e7c849aed28ae8636263d5f45914d7e51d51
MD5 9178469cde5bd2a101a7862472d262d0
BLAKE2b-256 5aab0cb73a93835979633b96d650ac978e2acd7ad4d1e4d96ea3bb76b7c53a38

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp36-cp36m-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp36-cp36m-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 495ed073e79118c5e207b15a7bbbebc81f22890063bfc0ecf4cb8cce9cd42fbe
MD5 170820a8cac8c706daa0f48f73cc1c1e
BLAKE2b-256 b4dbf2f57a3d60a4d1a601d32c7c879b23678b5c2e83a86689428e8aa3e2d947

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 5080959f045115078d24d33f0233f259bf7d7da415d611ab611d0f82943ff0ce
MD5 a279145e7caeacd26e4e974d884a622e
BLAKE2b-256 164c934429017011b029e1d910f7871184acd1a415e8f3ce5a5974b6c5e172d6

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp35-cp35m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp35-cp35m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 4e76af9f98c4fda42617988e0bbae43b694a0a3dd0ddf3fddc1ecf9ea29a0ae6
MD5 3166dffe7b81c01ceadd83e6e28d12c8
BLAKE2b-256 14fdd38a839a90fc285da2f2f879189c20e5694f8145ad86e0e74960f79ccd88

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp34-cp34m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp34-cp34m-win_amd64.whl
Algorithm Hash digest
SHA256 2246db95c703c4722497644f58a280ccaf5f5618e2065299cad476521053725e
MD5 c6124385a321f3aac8dbe4c267ca9d3a
BLAKE2b-256 7b38a6aed3db6299d12c14e07feb19622ecf71a8ed352549a3591353084e2bbf

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp34-cp34m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp34-cp34m-win32.whl
Algorithm Hash digest
SHA256 5135777a3dc487637298a2b98548d9aabd987e92fc707e950637019350cdf87c
MD5 dbdad503a6267426e7bf5833aaec82ce
BLAKE2b-256 87c5fd427459ac9a8c41773fe3b33fc51191920873d47307908e492f314db8b1

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 c3e77c044058365852005add881a05d411182b0934dc7aa551644043e0e6a9ca
MD5 dccb5de276cebc8087ea3d16c5fd3bc3
BLAKE2b-256 e80939dfdc7cb20724f765912ac0f782efe62c2c364adafc714d9b0f701a32c7

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp34-cp34m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp34-cp34m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 0a829db7e5e9bc5566526aac20bfa2c328cd201e1edef43d0a64441e9e8ffe29
MD5 ebfc675a4094fc5a146d73670137446d
BLAKE2b-256 a2aed1001a172aedf73afe487c719161d292eccf3902c679f54396c0b2008081

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 f42b9eb0c7a8e20a77d2be493376702be00996143a88e6894ac2a18ab18bfe69
MD5 1f2ea14c82dbf496081b07ab4f3bb8db
BLAKE2b-256 cb2fe81e04df2a5deb661b1e4990ad0a3798f5163a171e3bd304cae5482cad1d

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27mu-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27mu-manylinux1_i686.whl
Algorithm Hash digest
SHA256 62b806ff31730d23fed74cd3cc61d077a07a8f97906930357ad610260bc18330
MD5 a42a6ca98f922f321e39f07330e438a7
BLAKE2b-256 987c3554803ba477fcecb897908d1b20fa4499e61e65a0f5d2fcc442e82611a1

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27m-win_amd64.whl
Algorithm Hash digest
SHA256 cb8bc2257f82aa230b084ff0c199526f44012e06e335c28ea81db05a5cfcfda2
MD5 035925fa2cc9801b7feabfcd61789025
BLAKE2b-256 16f5a94507d5dadb6ff476d0b1d8d8b1f71776279e33423a17e0d44e8a76017b

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27m-win32.whl
Algorithm Hash digest
SHA256 724efcc0673f84b715fc9edb46f8b8fdeb9d85b36775d5c3f4653fc6879fa932
MD5 b5aa94757a3167536b0f6c4d4f89394d
BLAKE2b-256 605bdfeaa2760fbdd93919deffeca47ce746a6f32139904554e71def8c28c558

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 20849118864fdb71a11a4f797b67eaa2d27a9f02a20a37f06f3a3144c47bfdf2
MD5 f919f0c89555330f4e65df6370cfcdf5
BLAKE2b-256 12f833bc6b8cf16ff39ac6340530e0bb844678ba75376dc0b8657a4b55da07f9

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 2e77d3a4b36d4f526706ef9bd547551353a64eb639043017c07df66d07ef0e3a
MD5 7190691087dfdd1308f4a9dbc5e4347f
BLAKE2b-256 e9450e6e9f3c2570636cf5711df41c25820ec77abf27a768f66d8999e62cd089

See more details on using hashes here.

File details

Details for the file dedupe-1.6.13-cp27-cp27m-macosx_10_11_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.6.13-cp27-cp27m-macosx_10_11_x86_64.whl
Algorithm Hash digest
SHA256 d03db5b07f3e7c97ca63c5a3afa4f57f81640972f848809f26062b91380a396a
MD5 e31a9e026d18df31119e6136c2eeefec
BLAKE2b-256 e8e14666158c33e55592de7e55520dade89b097996f52afbf0f9a2c36d3d8544

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page