Skip to main content

A python library for accurate and scaleable data deduplication and entity-resolution

Project description

dedupe is a library that uses machine learning to perform de-duplication and entity resolution quickly on structured data. dedupe is the open source engine for dedupe.io

dedupe will help you:

  • remove duplicate entries from a spreadsheet of names and addresses

  • link a list with customer information to another with order history, even without unique customer id’s

  • take a database of campaign contributions and figure out which ones were made by the same person, even if the names were entered slightly differently for each record

dedupe takes in human training data and comes up with the best rules for your dataset to quickly and automatically find similar records, even with very large databases.

Important links:

Project details


Release history Release notifications | RSS feed

This version

1.9.2

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dedupe-1.9.2.tar.gz (56.3 kB view details)

Uploaded Source

Built Distributions

dedupe-1.9.2-cp36-cp36m-manylinux1_x86_64.whl (79.3 kB view details)

Uploaded CPython 3.6m

dedupe-1.9.2-cp36-cp36m-manylinux1_i686.whl (76.2 kB view details)

Uploaded CPython 3.6m

dedupe-1.9.2-cp36-cp36m-macosx_10_12_x86_64.whl (51.4 kB view details)

Uploaded CPython 3.6m macOS 10.12+ x86-64

dedupe-1.9.2-cp35-cp35m-manylinux1_x86_64.whl (79.1 kB view details)

Uploaded CPython 3.5m

dedupe-1.9.2-cp35-cp35m-manylinux1_i686.whl (76.0 kB view details)

Uploaded CPython 3.5m

dedupe-1.9.2-cp34-cp34m-win_amd64.whl (52.3 kB view details)

Uploaded CPython 3.4m Windows x86-64

dedupe-1.9.2-cp34-cp34m-win32.whl (51.6 kB view details)

Uploaded CPython 3.4m Windows x86

dedupe-1.9.2-cp34-cp34m-manylinux1_x86_64.whl (79.2 kB view details)

Uploaded CPython 3.4m

dedupe-1.9.2-cp34-cp34m-manylinux1_i686.whl (76.1 kB view details)

Uploaded CPython 3.4m

dedupe-1.9.2-cp27-cp27mu-manylinux1_x86_64.whl (76.9 kB view details)

Uploaded CPython 2.7mu

dedupe-1.9.2-cp27-cp27mu-manylinux1_i686.whl (74.0 kB view details)

Uploaded CPython 2.7mu

dedupe-1.9.2-cp27-cp27m-win_amd64.whl (52.2 kB view details)

Uploaded CPython 2.7m Windows x86-64

dedupe-1.9.2-cp27-cp27m-win32.whl (51.3 kB view details)

Uploaded CPython 2.7m Windows x86

dedupe-1.9.2-cp27-cp27m-manylinux1_x86_64.whl (76.9 kB view details)

Uploaded CPython 2.7m

dedupe-1.9.2-cp27-cp27m-manylinux1_i686.whl (74.0 kB view details)

Uploaded CPython 2.7m

dedupe-1.9.2-cp27-cp27m-macosx_10_12_x86_64.whl (50.8 kB view details)

Uploaded CPython 2.7m macOS 10.12+ x86-64

File details

Details for the file dedupe-1.9.2.tar.gz.

File metadata

  • Download URL: dedupe-1.9.2.tar.gz
  • Upload date:
  • Size: 56.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dedupe-1.9.2.tar.gz
Algorithm Hash digest
SHA256 90e0f2ae97fd571abc15f9e0542ea58e87331447257796d8ab0fa3ed6e859a85
MD5 94144f8a61a55d008d7a4f8f95e88c95
BLAKE2b-256 72583ce127d336ecb9756af5caccf36f54c3a063c8331f94aa2c72246978b9e6

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 0f40202e2509e8e6823914372a39991e8ee71b3445ac6728178a443e98d7d6f1
MD5 35b1235e394f7efd6cd4d4e9e22e0cda
BLAKE2b-256 964f4f639269e01f91d09be486647c4f818bef731a77cc64ec3395be6e91e57f

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp36-cp36m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp36-cp36m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 aa136eae318b69357fe4a3359a9c562966a451f145cf448e51ea01b096fe99c9
MD5 67c5015a86de694b9112dad936421c6d
BLAKE2b-256 0118d05b61f3f2f065e47df3faee910a2e8c9c4552df95ba497ace4ae5d4c4a0

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp36-cp36m-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp36-cp36m-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 5564967e1cec7817e479d191dc44407b4c97add0a7578a30cd325f495e063a9f
MD5 1b7c3edc2431d08bda84c2d3fc47020b
BLAKE2b-256 62cf7f6e03c4db8e6093ea790f9cdc2a7a644e2ec98821ff3d9ef924e12c544d

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 54462ed0e95c004f611df6c595145e1e962e818767624517fba8cb82f7e21b40
MD5 952fc285245d9f9cfb5f13546f5cfc63
BLAKE2b-256 e8a31d855d361d1b2865d8637dcd334298f2198043c5e736df3185c251868e88

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp35-cp35m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp35-cp35m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 6a63df88760d448a0a205d475050753f1cff3b512fad760ca53bbc3eb35e7d2f
MD5 b9c3c3371c9abd7de53cf3b36e6f1030
BLAKE2b-256 70379519f40927d03b6d3e46b81f25a1c2d6452d772e5521fe8d1a8dd3d56f59

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp34-cp34m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp34-cp34m-win_amd64.whl
Algorithm Hash digest
SHA256 0e87ff03062b1020d7dbd338f6f8cde4b8742314adf3675558088fb09a3202a3
MD5 3006c37808d58504d1bfc25e58e3029d
BLAKE2b-256 e76fcfb2d61331b99b1804405de02e10dca99e005f9615595ec9ba5dde1fa295

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp34-cp34m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp34-cp34m-win32.whl
Algorithm Hash digest
SHA256 1e000aafcf231d2260b31f78c3f005102f5dd9b6cbb747fe4b9ed055f44299dd
MD5 122b96590bddac9b6243d0cca4d4a51d
BLAKE2b-256 404f81151a57828ccc9edd1860bfeee6b706c4835c0654b9342525b8f5eb5bec

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 52fbbaa94b66868498805d8b101d19825ecae69f677d955589a261b7f34d05e5
MD5 a7effaa0d06488aecf9880be51d833fc
BLAKE2b-256 014e0e055de675a68896062428e31d577e5b1802ad453b6807df788ff2f643ea

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp34-cp34m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp34-cp34m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 14f134b061e65b9ff6f0da4d5f4aa3a9bff8ab0b076ac440b140f2909b4f1d03
MD5 36299fb1b5328475011296b1a6118c63
BLAKE2b-256 e4d0acfa3175c2cb062c5b8c168daac65a6f1f20a4b1830b6ec2c7443727a29e

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 43ba9f90efa3bbc94d243669ac661cf219edca014b66a30fbad3df8707bf4c6b
MD5 914d632f0c132732dc8b7fe79f1da2a5
BLAKE2b-256 96f7062e76556f8f8614f80f7262ebe3cd9a9dc0f6d165c894fbacc15f6c532e

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27mu-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27mu-manylinux1_i686.whl
Algorithm Hash digest
SHA256 0eeb13812252e869e7526d9370fe602c5476b7799f5457b2a74c0e5dadfb1d0a
MD5 5ac116c28b1b82fd84f8f0bedb0918e8
BLAKE2b-256 b8e61676a77fc8ee5911aec7a387a35eb4a653fde8b83dcad8f69bff2c37e95c

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27m-win_amd64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27m-win_amd64.whl
Algorithm Hash digest
SHA256 f6d45d976a4a8a728cf86451c0e62108bebb161e5cde9afc341e6dce62e48dd3
MD5 d8b30b370c37426de8a56d87099f9d28
BLAKE2b-256 71d0f943380cc1902fd097d88872cba07df09c271cd645a77e71a03fd21f8cc5

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27m-win32.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27m-win32.whl
Algorithm Hash digest
SHA256 1dd909e7e1ce4d602fbf1d400a8074483233b6ad8b7d789b78baf88fd063d793
MD5 0e9fc664b77fbc797b52bd4ace0912a5
BLAKE2b-256 64e68ac5a1558b8944c35f466e49719890ab745af71c10b9573e20c847cd3bd4

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 433534a2ec42c3a418ffaed770ed6f4185fa677834a8ff1bde8d451d5e511aaf
MD5 b55d368a771c9dea4b2ea1839ad004d9
BLAKE2b-256 5f68c45a028ee64c7786cf9a74360a8cf9362c757ce1145c6dec21335b3d3b9b

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27m-manylinux1_i686.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27m-manylinux1_i686.whl
Algorithm Hash digest
SHA256 ed51df125ea91fa5a04870b4178edbef0fe06c37d15fc089ac6ff029bab10f13
MD5 dd2520b1256f5d42b8c6c18ce7af8a9a
BLAKE2b-256 47d14d7d8c336e361b1bca5fd39b6a92a4984cc6541ec7687fb292b981108cf6

See more details on using hashes here.

File details

Details for the file dedupe-1.9.2-cp27-cp27m-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for dedupe-1.9.2-cp27-cp27m-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 baa0b73df57e1f2190d2095a22b3d20d50e44e92b7559a22c0ef9245eadd1f04
MD5 2ff2d0da91440f97dc46cb24c829c5f6
BLAKE2b-256 ee861154bb9e8bd0f3328491a37dc6db4d282fff2fbe9bdb3be2735e5426870c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page