Deduplicate mails from mbox files and maildir folders.

Project description

Command-line tool to deduplicate mails from a set of mbox files and/or maildir folders.

Features

  • Duplicate detection based on cherry-picked mail headers.

  • Source mails from multiple mbox files and/or maildir folders.

  • Multiple removal strategies based on size, timestamp or file path.

  • Dry-run mode.

  • Protection against false positives by checking for size and content differences.
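
The header-based detection and the size check listed above can be sketched roughly as follows. This is a minimal illustration built on the Python standard library, not the tool's actual implementation: the header selection in `HEADERS`, the `max_size_delta` threshold and both function names are assumptions made for the example.

```python
import hashlib
from collections import defaultdict
from email.message import Message

# Assumed header selection, for illustration only; the tool's real list may differ.
HEADERS = ("Date", "From", "To", "Subject", "Message-ID")


def header_digest(msg: Message, headers=HEADERS) -> str:
    """Hash the selected headers of a mail into a single hex digest."""
    parts = []
    for name in headers:
        value = str(msg.get(name, ""))
        # Collapse whitespace so folded and unfolded headers compare equal.
        parts.append(f"{name.lower()}:{' '.join(value.split())}")
    return hashlib.sha256("\n".join(parts).encode("utf-8")).hexdigest()


def group_duplicates(messages, max_size_delta=512):
    """Group mails by header digest and return the sets that look like duplicates.

    A set is dropped when its members differ in size by more than
    max_size_delta bytes (hypothetical threshold), as a guard against
    false positives.
    """
    groups = defaultdict(list)
    for msg in messages:
        groups[header_digest(msg)].append(msg)

    duplicate_sets = []
    for members in groups.values():
        if len(members) < 2:
            continue  # a single mail is not a duplicate of anything
        sizes = [len(m.as_bytes()) for m in members]
        if max(sizes) - min(sizes) > max_size_delta:
            continue  # sizes diverge too much; treat as a false positive
        duplicate_sets.append(members)
    return duplicate_sets
```

The messages passed in could come from the standard library's `mailbox.mbox(path)` or `mailbox.Maildir(path)`, both of which yield `Message` objects when iterated.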

Installation

This package is available on PyPI, so you can install the latest stable release and its dependencies with a single pip call:

$ pip install mail-deduplicate

Documentation

Docs are hosted on Read the Docs.

Changes for v3.0.0 (2020-09-03)

  • Rename project from maildir-deduplicate to mail-deduplicate.

  • Rename master branch to main.

  • Add support for mboxes. Closes #48.

  • Remove the requirement for the -s/--strategy parameter, so that mails can be grouped into duplicate sets without any removal action, effectively acting as a second-level dry run.
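
The effect of an optional strategy can be illustrated with a small sketch. Everything here is hypothetical: the `Mail` record, the `select_for_removal` function and the strategy names `keep-biggest` and `keep-newest` are invented for the example and are not taken from the tool's option list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mail:
    """Hypothetical record describing one mail of a duplicate set."""
    path: str
    size: int
    timestamp: float


def select_for_removal(duplicates, strategy=None):
    """Return the mails a strategy would delete from one duplicate set.

    With no strategy the set is only reported and nothing is selected,
    which is what makes the run behave like a dry run.
    """
    if strategy is None:
        return []
    if strategy == "keep-biggest":  # hypothetical strategy name
        keeper = max(duplicates, key=lambda m: m.size)
    elif strategy == "keep-newest":  # hypothetical strategy name
        keeper = max(duplicates, key=lambda m: m.timestamp)
    else:
        raise ValueError(f"unknown strategy: {strategy!r}")
    # Everything except the single kept mail is a removal candidate.
    return [m for m in duplicates if m is not keeper]
```

Keeping the "no strategy" case as an explicit early return makes it easy to see that grouping and removal are separate steps.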

Full changelog.

Download files

Source Distribution

mail-deduplicate-3.0.0.tar.gz (36.9 kB)

Uploaded Source

Built Distributions

mail_deduplicate-3.0.0-py3.8.egg (24.6 kB)

Uploaded Source

mail_deduplicate-3.0.0-py2.py3-none-any.whl (32.0 kB)

Uploaded Python 2 Python 3

File details

Details for the file mail-deduplicate-3.0.0.tar.gz.

File metadata

  • Download URL: mail-deduplicate-3.0.0.tar.gz
  • Size: 36.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.2

File hashes

Hashes for mail-deduplicate-3.0.0.tar.gz
Algorithm Hash digest
SHA256 826b1029b9eea23aaf2bd3ca137bffa2cb8ed6673ad553630f4989da3f188b2a
MD5 8910dfe2a1624b288f2ad1006c501af8
BLAKE2b-256 8087025556b90a980049cb7dd0131e276246315246bbd53a7d0819568542de1a

See more details on using hashes here.
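
A downloaded file can be checked against the SHA256 digest published above with a few lines of standard-library Python. This is a minimal sketch; the function names `sha256_of_file` and `matches_published` are made up for the example.

```python
import hashlib
import hmac


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, expected_hex):
    """Return True when the file's digest equals the published one."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.strip().lower())
```

For the source distribution, `expected_hex` would be the SHA256 value listed for mail-deduplicate-3.0.0.tar.gz and `path` the location of the downloaded archive.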

File details

Details for the file mail_deduplicate-3.0.0-py3.8.egg.

File metadata

  • Download URL: mail_deduplicate-3.0.0-py3.8.egg
  • Size: 24.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.2

File hashes

Hashes for mail_deduplicate-3.0.0-py3.8.egg
Algorithm Hash digest
SHA256 366e4fd30872896bb24d8521e0a49f42c5ce4f6f25a7c151f0a9cea0b667990e
MD5 084c7ddb853d3a19c4bbdae72d5d4578
BLAKE2b-256 f9e0a763a0a4ad592c1a3ff11e75133268c955e9b4b4af6c367979585df70610

File details

Details for the file mail_deduplicate-3.0.0-py2.py3-none-any.whl.

File metadata

  • Download URL: mail_deduplicate-3.0.0-py2.py3-none-any.whl
  • Size: 32.0 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.2

File hashes

Hashes for mail_deduplicate-3.0.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 99569e3f5298147cf9bb123017fe0cc5d643cc61f0c149e65e234d8759d38fc2
MD5 84fdd683f3f9bcebc66eb4d718adfc1d
BLAKE2b-256 0b1a8bbb9704f2ea6d08830bff77617a7e741d348ba63efb33d7e8b504b0cf99
