parsedmarc

A Python package and CLI for parsing aggregate and forensic DMARC reports

These details have not been verified by PyPI

Project links

Homepage

Project description

A screenshot of DMARC summary charts in Kibana

parsedmarc is a Python module and CLI utility for parsing DMARC reports. When used with Elasticsearch and Kibana (or Splunk), it works as a self-hosted open source alternative to commercial DMARC report processing services such as Agari Brand Protection, Dmarcian, OnDMARC, ProofPoint Email Fraud Defense, and Valimail.

Features

Parses draft and 1.0 standard aggregate/rua reports
Parses forensic/failure/ruf reports
Can parse reports from an inbox over IMAP
Transparently handles gzip or zip compressed reports
Consistent data structures
Simple JSON and/or CSV output
Optionally email the results
Optionally send the results to Elasticsearch and/or Splunk, for use with premade dashboards
Optionally send reports to Apache Kafka

Resources

DMARC guides

Demystifying DMARC - A complete guide to SPF, DKIM, and DMARC

SPF and DMARC record validation

If you are looking for SPF and DMARC record validation and parsing, check out the sister project, checkdmarc.

Lookalike domains

DMARC protects against domain spoofing, not lookalike domains. For open source lookalike domain monitoring, check out DomainAware.

CLI help

usage: parsedmarc [-h] [-c CONFIG_FILE] [--strip-attachment-payloads]
               [-o OUTPUT] [-n NAMESERVERS [NAMESERVERS ...]]
               [-t DNS_TIMEOUT] [-s] [--debug] [--log-file LOG_FILE] [-v]
               [file_path [file_path ...]]

Parses DMARC reports

positional arguments:
  file_path             one or more paths to aggregate or forensic report
                        files or emails

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG_FILE, --config-file CONFIG_FILE
                        A path to a configuration file (--silent implied)
  --strip-attachment-payloads
                        remove attachment payloads from forensic report output
  -o OUTPUT, --output OUTPUT
                        write output files to the given directory
  -n NAMESERVERS [NAMESERVERS ...], --nameservers NAMESERVERS [NAMESERVERS ...]
                        nameservers to query (default is Cloudflare's
                        nameservers)
  -t DNS_TIMEOUT, --dns_timeout DNS_TIMEOUT
                        number of seconds to wait for an answer from DNS
                        (default: 6.0)
  -s, --silent          only print errors and warnings
  --debug               print debugging information
  --log-file LOG_FILE   output logging to a file
  -v, --version         show program's version number and exit

Configuration file

parsedmarc can be configured by supplying the path to an INI file

parsedmarc -c /etc/parsedmarc.ini

For example

# This is an example comment

[general]
save_aggregate = True
save_forensic = True

[imap]
host = imap.example.com
user = dmarcresports@example.com
password = $uperSecure
watch = True

[elasticsearch]
hosts = 127.0.0.1:9200
ssl = False

[splunk_hec]
url = https://splunkhec.example.com
token = HECTokenGoesHere
index = email

The full set of configuration options are:

general
- save_aggregate - bool: Save aggregate report data to the Elasticsearch and/or Splunk
- save_forensic - bool: Save forensic report data to the Elasticsearch and/or Splunk
- strip_attachment_payloads - bool: Remove attachment payloads from results
- output - str: Directory to place JSON and CSV files in
- nameservers - str: A comma separated list of DNS resolvers (Default: Cloudflare’s public resolvers)
- dns_timeout - float: DNS timeout period
- debug - bool: Print debugging messages
- silent - bool: Only print errors (Default: True)
- log_file - str: Write log messages to a file at this path
- n_procs - int: Number of process to run in parallel when parsing in CLI mode (Default: 1)
- chunk_size - int: Number of files to give to each process when running in parallel. Setting this to a number larger than one can improve performance when processing thousands of files
imap
- host - str: The IMAP server hostname or IP address
- port - int: The IMAP server port (Default: 993)
- ssl - bool: Use an encrypted SSL/TLS connection (Default: True)
- skip_certificate_verification - bool: Skip certificate verification (not recommended)
- user - str: The IMAP user
- password - str: The IMAP password
- reports_folder - str: The IMAP folder where the incoming reports can be found (Default: INBOX)
- archive_folder - str: The IMAP folder to sort processed emails into (Default: Archive)
- watch - bool: Use the IMAP IDLE command to process messages as they arrive
- delete - bool: Delete messages after processing them, instead of archiving them
- test - bool: Do not move or delete messages
elasticsearch
- hosts - str: A comma separated list of hostnames and ports or URLs (e.g. 127.0.0.1:9200 or https://user:secret@localhost)
  
  Note
  
  Special characters in the username or password must be URL encoded.
- ssl - bool: Use an encrypted SSL/TLS connection (Default: True)
- cert_path - str: Path to a trusted certificates
- index_suffix - str: A suffix to apply to the index names
- monthly_indexes - bool: Use monthly indexes instead of daily indexes
splunk_hec
- url - str: The URL of the Splunk HTTP Events Collector (HEC)
- token - str: The HEC token
- index - str: The Splunk index to use
- skip_certificate_verification - bool: Skip certificate verification (not recommended)
kafka
- hosts - str: A comma separated list of Kafka hosts
- user - str: The Kafka user
- passsword - str: The Kafka password
- ssl - bool: Use an encrypted SSL/TLS connection (Default: True)
- skip_certificate_verification - bool: Skip certificate verification (not recommended)
- aggregate_topic - str: The Kafka topic for aggregate reports
- forensic_topic - str: The Kafka topic for forensic reports
smtp
- host - str: The SMTP hostname
- port - int: The SMTP port (Default: 25)
- ssl - bool: Require SSL/TLS instead of using STARTTLS
- skip_certificate_verification - bool: Skip certificate verification (not recommended)
- user - str: the SMTP username
- password - str: the SMTP password
- from - str: The From header to use in the email
- to - list: A list of email addresses to send to
- subject - str: The Subject header to use in the email (Default: parsedmarc report)
- attachment - str: The ZIP attachment filenames
- message - str: The email message (Default: Please see the attached parsedmarc report.)

Warning

save_aggregate and save_forensic are separate options because you may not want to save forensic reports (also known as failure reports) to your Elasticsearch instance, particularly if you are in a highly-regulated industry that handles sensitive data, such as healthcare or finance. If your legitimate outgoing email fails DMARC, it is possible that email may appear later in a forensic report.

Forensic reports contain the original headers of an email that failed a DMARC check, and sometimes may also include the full message body, depending on the policy of the reporting organization.

Most reporting organizations do not send forensic reports of any kind for privacy reasons. While aggregate DMARC reports are sent at least daily, it is normal to receive very few forensic reports.

An alternative approach is to still collect forensic/failure/ruf reports in your DMARC inbox, but run parsedmarc with save_forensic = True manually on a separate IMAP folder (using the reports_folder option), after you have manually moved known samples you want to save to that folder (e.g. malicious samples and non-sensitive legitimate samples).

Sample aggregate report output

Here are the results from parsing the example report from the dmarc.org wiki. It’s actually an older draft of the the 1.0 report schema standardized in RFC 7480 Appendix C. This draft schema is still in wide use.

parsedmarc produces consistent, normalized output, regardless of the report schema.

JSON

{
  "xml_schema": "draft",
  "report_metadata": {
    "org_name": "acme.com",
    "org_email": "noreply-dmarc-support@acme.com",
    "org_extra_contact_info": "http://acme.com/dmarc/support",
    "report_id": "9391651994964116463",
    "begin_date": "2012-04-27 20:00:00",
    "end_date": "2012-04-28 19:59:59",
    "errors": []
  },
  "policy_published": {
    "domain": "example.com",
    "adkim": "r",
    "aspf": "r",
    "p": "none",
    "sp": "none",
    "pct": "100",
    "fo": "0"
  },
  "records": [
    {
      "source": {
        "ip_address": "72.150.241.94",
        "country": "US",
        "reverse_dns": "adsl-72-150-241-94.shv.bellsouth.net",
        "base_domain": "bellsouth.net"
      },
      "count": 2,
      "alignment": {
        "spf": true,
        "dkim": false,
        "dmarc": true
      },
      "policy_evaluated": {
        "disposition": "none",
        "dkim": "fail",
        "spf": "pass",
        "policy_override_reasons": []
      },
      "identifiers": {
        "header_from": "example.com",
        "envelope_from": "example.com",
        "envelope_to": null
      },
      "auth_results": {
        "dkim": [
          {
            "domain": "example.com",
            "selector": "none",
            "result": "fail"
          }
        ],
        "spf": [
          {
            "domain": "example.com",
            "scope": "mfrom",
            "result": "pass"
          }
        ]
      }
    }
  ]
}

CSV

xml_schema,org_name,org_email,org_extra_contact_info,report_id,begin_date,end_date,errors,domain,adkim,aspf,p,sp,pct,fo,source_ip_address,source_country,source_reverse_dns,source_base_domain,count,disposition,dkim_alignment,spf_alignment,policy_override_reasons,policy_override_comments,envelope_from,header_from,envelope_to,dkim_domains,dkim_selectors,dkim_results,spf_domains,spf_scopes,spf_results
draft,acme.com,noreply-dmarc-support@acme.com,http://acme.com/dmarc/support,9391651994964116463,2012-04-27 20:00:00,2012-04-28 19:59:59,,example.com,r,r,none,none,100,0,72.150.241.94,US,adsl-72-150-241-94.shv.bellsouth.net,bellsouth.net,2,none,fail,pass,,,example.com,example.com,,example.com,none,fail,example.com,mfrom,pass

Sample forensic report output

Thanks to Github user xennn for the anonymized forensic report email sample.

JSON

{
     "feedback_type": "auth-failure",
     "user_agent": "Lua/1.0",
     "version": "1.0",
     "original_mail_from": "sharepoint@domain.de",
     "original_rcpt_to": "peter.pan@domain.de",
     "arrival_date": "Mon, 01 Oct 2018 11:20:27 +0200",
     "message_id": "<38.E7.30937.BD6E1BB5@ mailrelay.de>",
     "authentication_results": "dmarc=fail (p=none, dis=none) header.from=domain.de",
     "delivery_result": "smg-policy-action",
     "auth_failure": [
       "dmarc"
     ],
     "reported_domain": "domain.de",
     "arrival_date_utc": "2018-10-01 09:20:27",
     "source": {
       "ip_address": "10.10.10.10",
       "country": null,
       "reverse_dns": null,
       "base_domain": null
     },
     "authentication_mechanisms": [],
     "original_envelope_id": null,
     "dkim_domain": null,
     "sample_headers_only": false,
     "sample": "Received: from Servernameone.domain.local (Servernameone.domain.local [10.10.10.10])\n\tby  mailrelay.de (mail.DOMAIN.de) with SMTP id 38.E7.30937.BD6E1BB5; Mon,  1 Oct 2018 11:20:27 +0200 (CEST)\nDate: 01 Oct 2018 11:20:27 +0200\nMessage-ID: <38.E7.30937.BD6E1BB5@ mailrelay.de>\nTo: <peter.pan@domain.de>\nfrom: \"=?utf-8?B?SW50ZXJha3RpdmUgV2V0dGJld2VyYmVyLcOcYmVyc2ljaHQ=?=\" <sharepoint@domain.de>\nSubject: Subject\nMIME-Version: 1.0\nX-Mailer: Microsoft SharePoint Foundation 2010\nContent-Type: text/html; charset=utf-8\nContent-Transfer-Encoding: quoted-printable\n\n<html><head><base href=3D'\nwettbewerb' /></head><body><!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 3.2//EN\"=\n><HTML><HEAD><META NAME=3D\"Generator\" CONTENT=3D\"MS Exchange Server version=\n 08.01.0240.003\"></html>\n",
     "parsed_sample": {
       "from": {
         "display_name": "Interaktive Wettbewerber-Übersicht",
         "address": "sharepoint@domain.de",
         "local": "sharepoint",
         "domain": "domain.de"
       },
       "to_domains": [
         "domain.de"
       ],
       "to": [
         {
           "display_name": null,
           "address": "peter.pan@domain.de",
           "local": "peter.pan",
           "domain": "domain.de"
         }
       ],
       "subject": "Subject",
       "timezone": "+2",
       "mime-version": "1.0",
       "date": "2018-10-01 09:20:27",
       "content-type": "text/html; charset=utf-8",
       "x-mailer": "Microsoft SharePoint Foundation 2010",
       "body": "<html><head><base href='\nwettbewerb' /></head><body><!DOCTYPE HTML PUBLIC \"-//W3C//DTD HTML 3.2//EN\"><HTML><HEAD><META NAME=\"Generator\" CONTENT=\"MS Exchange Server version 08.01.0240.003\"></html>",
       "received": [
         {
           "from": "Servernameone.domain.local Servernameone.domain.local 10.10.10.10",
           "by": "mailrelay.de mail.DOMAIN.de",
           "with": "SMTP id 38.E7.30937.BD6E1BB5",
           "date": "Mon, 1 Oct 2018 11:20:27 +0200 CEST",
           "hop": 1,
           "date_utc": "2018-10-01 09:20:27",
           "delay": 0
         }
       ],
       "content-transfer-encoding": "quoted-printable",
       "message-id": "<38.E7.30937.BD6E1BB5@ mailrelay.de>",
       "has_defects": false,
       "headers": {
         "Received": "from Servernameone.domain.local (Servernameone.domain.local [10.10.10.10])\n\tby  mailrelay.de (mail.DOMAIN.de) with SMTP id 38.E7.30937.BD6E1BB5; Mon,  1 Oct 2018 11:20:27 +0200 (CEST)",
         "Date": "01 Oct 2018 11:20:27 +0200",
         "Message-ID": "<38.E7.30937.BD6E1BB5@ mailrelay.de>",
         "To": "<peter.pan@domain.de>",
         "from": "\"Interaktive Wettbewerber-Übersicht\" <sharepoint@domain.de>",
         "Subject": "Subject",
         "MIME-Version": "1.0",
         "X-Mailer": "Microsoft SharePoint Foundation 2010",
         "Content-Type": "text/html; charset=utf-8",
         "Content-Transfer-Encoding": "quoted-printable"
       },
       "reply_to": [],
       "cc": [],
       "bcc": [],
       "attachments": [],
       "filename_safe_subject": "Subject"
     }
   }

CSV

feedback_type,user_agent,version,original_envelope_id,original_mail_from,original_rcpt_to,arrival_date,arrival_date_utc,subject,message_id,authentication_results,dkim_domain,source_ip_address,source_country,source_reverse_dns,source_base_domain,delivery_result,auth_failure,reported_domain,authentication_mechanisms,sample_headers_only
auth-failure,Lua/1.0,1.0,,sharepoint@domain.de,peter.pan@domain.de,"Mon, 01 Oct 2018 11:20:27 +0200",2018-10-01 09:20:27,Subject,<38.E7.30937.BD6E1BB5@ mailrelay.de>,"dmarc=fail (p=none, dis=none) header.from=domain.de",,10.10.10.10,,,,smg-policy-action,dmarc,domain.de,,False

Bug reports

Please report bugs on the GitHub issue tracker

https://github.com/domainaware/parsedmarc/issues

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

8.16.0

Nov 2, 2024

8.15.4

Oct 24, 2024

8.15.3

Oct 24, 2024

8.15.2

Oct 24, 2024

8.15.1

Oct 3, 2024

8.15.0

Sep 4, 2024

8.14.1

Sep 3, 2024

8.14.0

Sep 3, 2024

8.13.0

Aug 25, 2024

8.12.0

May 22, 2024

8.11.0

Apr 2, 2024

8.10.3

Mar 29, 2024

8.10.2

Mar 29, 2024

8.10.1

Mar 27, 2024

8.10.0

Mar 26, 2024

8.9.4

Mar 25, 2024

8.9.3

Mar 25, 2024

8.9.2

Mar 25, 2024

8.9.1

Mar 25, 2024

8.8.0

Mar 4, 2024

8.7.0

Feb 20, 2024

8.6.4

Oct 13, 2023

8.6.3

Oct 11, 2023

8.6.2

Oct 11, 2023

8.6.1

Jun 27, 2023

8.6.0

May 9, 2023

8.5.0

May 3, 2023

8.4.2

Jan 21, 2023

8.4.1

Jan 16, 2023

8.4.0

Dec 24, 2022

8.3.2

Oct 4, 2022

8.3.1

Sep 9, 2022

8.3.0

Jun 20, 2022

8.2.0

May 11, 2022

8.1.1

May 9, 2022

8.1.0

May 9, 2022

8.0.3

May 2, 2022

8.0.2 yanked

Apr 26, 2022

Reason this release was yanked:

IMAP bugs

8.0.1 yanked

Apr 25, 2022

Reason this release was yanked:

IMAP bugs

8.0.0 yanked

Apr 22, 2022

Reason this release was yanked:

Broken packaging

7.1.1

Jan 8, 2022

7.1.0

Jan 5, 2022

7.0.1

Jun 23, 2021

7.0.0

Jun 20, 2021

6.12.0

Nov 25, 2020

6.11.0

Aug 31, 2020

6.10.0

May 10, 2020

6.9.0

Feb 17, 2020

6.8.2

Jan 24, 2020

6.8.1

Jan 22, 2020

6.8.0

Jan 14, 2020

6.7.4

Dec 23, 2019

6.7.3

Dec 17, 2019

6.7.2

Nov 25, 2019

6.7.1

Nov 12, 2019

6.7.0

Nov 6, 2019

6.6.1

Sep 23, 2019

6.6.0

Sep 23, 2019

6.5.5

Sep 13, 2019

6.5.4

Aug 12, 2019

6.5.3

Jul 31, 2019

6.5.2

Jul 30, 2019

6.5.1

Jul 19, 2019

6.5.0

Jul 17, 2019

6.4.2

Jul 2, 2019

6.4.1

May 19, 2019

6.4.0

May 8, 2019

6.3.7

May 3, 2019

6.3.6

Apr 30, 2019

6.3.5

Apr 29, 2019

6.3.4

Apr 23, 2019

This version

6.3.3

Apr 23, 2019

6.3.2

Apr 11, 2019

6.3.1

Mar 29, 2019

6.3.0

Mar 29, 2019

6.2.2

Mar 19, 2019

6.2.1

Feb 25, 2019

6.2.0

Feb 25, 2019

6.1.8

Feb 16, 2019

6.1.6

Feb 16, 2019

6.1.5

Feb 16, 2019

6.1.3

Feb 16, 2019

6.1.2

Feb 16, 2019

6.1.1

Feb 15, 2019

6.1.0

Feb 13, 2019

6.0.3

Feb 12, 2019

6.0.2

Feb 10, 2019

6.0.1

Feb 6, 2019

6.0.0

Feb 5, 2019

5.3.0

Jan 28, 2019

5.2.1

Jan 13, 2019

5.2.0

Jan 13, 2019

5.1.3

Jan 7, 2019

5.1.2

Dec 31, 2018

5.1.1

Dec 20, 2018

5.1.0

Nov 29, 2018

5.0.2

Nov 26, 2018

5.0.1

Nov 26, 2018

5.0.0

Nov 19, 2018

4.4.1

Nov 9, 2018

4.4.0

Nov 9, 2018

4.3.8

Oct 25, 2018

4.3.7

Oct 22, 2018

4.3.6

Oct 19, 2018

4.3.5

Oct 18, 2018

4.3.4

Oct 16, 2018

4.3.3

Oct 15, 2018

4.3.2

Oct 14, 2018

4.3.1

Oct 14, 2018

4.3.0

Oct 12, 2018

4.2.1

Oct 11, 2018

4.2.0

Oct 10, 2018

4.1.9

Oct 8, 2018

4.1.8

Oct 7, 2018

4.1.7

Oct 6, 2018

4.1.6

Oct 5, 2018

4.1.5

Oct 5, 2018

4.1.4

Sep 30, 2018

4.1.3

Sep 29, 2018

4.1.2

Sep 29, 2018

4.1.1

Sep 29, 2018

4.1.0

Sep 27, 2018

4.0.2

Sep 26, 2018

4.0.1

Sep 26, 2018

4.0.0

Sep 26, 2018

3.9.7

Sep 13, 2018

3.9.6

Sep 11, 2018

3.9.5

Sep 10, 2018

3.9.4

Sep 6, 2018

3.9.3

Sep 6, 2018

3.9.2

Sep 6, 2018

3.9.1

Sep 6, 2018

3.9.0

Sep 6, 2018

3.8.2

Sep 4, 2018

3.8.0

Aug 22, 2018

3.7.3

Aug 19, 2018

3.7.2

Aug 1, 2018

3.7.1

Jul 18, 2018

3.7.0

Jul 18, 2018

3.6.1

Jun 29, 2018

3.6.0

Jun 20, 2018

3.5.1

Jun 20, 2018

3.5.0

Jun 10, 2018

3.4.1

Apr 1, 2018

3.4.0

Mar 30, 2018

3.3.0

Mar 27, 2018

3.2.0

Mar 27, 2018

3.1.0

Mar 27, 2018

3.0.0

Mar 26, 2018

2.1.2

Mar 6, 2018

2.1.1

Mar 6, 2018

2.1.0

Mar 5, 2018

2.0.1

Mar 4, 2018

2.0.0

Mar 4, 2018

1.1.0

Feb 8, 2018

1.0.5

Feb 6, 2018

1.0.4

Feb 6, 2018

1.0.3

Feb 6, 2018

1.0.2

Feb 6, 2018

1.0.1

Feb 6, 2018

1.0.0

Feb 6, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parsedmarc-6.3.3.tar.gz (42.7 kB view details)

Uploaded Apr 23, 2019 Source

Built Distribution

parsedmarc-6.3.3-py3-none-any.whl (42.1 kB view details)

Uploaded Apr 23, 2019 Python 3

File details

Details for the file parsedmarc-6.3.3.tar.gz.

File metadata

Download URL: parsedmarc-6.3.3.tar.gz
Upload date: Apr 23, 2019
Size: 42.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.0 CPython/3.7.3rc1

File hashes

Hashes for parsedmarc-6.3.3.tar.gz
Algorithm	Hash digest
SHA256	`292366e8f683f67998278730ccceb645d939fd651130fe22df680e218b1b7121`
MD5	`31f6e7fe788ad52d70d3207468e8e2de`
BLAKE2b-256	`a5f32e9a2a4cc92ed9d508609fa7e6f6ce482af84bbc1cdc3e4724b963da574e`

See more details on using hashes here.

File details

Details for the file parsedmarc-6.3.3-py3-none-any.whl.

File metadata

Download URL: parsedmarc-6.3.3-py3-none-any.whl
Upload date: Apr 23, 2019
Size: 42.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.0 CPython/3.7.3rc1

File hashes

Hashes for parsedmarc-6.3.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`868f66ed5dbd736971697cb56aaa62b718dad02e4754e52e0a03412e81baab7a`
MD5	`78976f4b3b5d7fbe65dfbe4240e9b941`
BLAKE2b-256	`abd31f86d43fddacc84295c3acbd1da1f4ba47a1236a761e9cbeff677680cd89`