0

When parsing a collection of MBox files, I came across a surprising number of Date header fields of the form:

"Date:" date-time "<" addr-spec ">"

Reading the available RFCs I find no matching syntax. The valid form appears to be:

"Date:" date-time [CFWS]

CFWS represents comments plus folding whitespace as described in Section 3.3 (Date and Time Specification) of RFC5322.

Reading Proper Mail Date Header Formatting where the author analyses 132k date headers, yet the form above never shows in the linked dataset.

Is this an MBox artefact, IMF attribute, or corruption from a mail agent and/or mail relay?

The Internet Message Format has been evolving since the 1980s, it's a bit of a mess and has been interpreted a number of ways à la HTTP. I'm leaning toward this being a vendor-specific modification that results in a malformed Date header field? IDK.

Example MBox

From ???@???
From: from@mail.com
To: to@mail.com
Date: Sat, 02 Feb 2002 00:55:01 +0000 <name@mail.com>
Subject: Email One

This is email 1.

Specifications

Ben Crowhurst
  • 8,204
  • 6
  • 48
  • 78
  • I have never seen this particular form of corruption in my 25+ years of staring at email messages. Probably very specific to a particular piece of software, perhaps something you ran locally? – tripleee Feb 14 '22 at 06:11
  • Same. I've never seen this in my 25+ years of writing MIME & mbox parsers. – jstedfast Feb 14 '22 at 17:18

0 Answers0