Re: notmuch ignoring alot of emails

Subject: Re: notmuch ignoring alot of emails

Date: Fri, 28 Jun 2019 16:11:21 -0400

To: Alexei Gilchrist

Cc: notmuch@notmuchmail.org

From: Alvaro Herrera


On 2019-Jun-28, Alvaro Herrera wrote:

> I think a real solution is to parse the message header, look for the
> Content-Length, and determine mbox-ness by looking for "From" only past
> that many bytes; that seems to match what other mail parsing tools do.

Sorry, I misspoke: there's no such thing as Content-Length.
It's Content-Type/boundary that needs to be watched for.  Only consider
that the file is an mbox if a "^From " line appears after the boundary
end marker (which seems to be defined as "the boundary string followed
by two dashes --").

Here's a sample message, BTW:
https://www.postgresql.org/message-id/raw/3ad5ba71-d200-96da-f903-7e3b16416140@lab.ntt.co.jp
(username "archives", password "antispam").

-- 
Álvaro Herrera       Valdivia, Chile
_______________________________________________
notmuch mailing list
notmuch@notmuchmail.org
https://notmuchmail.org/mailman/listinfo/notmuch

Thread: