Re: [PATCH] test: Add test for searching of uncommonly encoded messages

Subject: Re: [PATCH] test: Add test for searching of uncommonly encoded messages

Date: Sat, 25 Feb 2012 12:36:00 +0400

Cc:

Hi!
I've struck another problem:

I've got an html/text email with body encoded with cp1251.
Its encoding is mentioned in both Content-type: email header and html <meta>
tag. So when the client tries to display it with external html2text converter,
The message is decoded twice: first by client, second by html2text (I use w3m).

As I understand, notmuch (while indexing this message) decodes it once and
index it in the right way (though including html tags to index). But what if
the message contains no "charset" option in Content-Type email header but
contain <meta> content-type tag with charset noted? Should such message be
considered as being composed wrong or it should be indexed with diving into
html details (content-type)?

Previous message (by thread): Re: [PATCH 1/2] Convert non-UTF-8 parts to UTF-8 before indexing them

Thread:

Serge Z—Searching through different charsets [inbox, unread]
- Michal Sojka—Re: Searching through different charsets [inbox, unread]
  - Michal Sojka—[PATCH] test: Add test for searching of uncommonly encoded messages [inbox, notmuch::patch, notmuch::pushed, unread]
    - Serge Z—Re: [PATCH] test: Add test for searching of uncommonly encoded messages [inbox, unread]
      - Michal Sojka—Re: [PATCH] test: Add test for searching of uncommonly encoded messages [inbox, unread]
        Serge Z—Re: [PATCH] test: Add test for searching of uncommonly encoded messages [inbox, unread]
        Michal Sojka—Re: [PATCH] test: Add test for searching of uncommonly encoded messages [inbox, unread]
        Serge Z—Re: [PATCH] test: Add test for searching of uncommonly encoded messages [inbox, unread]
        Michal Sojka—Re: Double decoded text/html parts (was: [PATCH] test: Add test for searching of uncommonly encoded messages) [inbox, unread]
        Serge Z—Re: Double decoded text/html parts (was: [PATCH] test: Add test for searching of uncommonly encoded messages) [inbox, unread]
    - Michal Sojka—[PATCH 1/2] Convert non-UTF-8 parts to UTF-8 before indexing them [inbox, notmuch::patch, notmuch::pushed, unread]
      - Michal Sojka—[PATCH 2/2] test: Remove 'broken' flag from encoding test [inbox, notmuch::patch, notmuch::pushed, unread]
      - Austin Clements—Re: [PATCH 1/2] Convert non-UTF-8 parts to UTF-8 before indexing them [inbox, notmuch::review, unread]
      - David Bremner—Re: [PATCH 1/2] Convert non-UTF-8 parts to UTF-8 before indexing them [inbox, unread]
    - David Bremner—Re: [PATCH] test: Add test for searching of uncommonly encoded messages [inbox, unread]