On Fri, 24 Feb 2012, Serge Z wrote: > > Quoting Michal Sojka (2012-02-24 11:00:02) > >I'm not big fan of adding "preprocessor". First, I thing that both > >reasons you mention are actually bugs and it would be better to fix them > >for everybody than requiring each user to configure some preprocessor. > >Second, depending on what and how would your preprocessor do, the > >initial mail indexing could be a way slower, which is also nothing that > >people want. > > > >Do you have any other use case for the preprocessor besides utf8 and > >html->text conversions? > > > >Cheers, > >-Michal > > Well, I don't want to add any external preprocessor too. > > This may be considered as an architectural decision: search engine should not > access messages directly, but through some preprocessing layer which would > handle the case of different encodings in body and headers, RFC2047-encoded > headers (if this is not handled yet) etc. > > Anyway, this solution imho would be nice to be concluded inside a separate > library Yes, this library is called gmime and notmuch already make use of it. -Michal