Re: [PATCH v3 3/5] Add indexing for the mimetype term

Subject: Re: [PATCH v3 3/5] Add indexing for the mimetype term

Date: Sat, 17 Jan 2015 16:21:50 +0100

Cc:

Todd <todd@electricoding.com> writes:

> Adds the indexing and removes the broken test flag
> ---
>  lib/database.cc        |  1 +
>  lib/index.cc           | 10 ++++++++++
>  test/T190-multipart.sh |  4 ----
>  3 files changed, 11 insertions(+), 4 deletions(-)
>
> diff --git a/lib/database.cc b/lib/database.cc
> index 0d2c417..3974e2e 100644
> --- a/lib/database.cc
> +++ b/lib/database.cc
> @@ -254,6 +254,7 @@ static prefix_t PROBABILISTIC_PREFIX[]= {
>      { "from",			"XFROM" },
>      { "to",			"XTO" },
>      { "attachment",		"XATTACHMENT" },
> +    { "mimetype",		"XMIMETYPE"},
>      { "subject",		"XSUBJECT"},
>  };

I think the commit message should articulate why we are indexing this as
a probabilistic prefix, rather than as a boolean prefix. In particular,
this gives people a last chance to complain.

The reference I know is http://xapian.org/docs/queryparser.html

If I understand correctly (it would be great if you could test this
Todd) , with a probabilistic prefix,

   mimetime:pdf

will match

application/pdf
image/pdf
application/x-pdf
application/x-ext-pdf

but not

application/x-bzpdf
application/x-gzpdf
application/x-xzpdf

On the whole, this is probably more beneficial than bad.  The downside
of probabilistic prefixes/fields is that they are not "anchored", so
there is no easy way to distinguish

      application/pdf

from

      pdf
      application/x-pdf

I guess in a perfect world this would also be explained in
notmuch-search-terms(7), but that's pretty much orthogonal to this
series.

d

Previous message (by thread): Re: [PATCH v4 0/5] Index the content-type of MIME parts

Thread:

Todd—[PATCH] Index Content-Type of attachments with a contenttype prefix [inbox, notmuch::obsolete, notmuch::patch, unread]
- David Bremner—Re: [PATCH] Index Content-Type of attachments with a contenttype prefix [inbox, unread]
  - Todd—Re: [PATCH] Index Content-Type of attachments with a contenttype prefix [inbox, unread]
- Jani Nikula—Re: [PATCH] Index Content-Type of attachments with a contenttype prefix [inbox, unread]
  - Todd—Re: [PATCH] Index Content-Type of attachments with a contenttype prefix [inbox, unread]
  - Todd—[PATCH v2 0/5] Index the content-type of MIME parts [inbox, unread]
    - Todd—[PATCH v2 1/5] Add a failing unit test for indexed mime types [inbox, notmuch::obsolete, notmuch::patch, unread]
      - Jani Nikula—Re: [PATCH v2 1/5] Add a failing unit test for indexed mime types [inbox, unread]
        Jani Nikula—Re: [PATCH v2 1/5] Add a failing unit test for indexed mime types [inbox, unread]
    - Todd—[PATCH v2 2/5] Add the NOTMUCH_FEATURE_INDEXED_MIMETYPES database feature [inbox, notmuch::obsolete, notmuch::patch, unread]
      - Jani Nikula—Re: [PATCH v2 2/5] Add the NOTMUCH_FEATURE_INDEXED_MIMETYPES database feature [inbox, unread]
        Austin Clements—Re: [PATCH v2 2/5] Add the NOTMUCH_FEATURE_INDEXED_MIMETYPES database feature [inbox, unread]
    - Todd—[PATCH v2 3/5] Add indexing for the mimetype term [inbox, notmuch::obsolete, notmuch::patch, unread]
      - Jani Nikula—Re: [PATCH v2 3/5] Add indexing for the mimetype term [inbox, unread]
    - Todd—[PATCH v2 4/5] Update completions for Emacs and bash [inbox, notmuch::obsolete, notmuch::patch, unread]
      - Jani Nikula—Re: [PATCH v2 4/5] Update completions for Emacs and bash [inbox, unread]
    - Todd—[PATCH v2 5/5] Update documentation [inbox, notmuch::obsolete, notmuch::patch, unread]
- Todd—[PATCH v3 1/5] Add failing unit tests for indexed mime types [inbox, notmuch::moreinfo, notmuch::patch, unread]
  - Todd—[PATCH v3 2/5] Add the NOTMUCH_FEATURE_INDEXED_MIMETYPES database feature [inbox, notmuch::obsolete, notmuch::patch, unread]
  - Todd—[PATCH v3 3/5] Add indexing for the mimetype term [inbox, notmuch::moreinfo, notmuch::patch, unread]
    - David Bremner—Re: [PATCH v3 3/5] Add indexing for the mimetype term [inbox, unread]
      - Todd—Re: [PATCH v3 3/5] Add indexing for the mimetype term [inbox, signed, unread]
  - Todd—[PATCH v3 4/5] Update completions for Emacs and bash [inbox, notmuch::obsolete, notmuch::patch, unread]
  - Todd—[PATCH v3 5/5] Update documentation [inbox, notmuch::moreinfo, notmuch::patch, unread]
    - David Bremner—Re: [PATCH v3 5/5] Update documentation [inbox, unread]
  - David Bremner—Re: [PATCH v3 1/5] Add failing unit tests for indexed mime types [inbox, unread]
- Todd—[PATCH v4 0/5] Index the content-type of MIME parts [inbox, unread]
  - Todd—[PATCH v4 1/5] test: Add failing unit tests for indexed mime types [inbox, notmuch::patch, notmuch::pushed, unread]
  - Todd—[PATCH v4 2/5] Add the NOTMUCH_FEATURE_INDEXED_MIMETYPES database feature [inbox, notmuch::patch, notmuch::pushed, unread]
  - Todd—[PATCH v4 3/5] Add indexing for the mimetype term [inbox, notmuch::patch, notmuch::pushed, unread]
  - Todd—[PATCH v4 4/5] Update completions for Emacs and bash [inbox, notmuch::patch, notmuch::pushed, unread]
  - Todd—[PATCH v4 5/5] Update documentation [inbox, notmuch::patch, notmuch::pushed, unread]
    - Jani Nikula—Re: [PATCH v4 5/5] Update documentation [inbox, unread]
  - David Bremner—Re: [PATCH v4 0/5] Index the content-type of MIME parts [inbox, unread]