Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

Subject: Re: regex [X-Z] with non-ascii char returns different results from (X|Y|Z)

Date: Thu, 22 Aug 2019 22:53:34 +0300

To:

Cc: Notmuch

From: Tomi Ollila


On Thu, Aug 22 2019, David Bremner wrote:

> "yury.t" <tptlab@tuta.io> writes:
>
>> Thank you for your reply.
>> I confirmed that the issue is reproduced in a C program: https://pastebin.com/5NaCM45G
>>
>> Sorry for bothering you...
>
> I'm not sure, but it might be a glibc bug. Since we are already using
> glib, maybe we should use
>
>       https://developer.gnome.org/glib/stable/glib-Perl-compatible-regular-expressions.html
>
> I don't know if it also has this problem with [] and non-ascii
> characters.

Since pcre2 supports \K, that gives me a positive vibe about the above

( 'Resetting the match start' 
  in http://www.pcre.org/current/doc/html/pcre2pattern.html )
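
For anyone who wants to compare a bracket range against the equivalent
alternation locally, a minimal sketch using POSIX regcomp()/regexec() could
look like the one below. The helper names regex_matches() and same_result()
are made up for illustration and are not taken from the pastebin program; the
patterns and the text to test are whatever the caller passes in.

```c
#include <regex.h>
#include <stdio.h>

/* Hypothetical helper: returns 1 on match, 0 on no match,
 * -1 if the pattern does not compile. */
int regex_matches(const char *pattern, const char *text)
{
    regex_t re;
    int rc;

    /* POSIX extended syntax, so that (X|Y|Z) is an alternation. */
    if (regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB) != 0)
        return -1;

    rc = regexec(&re, text, 0, NULL, 0);
    regfree(&re);
    return rc == 0 ? 1 : 0;
}

/* Hypothetical helper: prints both results and returns 1 if the
 * range pattern and the alternation pattern agree on this text. */
int same_result(const char *range_pat, const char *alt_pat, const char *text)
{
    int a = regex_matches(range_pat, text);
    int b = regex_matches(alt_pat, text);

    printf("%s -> %d, %s -> %d, text: %s\n", range_pat, a, alt_pat, b, text);
    return a == b;
}
```

To try non-ASCII input, call setlocale(LC_ALL, "") from main() first (from
<locale.h>, under a UTF-8 locale) and then pass the range and the alternation
to same_result(). Whether the two agree in that case is exactly what this
thread is about, so no expected output is given here.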

Tomi 

>
> d
_______________________________________________
notmuch mailing list
notmuch@notmuchmail.org
https://notmuchmail.org/mailman/listinfo/notmuch
