diff options
author | Eric Wong <e@yhbt.net> | 2020-03-21 02:03:53 +0000 |
---|---|---|
committer | Eric Wong <e@yhbt.net> | 2020-03-25 01:48:35 +0000 |
commit | 7909c5f7439777e3b3643f14224b27f0a8f8fb07 (patch) | |
tree | 12e9187b2069d63582c3c72546f7f71756d2f79a /lib/PublicInbox/Search.pm | |
parent | 089380f8ebecbe868c5599773b3c23d40b3db129 (diff) | |
download | public-inbox-7909c5f7439777e3b3643f14224b27f0a8f8fb07.tar.gz |
We only support searching on prefixes matching /\A\w+\z/ because Xapian requires ':' to delimit the prefix and splits on spaces without quotes. I've also verified Xapian supports multibyte UTF-8 characters, underscores, and bare numbers as search prefixes, so there's no need to restrict it beyond what Perl's UTF-8 aware \w character class offers.
Diffstat (limited to 'lib/PublicInbox/Search.pm')
-rw-r--r-- | lib/PublicInbox/Search.pm | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/lib/PublicInbox/Search.pm b/lib/PublicInbox/Search.pm index fabf5301..86a6ad67 100644 --- a/lib/PublicInbox/Search.pm +++ b/lib/PublicInbox/Search.pm @@ -316,6 +316,8 @@ sub qp { my $user_pfx = $self->{-user_pfx} = []; for (@$altid) { # $_ = 'serial:gmane:/path/to/gmane.msgmap.sqlite3' + # note: Xapian supports multibyte UTF-8, /^[0-9]+$/, + # and '_' with prefixes matching \w+ /\Aserial:(\w+):/ or next; my $pfx = $1; push @$user_pfx, "$pfx:", <<EOF; |