From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
X-Spam-Checker-Version: SpamAssassin 3.4.1 (2015-04-28) on dcvr.yhbt.net
X-Spam-Level:
X-Spam-ASN:
X-Spam-Status: No, score=-4.0 required=3.0 tests=ALL_TRUSTED,BAYES_00
	shortcircuit=no autolearn=ham autolearn_force=no version=3.4.1
Received: from localhost (dcvr.yhbt.net [127.0.0.1])
	by dcvr.yhbt.net (Postfix) with ESMTP id 7347B1F597
	for ; Mon, 30 Jul 2018 08:23:51 +0000 (UTC)
From: Eric Wong
To: meta@public-inbox.org
Subject: [PATCH] search: (really) match the behavior of WWW for indexing text
Date: Mon, 30 Jul 2018 08:23:51 +0000
Message-Id: <20180730082351.13305-1-e@80x24.org>
List-Id:

Not sure what was going through my mind when I made my first
attempt at this, but we really want to make sure we index all
the text we display in the web view (and presumably anything
a reasonable mail client can display).

Followup-to: 0cf6196025d4e4880cd1ed859257ce21dd3cdcf6
("search: match the behavior of WWW for indexing text")
---
 lib/PublicInbox/SearchIdx.pm | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/lib/PublicInbox/SearchIdx.pm b/lib/PublicInbox/SearchIdx.pm
index 1d259a8..29868d9 100644
--- a/lib/PublicInbox/SearchIdx.pm
+++ b/lib/PublicInbox/SearchIdx.pm
@@ -321,8 +321,7 @@ sub add_xapian ($$$$$) {
 		defined $s or return;
 
 		my (@orig, @quot);
-		my $body = $part->body;
-		my @lines = split(/\n/, $body);
+		my @lines = split(/\n/, $s);
 		while (defined(my $l = shift @lines)) {
 			if ($l =~ /^>/) {
 				$self->index_body(\@orig, $doc) if @orig;
-- 
EW