user/dev discussion of public-inbox itself
 help / color / mirror / code / Atom feed
Search results ordered by [date|relevance]  view[summary|nested|Atom feed]
thread overview below | download mbox.gz: |
* [PATCH 08/10] search: avoid mindlessly calling body_set
  2016-09-09  0:01  5% [PATCH 0/10] search: more mairix prefix compatibility Eric Wong
@ 2016-09-09  0:01  7% ` Eric Wong
  0 siblings, 0 replies; 2+ results
From: Eric Wong @ 2016-09-09  0:01 UTC (permalink / raw)
  To: meta

It's not worth entering a complex codepath in Email::MIME to
save some (probably immeasurable amount of) memory, here.  We've
already stopped doing this in our WWW code a while back, too.
If we really cared enough about it, we'd prioritize work on a
streaming replacement for Email::MIME.
---
 lib/PublicInbox/SearchIdx.pm | 1 -
 1 file changed, 1 deletion(-)

diff --git a/lib/PublicInbox/SearchIdx.pm b/lib/PublicInbox/SearchIdx.pm
index 86be9ed..0e2d225 100644
--- a/lib/PublicInbox/SearchIdx.pm
+++ b/lib/PublicInbox/SearchIdx.pm
@@ -188,7 +188,6 @@ sub add_message {
 
 			my (@orig, @quot);
 			my $body = $part->body;
-			$part->body_set('');
 			my @lines = split(/\n/, $body);
 			while (defined(my $l = shift @lines)) {
 				if ($l =~ /^>/) {
-- 
EW


^ permalink raw reply related	[relevance 7%]

* [PATCH 0/10] search: more mairix prefix compatibility
@ 2016-09-09  0:01  5% Eric Wong
  2016-09-09  0:01  7% ` [PATCH 08/10] search: avoid mindlessly calling body_set Eric Wong
  0 siblings, 1 reply; 2+ results
From: Eric Wong @ 2016-09-09  0:01 UTC (permalink / raw)
  To: meta

This brings us closer to the behavior of mairix(1) for search
by supporting n:, t:, c:, f:, tc:, tcf:, n:, b:, and bs:
prefixes as documented in the mairix(1) manpage.

We also introduce the use of q: and nq: prefixes for quoted and
non-quoted text, respectively.

There is a schema version change in [PATCH 7/10] to maintain
compatibility with Debian 7.x wheezy installs.  The in-place
reindexing would've been expensive anyways, so perhaps the
schema bump is a good idea, anyways, as creating a fresh index
should be faster than --reindex.

Eric Wong (10):
      search: allow searching user fields (To/Cc/From)
      search: drop longer subject: prefix for search
      search: more granular message body searching
      search: fix space regressions from recent changes
      search: match quote detection behavior of view
      search: increase term positions for each quoted hunk
      search: fix compatibility with Debian wheezy
      search: avoid mindlessly calling body_set
      search: match the behavior of WWW for indexing text
      search: index attachment filenames

 lib/PublicInbox/Search.pm    |  32 +++++++++---
 lib/PublicInbox/SearchIdx.pm | 104 ++++++++++++++++++++++++-------------
 t/search.t                   | 120 ++++++++++++++++++++++++++++++++++++++++---
 3 files changed, 206 insertions(+), 50 deletions(-)


^ permalink raw reply	[relevance 5%]

Results 1-2 of 2 | reverse | options above
-- pct% links below jump to the message on this page, permalinks otherwise --
2016-09-09  0:01  5% [PATCH 0/10] search: more mairix prefix compatibility Eric Wong
2016-09-09  0:01  7% ` [PATCH 08/10] search: avoid mindlessly calling body_set Eric Wong

Code repositories for project(s) associated with this public inbox

	https://80x24.org/public-inbox.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).