From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: X-Spam-Status: No, score=-4.0 required=3.0 tests=ALL_TRUSTED,BAYES_00 shortcircuit=no autolearn=ham autolearn_force=no version=3.4.0 Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id BE7011F744 for ; Sat, 25 Jun 2016 22:34:30 +0000 (UTC) From: Eric Wong To: meta@public-inbox.org Subject: [PATCH] TODO: add a few Xapian-related items Date: Sat, 25 Jun 2016 22:34:30 +0000 Message-Id: <20160625223430.15321-1-e@80x24.org> List-Id: "git cat-file --batch" seems expensive for big repos and loading 70K+ tree objects in git isn't all that fast. Ideas are cheap, time, code, and testing are not :P --- TODO | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/TODO b/TODO index 0d6f1a0..f29f2f0 100644 --- a/TODO +++ b/TODO @@ -18,7 +18,7 @@ TODO items for public-inbox * Optional reply-to-list support for mirroring lists that want it :< Reply-to-list encourages the existing list as a single-point-of-failure, - but having an extra mirror using public-inbox.org is nice regardless. + but having an extra mirror using public-inbox code is nice regardless. * Configurable linkification for per-inbox shorthands: "$gmane/123456" could be configured to expand to the @@ -44,6 +44,12 @@ TODO items for public-inbox * handle Xapian date range queries: http://mid.gmane.org/20151005222157.GE5880@survex.com +* Consider storing git blob ID in Xapian doc data to avoid ref + and tree lookups based on Message-Id. + +* Allow in-place Xapian updates without clobbering the whole + index (versioning each doc data entry?) for big archives + * use REQUEST_URI properly for CGI / mod_perl2 compatibility with Message-IDs which include '%' (done?) -- EW