From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: X-Spam-Status: No, score=-4.0 required=3.0 tests=ALL_TRUSTED,BAYES_00 shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id 1DE5C20248; Mon, 18 Mar 2019 23:04:28 +0000 (UTC) Date: Mon, 18 Mar 2019 23:04:27 +0000 From: Eric Wong To: Bjorn Helgaas Cc: meta@public-inbox.org Subject: Re: Threading in git repo? Message-ID: <20190318230427.os5zvvwkybamd3ne@dcvr> References: <20190313230707.GB210027@google.com> <20190314074447.GA8156@dcvr> <20190318213817.GA88541@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20190318213817.GA88541@google.com> List-Id: Bjorn Helgaas wrote: > I'm still hoping to get to a solution using a local public-inbox > archive, without requiring a network connection or even additional > local servers. Would a Perl script which works like mairix(*) be acceptable to you? I have something planned along those lines... scripts/dupe-finder could be used as a starting point. Maybe a FUSE filesystem to export an mbox as a Maildir or mbox would work, too. But it's probably too expensive and slow, especially for Maildirs. * mairix indexes Maildirs/mboxes, and dumps search results to a new Maildir/mbox which mutt can understand: git clone https://github.com/vandry/mairix.git git clone https://github.com/rc0/mairix.git > On Thu, Mar 14, 2019 at 07:44:47AM +0000, Eric Wong wrote: > > If the git commit messages all had key headers > > (Message-ID/From/To/Cc/References/In-Reply-To/Subject), then > > yes; then a SQLite/Xapian-agnostic client could be taught to > > read and do threading based on that; with fewer git ODB > > accesses. I don't think it's worth introducing at this > > time, though. > > If I understand correctly Yup :)