From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: X-Spam-Status: No, score=-4.1 required=3.0 tests=ALL_TRUSTED,AWL,BAYES_00, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, T_SCC_BODY_TEXT_LINE shortcircuit=no autolearn=ham autolearn_force=no version=3.4.2 Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id 295AB1F4D7; Mon, 20 Jun 2022 19:27:30 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=80x24.org; s=selector1; t=1655753250; bh=e8dxyzep02ty6aJ2DK9UCxSq47Z6Ztp0g6bjZDX5hY0=; h=From:To:Subject:Date:From; b=fbh6qvlZNR6wWQbxRyD98/uzj7FcnR2JZHIvEGtn4o3lrbMTX5+DAqQ8hpsUU5QSf CgtLSUeorvM+yVU8g0ClE4xdQ5xmhxLa/+R1T5i23yCAKvC46UdHuTHqrO4uoTLd3+ mFPaBJPc4HnpHQ15/E5vJtdumLlD8PZikXIljWbU= From: Eric Wong To: meta@public-inbox.org Subject: [PATCH 0/3] search indexing improvements Date: Mon, 20 Jun 2022 19:27:27 +0000 Message-Id: <20220620192730.550803-1-e@80x24.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit List-Id: Still stuck on POP3 account manglement, but here's some easy-ish indexing changes for public-inbox-* tools. These require a full reindex with either public-inbox-index or public-inbox-extindex, but old/new indexes should be fully compatible and should be doable hot: public-inbox-index --no-fsync --reindex /path/to/v1-or-v2 public-inbox-extindex --no-fsync --reindex --all /path/to/eidx Will probably take 2 days or so on my own machine. Note: lei doesn't support reindexing, yet, but will, soon... Eric Wong (3): searchidx: use regexp as first arg for `split' op search: support "patchid:" prefix (git patch-id --stable) search: do not index base-85 binary patches MANIFEST | 1 + TODO | 5 --- lib/PublicInbox/Search.pm | 5 ++- lib/PublicInbox/SearchIdx.pm | 75 ++++++++++++++++++++++++++---------- t/data/binary.patch | 20 ++++++++++ t/extsearch.t | 7 +++- t/search.t | 15 ++++++++ t/v2mda.t | 10 ++++- 8 files changed, 108 insertions(+), 30 deletions(-) create mode 100644 t/data/binary.patch