From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on dcvr.yhbt.net
X-Spam-Level:
X-Spam-ASN:
X-Spam-Status: No, score=-4.2 required=3.0 tests=ALL_TRUSTED,AWL,BAYES_00,
 DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,
 T_SCC_BODY_TEXT_LINE shortcircuit=no autolearn=ham
 autolearn_force=no version=3.4.6
Received: from localhost (dcvr.yhbt.net [127.0.0.1])
 by dcvr.yhbt.net (Postfix) with ESMTP id 861901F406
 for ; Fri, 24 Nov 2023 04:18:20 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=80x24.org;
 s=selector1; t=1700799500;
 bh=LmWO7aw5KssW4I74mgSgi9IgyvJppYms9nPjgavR9ao=;
 h=From:To:Subject:Date:From;
 b=Lqzj85IYObbmUJf0zjmBW6X7a5PoIflOcGGNQgBdfzhVqFaMfKW7STEKd+Jw4f9B0
  oaL/hRuBTLSKxOuL1d03nm7MoVNOlspHTf62y/IiZio9L/UIv/jWb0PSzsO4H3dZXA
  krBoVl/Pd9qdtBYdztkGhtypN8DEpE4gkJzVrhKA=
From: Eric Wong
To: meta@public-inbox.org
Subject: [PATCH] doc/extindex: document --dedupe switch
Date: Fri, 24 Nov 2023 04:18:19 +0000
Message-Id: <20231124041819.1979651-1-e@80x24.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
List-Id:

We've had it since v1.7.0 when -extindex was introduced, but it
was never documented outside of commit messages.
---
 Documentation/public-inbox-extindex.pod | 26 +++++++++++++++++++++----
 1 file changed, 22 insertions(+), 4 deletions(-)

diff --git a/Documentation/public-inbox-extindex.pod b/Documentation/public-inbox-extindex.pod
index be4ea4de..361eb43f 100644
--- a/Documentation/public-inbox-extindex.pod
+++ b/Documentation/public-inbox-extindex.pod
@@ -47,6 +47,20 @@ C set to C and their respective Xapian
 public-inboxes where cross-posting is common, this allows
 significant space savings on Xapian indices.
 
+=item --dedupe=MSGID
+
+=item --dedupe
+
+Rerun deduplication on messages with the given Message-IDs, or on
+all messages if no Message-ID is specified. Deduplication rules may
+change and evolve over time, especially if filters are involved.
+
+C<--dedupe=MSGID> may be specified multiple times to deduplicate
+multiple Message-IDs.
+
+Use this if you see C
+warnings from WWW logs.
+
 =item --gc
 
 Perform garbage collection instead of indexing. Use this if
@@ -61,10 +75,6 @@ used for in-place upgrades and bugfixes while read-only server
 processes are utilizing the index. Keep in mind this roughly
 doubles the size of the already-large Xapian database.
 
-The extindex locks will be released roughly every 10s to
-allow L and L
-processes to write to the extindex.
-
 =item --fast
 
 Used with C<--reindex>, it will only look for new and stale
@@ -131,6 +141,14 @@ Default: none, uses C
 Occasionally, public-inbox will update its schema version and
 require a full index by running this command.
 
+=head1 LOCKING
+
+It is safe to use C<--dedupe>, C<--gc> and C<--reindex> while
+other processes are writing to covered inboxes or extindex.
+The extindex locks will be released roughly every 10s to
+allow L and L
+processes to write to the extindex.
+
 =head1 CONTACT
 
 Feedback welcome via plain-text mail to L