From: Bruno Haible
To: arnold@skeeve.com
Cc: eggert@cs.ucla.edu, bug-gnulib@gnu.org
Subject: Re: regex unit tests
Date: Sun, 18 Jul 2021 23:45:18 +0200
Message-ID: <7719867.Yo8exX7jZS@omega>
In-Reply-To: <202107181859.16IIxOCA007113@freefriends.org>
References: <3323531.JAME3IizvO@omega> <202107181859.16IIxOCA007113@freefriends.org>
List-Id: Gnulib discussion list

Hi Arnold,

> > > (And how I've documented things in the manual, also since forever.)
> >
> > If you want the behaviour of the GNU regex to be stable over time, you
> > should contribute unit tests to tests/test-regex.c.
>
> This is a separate issue.
> It almost sounds like you're saying "it's your
> fault there's a bug here, you didn't contribute unit tests".

I'm not talking about past incidents and "fault", because that is
generally useless. I'm talking about the future and what we can do to
avoid that packages that depend on the 'regex' module see regressions.

If a Gnulib module does not have a decent test coverage in Gnulib, then
its bugs and regressions become apparent only after a while and only
through these other packages. A good example of this sequence of events
was , but I'm sure you can find many others of the same kind in the
mailing list archive.

If, on the other hand, there is a unit test and it runs on glibc
platforms, a regression is likely to be visible in the weekly continuous
integration build .

For the regex module, with 20 KB of tests for 300 KB of code full of
complex algorithms, the test coverage is very thin, and it is *to be
expected* that regressions are only visible once the code is integrated
into gawk, grep, sed, etc. Similarly for the 'dfa' module with 5 KB of
tests for 140 KB of code.

The regex and dfa modules are being maintained here (by Paul, with
contributions from various people), and we have seen that it is not
obvious whether a patch is good or not: sometimes Paul has rejected
patches, sometimes he had to revert patches.

I think it would be good if these two modules had a larger test coverage,
and I'm inviting everyone who can to contribute to these unit tests.

> I hope that's not your intent; if it is then sorry, I don't buy it.

The module doesn't have tests for the
  RE_SYNTAX_AWK
  RE_SYNTAX_GNU_AWK
  RE_SYNTAX_POSIX_AWK
syntaxes. It's gawk which depends on the correct functioning of these
syntaxes, not glibc, not grep, not sed, not emacs. Therefore IMO if the
gawk developers don't contribute some test cases for these syntaxes, no
one will.
(I certainly won't, because I find writing tests a bit boring, and I
don't see why I should have the "boring" part whereas others have the
"fun" part :-) )

Bruno