From: Arsen Arsenović
To: Paul Eggert
Cc: arnold@skeeve.com, sam@gentoo.org, concord@gentoo.org, bug-gawk@gnu.org, bug-gnulib@gnu.org
Subject: Re: Clang-built Gawk 5.2.1 regex oddity
Date: Fri, 06 Jan 2023 00:06:56 +0100
Message-ID: <87bkncy3vl.fsf@aarsen.me>
In-reply-to: <1cdb3d41-d675-cca1-7498-c3840f2ee8ad@cs.ucla.edu>
References: <20221230000119.hyui6umnspuyzqum@bubbles> <202212300913.2BU9DV6V030160@freefriends.org> <6DADB0FC-87EE-4028-91DF-C93A968A8982@gentoo.org> <202301011906.301J6ROQ018104@freefriends.org> <1cdb3d41-d675-cca1-7498-c3840f2ee8ad@cs.ucla.edu>
List-Id: Gnulib discussion list

Hi,

Paul Eggert writes:

> This is a serious bug in Clang: it generates incorrect machine code.
>
> The code that Clang generates for the following (gawk/support/dfa.c lines
> 1141-1143):
>
>   ((dfa->syntax.dfaopts & DFA_CONFUSING_BRACKETS_ERROR
>     ? dfaerror : dfawarn)
>    (_("character class syntax is [[:space:]], not [:space:]")));
>
> is immediately followed by the code generated for the following
> (gawk/support/dfa.c line 1015):
>
>   dfaerror (_("invalid character class"));
>
> and this is incorrect because the two source code regions are not connected
> with each other.

This is now fixed in Clang:
https://reviews.llvm.org/rGcf8fd210a35c8e93136cb8edc5c6a2e818dc1b1d

Happy hacking!

-- 
Arsen Arsenović