Date: Thu, 31 Aug 2023 03:33:07 +0000 (UTC)
From: "jeremyevans0 (Jeremy Evans) via ruby-core"
To: ruby-core@ml.ruby-lang.org
Cc: "jeremyevans0 (Jeremy Evans)"
Reply-To: Ruby developers
Subject: [ruby-core:114602] [Ruby master Bug#19007] Unicode tables differences from Unicode.org 14.0 data

Issue #19007 has been updated by jeremyevans0 (Jeremy Evans).

@duerst Did this issue reoccur in the update to Unicode 15? If not, do you think this can be closed?

----------------------------------------
Bug #19007: Unicode tables differences from Unicode.org 14.0 data
https://bugs.ruby-lang.org/issues/19007#change-104423

* Author: nobu (Nobuyoshi Nakada)
* Status: Open
* Priority: Normal
* Assignee: duerst (Martin Dürst)
* ruby -v: 3.2.0 6898984f1cd
* Backport: 2.7: DONTNEED, 3.0: DONTNEED, 3.1: DONTNEED
----------------------------------------
I found that the header in the Unicode Emoji 14.0 data files had changed slightly (and again in 15.0), but `enc/unicode/case-folding.rb` did not follow the change.
After I fixed that and rebuilt the headers under `enc/unicode/14.0.0`, `name2ctype.h` differed from master, as shown below.

The differences in `CR_Lower`, `CR_Cased` and `CR_Other_Lowercase` seem to be entries that were simply missed in the previous update, and are not a problem.

But U+11720..U+11721 in `CR_Grapheme_Cluster_Break_SpacingMark` is absent from the original Unicode.org data.
According to @naruse's investigation, it was removed in the commit [Update to Unicode 14.0.0], while U+11720 is still SpacingMark in the latest https://www.unicode.org/reports/tr29/.

[Update to Unicode 14.0.0]: https://github.com/latex3/unicode-data/commit/5570040ac8a30e2c2ca4912d415ecaa0498fa23a#diff-1e957b94de10ea96d32a338c005b1f05788af458cf335fc92683bc297e53ed94L582

```diff
diff --git a/enc/unicode/14.0.0/name2ctype.h b/enc/unicode/14.0.0/name2ctype.h
index 99a3eeca190..f49e5cd7273 100644
--- a/enc/unicode/14.0.0/name2ctype.h
+++ b/enc/unicode/14.0.0/name2ctype.h
@@ -1565,7 +1565,7 @@ static const OnigCodePoint CR_Graph[] = {
 
 /* 'Lower': [[:Lower:]] */
 static const OnigCodePoint CR_Lower[] = {
-	664,
+	668,
 	0x0061, 0x007a,
 	0x00aa, 0x00aa,
 	0x00b5, 0x00b5,
@@ -2196,6 +2196,10 @@ static const OnigCodePoint CR_Lower[] = {
 	0x105a3, 0x105b1,
 	0x105b3, 0x105b9,
 	0x105bb, 0x105bc,
+	0x10780, 0x10780,
+	0x10783, 0x10785,
+	0x10787, 0x107b0,
+	0x107b2, 0x107ba,
 	0x10cc0, 0x10cf2,
 	0x118c0, 0x118df,
 	0x16e60, 0x16e7f,
@@ -12651,7 +12655,7 @@ static const OnigCodePoint CR_Math[] = {
 
 /* 'Cased': Derived Property */
 static const OnigCodePoint CR_Cased[] = {
-	151,
+	155,
 	0x0041, 0x005a,
 	0x0061, 0x007a,
 	0x00aa, 0x00aa,
@@ -12763,6 +12767,10 @@ static const OnigCodePoint CR_Cased[] = {
 	0x105a3, 0x105b1,
 	0x105b3, 0x105b9,
 	0x105bb, 0x105bc,
+	0x10780, 0x10780,
+	0x10783, 0x10785,
+	0x10787, 0x107b0,
+	0x107b2, 0x107ba,
 	0x10c80, 0x10cb2,
 	0x10cc0, 0x10cf2,
 	0x118a0, 0x118df,
@@ -22615,7 +22623,7 @@ static const OnigCodePoint CR_Extender[] = {
 
 /* 'Other_Lowercase': Binary Property */
 static const OnigCodePoint CR_Other_Lowercase[] = {
-	20,
+	24,
 	0x00aa, 0x00aa,
 	0x00ba, 0x00ba,
 	0x02b0, 0x02b8,
@@ -22636,6 +22644,10 @@ static const OnigCodePoint CR_Other_Lowercase[] = {
 	0xa770, 0xa770,
 	0xa7f8, 0xa7f9,
 	0xab5c, 0xab5f,
+	0x10780, 0x10780,
+	0x10783, 0x10785,
+	0x10787, 0x107b0,
+	0x107b2, 0x107ba,
 }; /* CR_Other_Lowercase */
 
 /* 'Other_Uppercase': Binary Property */
@@ -37049,7 +37061,7 @@ static const OnigCodePoint CR_Grapheme_Cluster_Break_Extend[] = {
 
 /* 'Grapheme_Cluster_Break_SpacingMark': Grapheme_Cluster_Break=SpacingMark */
 static const OnigCodePoint CR_Grapheme_Cluster_Break_SpacingMark[] = {
-	161,
+	160,
 	0x0903, 0x0903,
 	0x093b, 0x093b,
 	0x093e, 0x0940,
@@ -37183,7 +37195,6 @@ static const OnigCodePoint CR_Grapheme_Cluster_Break_SpacingMark[] = {
 	0x116ac, 0x116ac,
 	0x116ae, 0x116af,
 	0x116b6, 0x116b6,
-	0x11720, 0x11721,
 	0x11726, 0x11726,
 	0x1182c, 0x1182e,
 	0x11838, 0x11838,
```
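
In the diff above, the leading number of each `CR_*` array moves in step with the ranges added or removed: `CR_Other_Lowercase` goes from 20 to 24 when four ranges are added, and `CR_Grapheme_Cluster_Break_SpacingMark` goes from 161 to 160 when one is removed. This suggests the first element is a count of the from/to pairs that follow. The Ruby sketch below models that layout under this assumption. The helper names (`build_table`, `consistent?`, `member?`) are hypothetical and not part of Ruby or Onigmo, and `EXCERPT_RANGES` is only the three context ranges visible in the diff, not the full table.

```ruby
# Three ranges copied from the diff context of CR_Other_Lowercase.
# This is an excerpt for illustration, not the complete table.
EXCERPT_RANGES = [
  [0xa770, 0xa770],
  [0xa7f8, 0xa7f9],
  [0xab5c, 0xab5f],
].freeze

# The four ranges the rebuilt header adds, copied from the diff.
ADDED_RANGES = [
  [0x10780, 0x10780],
  [0x10783, 0x10785],
  [0x10787, 0x107b0],
  [0x107b2, 0x107ba],
].freeze

# Build a flat table shaped like the C arrays (assumed layout):
# [number_of_ranges, from1, to1, from2, to2, ...]
def build_table(ranges)
  [ranges.size, *ranges.flatten]
end

# True when the leading count matches the number of pairs that follow.
def consistent?(table)
  count, *bounds = table
  bounds.size == count * 2
end

# Linear scan over the from/to pairs; good enough for a sketch.
def member?(table, code_point)
  _count, *bounds = table
  bounds.each_slice(2).any? { |from, to| code_point.between?(from, to) }
end

old_table = build_table(EXCERPT_RANGES)
new_table = build_table(EXCERPT_RANGES + ADDED_RANGES)

puts "range count: #{old_table.first} -> #{new_table.first}"
puts "U+10784 covered before: #{member?(old_table, 0x10784)}, after: #{member?(new_table, 0x10784)}"
```

A check like `consistent?` is one way to catch a table whose count was not regenerated together with its ranges.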