From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on dcvr.yhbt.net X-Spam-Level: X-Spam-ASN: AS31976 209.132.180.0/23 X-Spam-Status: No, score=-5.7 required=3.0 tests=AWL,BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,RCVD_IN_DNSWL_HI,RP_MATCHES_RCVD shortcircuit=no autolearn=ham autolearn_force=no version=3.4.0 Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by dcvr.yhbt.net (Postfix) with ESMTP id 355561FF40 for ; Sat, 3 Dec 2016 13:21:21 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752096AbcLCNTo (ORCPT ); Sat, 3 Dec 2016 08:19:44 -0500 Received: from mx1.2b3w.ch ([92.42.186.250]:35518 "EHLO mx1.2b3w.ch" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751056AbcLCNTm (ORCPT ); Sat, 3 Dec 2016 08:19:42 -0500 Received: from mx1.2b3w.ch (localhost [127.0.0.1]) by mx1.2b3w.ch (Postfix) with ESMTP id 0277EC3479; Sat, 3 Dec 2016 14:19:39 +0100 (CET) Received: from drbeat.li (215-243-153-5.dyn.cable.fcom.ch [5.153.243.215]) by mx1.2b3w.ch (Postfix) with ESMTPSA id D2F7FC346D; Sat, 3 Dec 2016 14:19:38 +0100 (CET) Received: by drbeat.li (Postfix, from userid 1000) id AE106201A7; Sat, 3 Dec 2016 14:19:38 +0100 (CET) From: Beat Bolli To: git@vger.kernel.org Cc: Beat Bolli , =?UTF-8?q?Torsten=20B=C3=B6gershausen?= Subject: [PATCH v3 2/3] update-unicode.sh: strip the plane offsets from the double_width[] table Date: Sat, 3 Dec 2016 14:19:32 +0100 Message-Id: <1480771173-731-2-git-send-email-dev+git@drbeat.li> X-Mailer: git-send-email 2.7.2 In-Reply-To: <1480771173-731-1-git-send-email-dev+git@drbeat.li> References: <1480762392-28731-3-git-send-email-dev+git@drbeat.li> <1480771173-731-1-git-send-email-dev+git@drbeat.li> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Virus-Scanned: ClamAV using ClamSMTP Sender: git-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org The function bisearch() in utf8.c does a pure binary search in double_width. It does not care about the 17 plane offsets which unicode/uniset/uniset prepends. Leaving the plane offsets in the table may cause wrong results. Filter out the plane offsets in update-unicode.sh. Cc: Torsten Bögershausen Signed-off-by: Beat Bolli --- update_unicode.sh | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/update_unicode.sh b/update_unicode.sh index 3c84270..4c1ec8d 100755 --- a/update_unicode.sh +++ b/update_unicode.sh @@ -30,7 +30,7 @@ fi && grep -v plane) }; static const struct interval double_width[] = { - $(uniset/uniset --32 eaw:F,W) + $(uniset/uniset --32 eaw:F,W | grep -v plane) }; EOF ) -- 2.7.2