Date: Tue, 08 Oct 2019 19:19:52 +0000
To: "libc-locales@sourceware.org", "libc-alpha@sourceware.org", Carlos O'Donell, Rafal Luzynski
From: "Diego (Egor) Kobylkin"
Cc: Marko Myllynen, Siddhesh Poyarekar
Subject: Re: [PATCH] locale/C-translit.h.in: Greek -> ASCII transliteration table [BZ #12031]
In-Reply-To: <15ng4NDEFJeZhF1FBBL6X6CB9aroE4hyWVKzRshoOYhTmf-Cj2U64VAczBw5-eTCL0PqD_Urr7Fjv0P1bZMtTIwmoE7kiaGesv6e6KJhB_U=@kobylkin.com>

Carlos, Rafal,

here is another patch for the ASCII transliteration bug [BZ #12031], this time for Greek.

You were instrumental in getting the other transliteration patch [BZ #2872] approved, so I want to make you aware of this one.

Just to be clear, it has nothing to do with Cyrillic. It is purely a Greek -> ASCII transliteration table, yet it has exactly the same structure as the patch for [BZ #2872].
So it would make sense for you two to re-run the same tests you did for [BZ #2872].

Since this is Greek, there may of course be other considerations as well. I am happy to hear about them from anyone at any time.

Best regards,
Egor

‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
On Wednesday, September 4, 2019 9:31 AM, Diego (Egor) Kobylkin wrote:

> Dear locale maintainers,
>
> This patch fixes glibc bug 12031 "iconv -t ascii//translit with Greek characters" [1]
> by adding Greek transliteration rows to locale/C-translit.h.in.
>
> This work follows the successfully committed patch for virtually the same
> bug, [BZ #2872], which concerned Cyrillic characters. [2]
>
> AFAIK there are many versions of transcription tables for Greek to ASCII.
> Given that the current iconv logic can only transliterate one-to-many but
> not many-to-many symbols, we take the "Standard" part of
> the Romanization_of_Greek#Modern_Greek table [3]
> and keep only the single-letter Greek graphemes. That "standard" seems to be
> close to ELOT 743 but is not identical to it.
>
> So we omit things like M and Μπ being transliterated as M and B respectively.
> Instead, Μπ is treated as two separate graphemes and transliterated as Mp.
>
> Here is the list of standards I have collected so far. There doesn't seem
> to be a way to harmonize them all into one, but if anyone wants to propose
> a solution, please do.
>
> - ΕΛΟΤ 743 https://www.teicrete.gr/users/kutrulis/Ergalia/ELOT743.htm Passports.
> - ISO 843 https://en.wikipedia.org/wiki/ISO_843
> - ALA-LC https://www.loc.gov/catdir/cpso/romanization/greek.pdf Book titles.
> - BGN/PCGN http://libraries.ucsd.edu/bib/fed/USBGN_romanization.pdf
> - http://geonames.nga.mil/gns/html/Romanization/Romanization_Greek.pdf Geographical names.
>
> Furthermore, to cover the whole U0370-U03FF Greek/Coptic Unicode range, I have
> asked around and made a best-effort transliteration for the remaining characters
> not covered by the above standards.
>
> Should you have better sources for the actual translit entries, please
> send your feedback!
>
> The patch is attached.
>
> Best regards,
> Egor Kobylkin
>
> https://sourceware.org/bugzilla/show_bug.cgi?id=12031 [1]
> https://sourceware.org/ml/libc-alpha/2019-07/msg00477.html [2]
> https://en.wikipedia.org/wiki/Romanization_of_Greek#Modern_Greek [3]
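
To illustrate the one-to-many limitation described in the quoted message, here is a minimal Python sketch. It is not glibc code and is not taken from the patch: the table GREEK_TO_ASCII, the function translit, and the "?" fallback are hypothetical names chosen for illustration. Only the Μ -> M and π -> p mappings follow from the "Mp" example in the message; the theta and psi rows are assumed from common romanization tables and may differ from the actual patch.

```python
# Hypothetical excerpt of a one-to-many Greek -> ASCII table.
# Each key is exactly ONE code point; each value may be several ASCII chars.
# Only the mu and pi rows are grounded in the message ("Mp"); the rest are
# illustrative assumptions, not entries copied from the patch.
GREEK_TO_ASCII = {
    "\u039c": "M",   # GREEK CAPITAL LETTER MU
    "\u03c0": "p",   # GREEK SMALL LETTER PI
    "\u03b1": "a",   # GREEK SMALL LETTER ALPHA (assumed)
    "\u03b8": "th",  # GREEK SMALL LETTER THETA (assumed, one-to-many)
    "\u03c8": "ps",  # GREEK SMALL LETTER PSI (assumed, one-to-many)
}


def translit(text, table=GREEK_TO_ASCII, fallback="?"):
    """Transliterate one code point at a time (no multi-character keys)."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            # Plain ASCII passes through unchanged.
            out.append(ch)
        else:
            # The lookup sees a single character only, so a digraph such
            # as "Μπ" can never be matched as a unit and mapped to "B".
            out.append(table.get(ch, fallback))
    return "".join(out)


print(translit("\u039c\u03c0"))  # Μπ -> "Mp", not "B"
print(translit("\u03b8\u03b1"))  # θα -> "tha"
```

Because the lookup key is always a single character, a digraph rule such as Μπ -> B would need a many-to-many matcher, which is why the table keeps only single-letter graphemes.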