* diacrit: mark deprecated
@ 2019-01-21 1:11 Bruno Haible
2019-01-21 5:01 ` Jim Meyering
0 siblings, 1 reply; 4+ messages in thread
From: Bruno Haible @ 2019-01-21 1:11 UTC (permalink / raw)
To: bug-gnulib, Jim Meyering
Hi Jim,
You are listed as the maintainer of the 'diacrit' module. It doubt anyone is
still using this module, because it assumes an 8-bit character set, whereas
most systems have switched to UTF-8 10 to 18 years ago. Do you agree to mark
it deprecated?
diff --git a/modules/diacrit b/modules/diacrit
index 725d683..bda96f5 100644
--- a/modules/diacrit
+++ b/modules/diacrit
@@ -1,6 +1,12 @@
Description:
Decomposition of composed ISO-8859-1 characters.
+Status:
+deprecated
+
+Notice:
+This module is deprecated. Use the module 'uninorm/canonical-decomposition' instead.
+
Files:
lib/diacrit.h
lib/diacrit.c
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: diacrit: mark deprecated
2019-01-21 1:11 diacrit: mark deprecated Bruno Haible
@ 2019-01-21 5:01 ` Jim Meyering
2019-07-17 20:52 ` Bernhard Voelker
0 siblings, 1 reply; 4+ messages in thread
From: Jim Meyering @ 2019-01-21 5:01 UTC (permalink / raw)
To: Bruno Haible; +Cc: bug-gnulib@gnu.org List
On Sun, Jan 20, 2019 at 5:11 PM Bruno Haible <bruno@clisp.org> wrote:
> Hi Jim,
>
> You are listed as the maintainer of the 'diacrit' module. It doubt anyone is
> still using this module, because it assumes an 8-bit character set, whereas
> most systems have switched to UTF-8 10 to 18 years ago. Do you agree to mark
> it deprecated?
Hi Bruno, that's fine with me.
Thanks
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: diacrit: mark deprecated
2019-01-21 5:01 ` Jim Meyering
@ 2019-07-17 20:52 ` Bernhard Voelker
2019-07-17 21:31 ` Bruno Haible
0 siblings, 1 reply; 4+ messages in thread
From: Bernhard Voelker @ 2019-07-17 20:52 UTC (permalink / raw)
To: Jim Meyering, Bruno Haible; +Cc: bug-gnulib@gnu.org List, Coreutils
[adding coreutils: discussion at
https://lists.gnu.org/r/bug-gnulib/2019-01/msg00116.html ]
On 1/21/19 6:01 AM, Jim Meyering wrote:
> On Sun, Jan 20, 2019 at 5:11 PM Bruno Haible <bruno@clisp.org> wrote:
>> Hi Jim,
>>
>> You are listed as the maintainer of the 'diacrit' module. It doubt anyone is
>> still using this module, because it assumes an 8-bit character set, whereas
>> most systems have switched to UTF-8 10 to 18 years ago. Do you agree to mark
>> it deprecated?
>
> Hi Bruno, that's fine with me.
> Thanks
In GNU coreutils, we now get this warning during bootstrap:
Notice from module diacrit:
This module is deprecated. Use the module 'uninorm/canonical-decomposition' instead.
And indeed, the 'diacrit' module is still in use by 1 source:
$ GIT_PAGER= git grep -En 'todiac|tobase'
src/ptx.c:1053: diacritic = todiac (character);
src/ptx.c:1056: base = tobase (character);
src/ptx.c:1338: edited_flag[character] = todiac (character) != 0;
Have a nice day,
Berny
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: diacrit: mark deprecated
2019-07-17 20:52 ` Bernhard Voelker
@ 2019-07-17 21:31 ` Bruno Haible
0 siblings, 0 replies; 4+ messages in thread
From: Bruno Haible @ 2019-07-17 21:31 UTC (permalink / raw)
To: Bernhard Voelker; +Cc: bug-gnulib, Jim Meyering, coreutils
Bernhard Voelker wrote:
> In GNU coreutils, we now get this warning during bootstrap:
>
> Notice from module diacrit:
> This module is deprecated. Use the module 'uninorm/canonical-decomposition' instead.
>
> And indeed, the 'diacrit' module is still in use by 1 source:
>
> $ GIT_PAGER= git grep -En 'todiac|tobase'
> src/ptx.c:1053: diacritic = todiac (character);
> src/ptx.c:1056: base = tobase (character);
> src/ptx.c:1338: edited_flag[character] = todiac (character) != 0;
Indeed, 'ptx' does not yet support multibyte locales.
$ echo "Böse Bübchen" | ptx -
Böse Bübchen
Böse Bübchen
Böse Bü bchen
Bö se Bübchen
It looks even weirder with Greek input:
$ echo "Το τέλος του Ψυχρού Πολέμου και η διάλυση της Σοβιετικής Ένωσης άφησαν τις Ηνωμένες Πολιτείες για ένα διάστημα ως τη μόνη υπερδύναμη." | ptx -
(No output at all!)
Bruno
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2019-07-17 21:31 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-01-21 1:11 diacrit: mark deprecated Bruno Haible
2019-01-21 5:01 ` Jim Meyering
2019-07-17 20:52 ` Bernhard Voelker
2019-07-17 21:31 ` Bruno Haible
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).