bug-gnulib@gnu.org mirror (unofficial)
* diacrit: mark deprecated
@ 2019-01-21  1:11 Bruno Haible
  2019-01-21  5:01 ` Jim Meyering
  0 siblings, 1 reply; 4+ messages in thread
From: Bruno Haible @ 2019-01-21  1:11 UTC (permalink / raw)
  To: bug-gnulib, Jim Meyering

Hi Jim,

You are listed as the maintainer of the 'diacrit' module. I doubt anyone is
still using this module, because it assumes an 8-bit character set, whereas
most systems switched to UTF-8 10 to 18 years ago. Do you agree to mark
it deprecated?
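
To illustrate the 8-bit assumption, here is a minimal sketch. It is not the
actual diacrit code: the names toy_tobase and toy_strip are hypothetical and
only three letters are covered. A lookup keyed on a single byte can map the
ISO-8859-1 byte 0xF6 ('ö') to 'o', but in UTF-8 the same character is the
two-byte sequence 0xC3 0xB6, so a per-byte lookup never sees it as one
character.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-byte lookup in the style of an 8-bit decomposition
   table.  Only a few ISO-8859-1 letters are handled in this sketch.  */
unsigned char
toy_tobase (unsigned char c)
{
  switch (c)
    {
    case 0xE4: return 'a';   /* a with diaeresis in ISO-8859-1 */
    case 0xF6: return 'o';   /* o with diaeresis in ISO-8859-1 */
    case 0xFC: return 'u';   /* u with diaeresis in ISO-8859-1 */
    default:   return c;     /* every other byte is left unchanged */
    }
}

/* Apply toy_tobase to each of the N bytes of S, writing N bytes to OUT.  */
void
toy_strip (const unsigned char *s, size_t n, unsigned char *out)
{
  assert (s != NULL && out != NULL);
  for (size_t i = 0; i < n; i++)
    out[i] = toy_tobase (s[i]);
}
```

Fed the ISO-8859-1 bytes of "Böse" (42 F6 73 65), toy_strip yields "Bose".
Fed the UTF-8 bytes (42 C3 B6 73 65), it leaves C3 and B6 untouched, because
neither byte on its own is a letter the lookup knows about.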


diff --git a/modules/diacrit b/modules/diacrit
index 725d683..bda96f5 100644
--- a/modules/diacrit
+++ b/modules/diacrit
@@ -1,6 +1,12 @@
 Description:
 Decomposition of composed ISO-8859-1 characters.
 
+Status:
+deprecated
+
+Notice:
+This module is deprecated. Use the module 'uninorm/canonical-decomposition' instead.
+
 Files:
 lib/diacrit.h
 lib/diacrit.c




* Re: diacrit: mark deprecated
  2019-01-21  1:11 diacrit: mark deprecated Bruno Haible
@ 2019-01-21  5:01 ` Jim Meyering
  2019-07-17 20:52   ` Bernhard Voelker
  0 siblings, 1 reply; 4+ messages in thread
From: Jim Meyering @ 2019-01-21  5:01 UTC (permalink / raw)
  To: Bruno Haible; +Cc: bug-gnulib@gnu.org List

On Sun, Jan 20, 2019 at 5:11 PM Bruno Haible <bruno@clisp.org> wrote:
> Hi Jim,
>
> You are listed as the maintainer of the 'diacrit' module. I doubt anyone is
> still using this module, because it assumes an 8-bit character set, whereas
> most systems switched to UTF-8 10 to 18 years ago. Do you agree to mark
> it deprecated?

Hi Bruno, that's fine with me.
Thanks



* Re: diacrit: mark deprecated
  2019-01-21  5:01 ` Jim Meyering
@ 2019-07-17 20:52   ` Bernhard Voelker
  2019-07-17 21:31     ` Bruno Haible
  0 siblings, 1 reply; 4+ messages in thread
From: Bernhard Voelker @ 2019-07-17 20:52 UTC (permalink / raw)
  To: Jim Meyering, Bruno Haible; +Cc: bug-gnulib@gnu.org List, Coreutils

[adding coreutils: discussion at
 https://lists.gnu.org/r/bug-gnulib/2019-01/msg00116.html ]

On 1/21/19 6:01 AM, Jim Meyering wrote:
> On Sun, Jan 20, 2019 at 5:11 PM Bruno Haible <bruno@clisp.org> wrote:
>> Hi Jim,
>>
>> You are listed as the maintainer of the 'diacrit' module. I doubt anyone is
>> still using this module, because it assumes an 8-bit character set, whereas
>> most systems switched to UTF-8 10 to 18 years ago. Do you agree to mark
>> it deprecated?
> 
> Hi Bruno, that's fine with me.
> Thanks

In GNU coreutils, we now get this warning during bootstrap:

  Notice from module diacrit:
    This module is deprecated. Use the module 'uninorm/canonical-decomposition' instead.

And indeed, the 'diacrit' module is still in use by one source file:

  $ GIT_PAGER= git grep -En 'todiac|tobase'
  src/ptx.c:1053:          diacritic = todiac (character);
  src/ptx.c:1056:              base = tobase (character);
  src/ptx.c:1338:        edited_flag[character] = todiac (character) != 0;

Have a nice day,
Berny



* Re: diacrit: mark deprecated
  2019-07-17 20:52   ` Bernhard Voelker
@ 2019-07-17 21:31     ` Bruno Haible
  0 siblings, 0 replies; 4+ messages in thread
From: Bruno Haible @ 2019-07-17 21:31 UTC (permalink / raw)
  To: Bernhard Voelker; +Cc: bug-gnulib, Jim Meyering, coreutils

Bernhard Voelker wrote:
> In GNU coreutils, we now get this warning during bootstrap:
> 
>   Notice from module diacrit:
>     This module is deprecated. Use the module 'uninorm/canonical-decomposition' instead.
> 
> And indeed, the 'diacrit' module is still in use by one source file:
> 
>   $ GIT_PAGER= git grep -En 'todiac|tobase'
>   src/ptx.c:1053:          diacritic = todiac (character);
>   src/ptx.c:1056:              base = tobase (character);
>   src/ptx.c:1338:        edited_flag[character] = todiac (character) != 0;

Indeed, 'ptx' does not yet support multibyte locales.

$ echo "Böse Bübchen" | ptx -
                                       Böse Bübchen
                               Böse   Bübchen
                           Böse Bü   bchen
                                 Bö   se Bübchen

It looks even weirder with Greek input:

$ echo "Το τέλος του Ψυχρού Πολέμου και η διάλυση της Σοβιετικής Ένωσης άφησαν τις Ηνωμένες Πολιτείες για ένα διάστημα ως τη μόνη υπερδύναμη." | ptx -

(No output at all!)
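
The splitting seen in the German example is what one would expect from any
scanner that classifies input one byte at a time. The following is a minimal
sketch under that assumption. It is not taken from ptx.c, and the function
names are hypothetical. The second variant is deliberately crude: it accepts
every byte >= 0x80 as a word constituent instead of decoding characters.

```c
#include <assert.h>
#include <stddef.h>

/* Byte-oriented rule: only ASCII letters are word constituents.  */
int
ascii_letter (unsigned char c)
{
  return (c >= 'A' && c <= 'Z') || (c >= 'a' && c <= 'z');
}

/* Length in bytes of the word at the start of S, byte-oriented rule.  */
size_t
word_len_bytewise (const char *s)
{
  assert (s != NULL);
  size_t n = 0;
  while (ascii_letter ((unsigned char) s[n]))
    n++;
  return n;
}

/* Same scan, but bytes >= 0x80 (which in UTF-8 only occur inside
   multibyte sequences) are also accepted, so such sequences are never
   split.  This treats every non-ASCII character as a letter.  */
size_t
word_len_utf8 (const char *s)
{
  assert (s != NULL);
  size_t n = 0;
  while (ascii_letter ((unsigned char) s[n])
         || (unsigned char) s[n] >= 0x80)
    n++;
  return n;
}
```

For the UTF-8 bytes of "Bübchen" (42 C3 BC 62 63 68 65 6E), the byte-oriented
scan stops after the first byte, while the second variant covers all 8 bytes.
Real multibyte support would decode the input into characters, for example
with mbrtowc, and classify those instead.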

Bruno
