[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Transliteration for use in UTF-8 locales



The sort of cases I'm worrying about are:

 - sending data to a child process through command-line arguments and
   pipes, e.g. mutt talking to gpg

 - sending strings to a library, e.g. mutt talking to curses

It seems to me that these automatic-transcription locales might cause
some problems.

Could you give an example of a situation in which they are
advantageous?

Also, where did the idea come from? None of the documents I have read
say anything about it. For example, the man page for wctomb contains
no warning that characters may be converted in a weird and wonderful
manner. For example, I read here that wctomb "converts the wide
character wc to its multibyte representation"; it doesn't say that it
"converts the wide character wc to the multibyte representation of a
sequence of wide characters that the locale designer feels may be more
appropriate for certain human readers".

I hope at least that wcwidth(wc) gives the appropriate result, i.e.
wcwidth(L'å') is 2 if 'å' is going to be transcribed as "aa" by
wctomb.

Edmund
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/