[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF-8 versus utf8



Markus Kuhn <mgk25@xxxxxxxxx> wrote:


>However, glibc normalizes encoding names internally.
>It might be nicer if glibc used instead of a normalization routine an
>internal table of known encoding names, and just matched with the
>normalization routine against the stored names, and if it recognizes
>a name such as "UTF-8", it should output that as the cannonical form,
>not the normalized "utf8". That would avoid lots of headaches.


 Yes. The same problem with russinan "KOI8-R" charset name.
Glibc uses very simplified "name mangling", so some charset aliases doesn't
fit to this scheme : CP-866 .vs. IBM-866 as example.
Fortunately, RedHat produces correct names for some charset as special case
(after glibc install)
Just do in your distributive :
$ mv  /usr/share/locale/en_US.utf8   /usr/share/locale/en_US.UTF-8
$ locale -a

 We should ask Ulrich Drepper to change this.

-- 
-=AV=-

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/