
Re: Transliteration for use in UTF-8 locales



Markus Kuhn wrote:

> The ISO TR 14652 transliteration mechanism (which is already partly
> implemented in glibc 2.1.95) was probably primarily intended for people
> whose I/O devices cannot handle UTF-8 and want to see as much as
> possible of the wide character information in their 7/8-bit coding
> system. If you have a UTF-8 locale, every wide character can uniquely
> and without loss of information be converted into a multi-byte
> character, and no transliteration seems necessary at first sight.

Will iconv automatically use that mechanism?

For example, if I use iconv to convert some UTF-8 data to ISO-2022-JP,
which supports the characters in US-ASCII but not those in ISO-8859-1,
will transliteration happen automatically?
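
To make the question concrete, here is a minimal sketch in Python. It
uses Python's built-in iso2022_jp codec rather than iconv itself, so it
only illustrates the situation, not glibc's behaviour. The FALLBACK
table and both helper functions are hypothetical names made up for this
example.

```python
# 'é' exists in ISO-8859-1 but has no representation in ISO-2022-JP.
TEXT = "café"

# Hypothetical one-entry transliteration table, for illustration only.
FALLBACK = {"é": "e"}


def try_encode(s, encoding="iso2022_jp"):
    """Return the encoded bytes, or None if some character cannot be encoded."""
    try:
        return s.encode(encoding)
    except UnicodeEncodeError:
        return None


def transliterate(s, table=FALLBACK):
    """Replace each character found in the table, leave the rest untouched."""
    return "".join(table.get(ch, ch) for ch in s)


if __name__ == "__main__":
    print(try_encode(TEXT))                 # strict conversion fails
    print(try_encode(transliterate(TEXT)))  # succeeds after the fallback
```

Without a fallback the conversion simply fails on the accented letter.
What I would like to know is whether iconv will apply such a fallback on
its own, or only when asked to.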

How many transliteration tables will be included in the standard?
How does one choose which kind of transliteration to apply, or create
new methods?

For example, in the case of Japanese kana, two different methods of
transliteration to the Latin alphabet are commonly used: Hepburn and the
official Japanese method.
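
A small sketch of how the two systems diverge, assuming the official
method meant here is Kunrei-shiki. The tables cover only five hiragana
and the function name romanize is made up for this example; it is not
part of any proposed locale syntax.

```python
# Partial tables: only kana where the two systems disagree.
HEPBURN = {"し": "shi", "ち": "chi", "つ": "tsu", "ふ": "fu", "じ": "ji"}
KUNREI = {"し": "si", "ち": "ti", "つ": "tu", "ふ": "hu", "じ": "zi"}


def romanize(text, table):
    """Transliterate kana by kana; characters not in the table pass through."""
    return "".join(table.get(ch, ch) for ch in text)


if __name__ == "__main__":
    for word in ("ふじ", "つち"):
        print(word, romanize(word, HEPBURN), romanize(word, KUNREI))
```

The same input gives "fuji" under one table and "huzi" under the other,
so a single built-in table per locale would not be enough; the user has
to be able to select one.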

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/