[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: multibyte encodings other than UTF-8



On Tue, 2 Nov 1999, Edmund GRIMLEY EVANS wrote:

> This is off-topic, of course, but perhaps someone here can tell me if
> there are multibyte encodings in use, other than UTF-8, in which the
> number of octets used to represent a character is not equal to the
> number of character cells used to display it on a terminal?
> In fact a list of the multibyte encodings would be nice. I just know
> that there are several for each of Chinese, Japanese, Korean, and the
> only one I ever looked at in detail had number of octets = width on
> display, so you can handle it just like any 8-bit character set.

- Taiwan's CNS 11643 in EUC-TW encoding (2-3 bytes per full-width CJK
  character); see http://www.cns11643.gov.tw/
- Taiwan's CCCII (3 bytes per full-width CJK character)
- USA's EACC, used in libraries (3? bytes per CJK character)


Thomas Chan
tc31@cornell.du

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/