[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: multibyte encodings other than UTF-8
On Tue, 2 Nov 1999, Edmund GRIMLEY EVANS wrote:
> This is off-topic, of course, but perhaps someone here can tell me if
> there are multibyte encodings in use, other than UTF-8, in which the
> number of octets used to represent a character is not equal to the
> number of character cells used to display it on a terminal?
> In fact a list of the multibyte encodings would be nice. I just know
> that there are several for each of Chinese, Japanese, Korean, and the
> only one I ever looked at in detail had number of octets = width on
> display, so you can handle it just like any 8-bit character set.
- Taiwan's CNS 11643 in EUC-TW encoding (2-3 bytes per full-width CJK
character); see http://www.cns11643.gov.tw/
- Taiwan's CCCII (3 bytes per full-width CJK character)
- USA's EACC, used in libraries (3? bytes per CJK character)
Thomas Chan
tc31@cornell.du
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/