[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Character set tagging considered harmful
> Sounds like a good idea. Looking through the xterm code, I notice that it
> stores each character of the screen in two bytes (excluding color and
> attributes). This can't be the UTF-8 code, that would require three bytes.
> Since Vim does screen handling like xterm, I might end up doing the same.
what you're considering UTF-8 is the input/output (which indeed can be 3 or
more bytes) - the corresponding UCS data are 2-byte.
(Markus has requested a second plane, which would increase the 2-byte codes
to 4 - but that's a different matter)
--
Thomas E. Dickey
dickey@clark.net
http://www.clark.net/pub/dickey
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/