Re: Linux console UTF-8 by default
On Sat, 2004-01-10 at 23:51, Edward H. Trager wrote:
> I guess I was recalling (from http://www.cl.cam.ac.uk/~mgk25/unicode.html)
> that six bytes allows encoding all possible
> 2^31 UCS code points, although
> I suppose nothing above plane 1 has been defined.
1. That page is a little out of date (although a wonderful resource).
2. Although UCS theoretically allows 2^31 code points, it will never
encode any character higher than U+10FFFF.
3. Some characters are already assigned in planes 2, 14, 15, and 16 (the
last two for private use characters). There are plans to use plane 3
as well.
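
To make the byte counts concrete, here is a rough Python sketch (not taken
from any particular library; the names utf8_length and plane are made up
for illustration). It assumes the original six-byte UTF-8 layout that
covers all 2^31 code points, and shows that nothing up to U+10FFFF needs
more than four bytes:

```python
# Upper bound of the code point range covered by 1..6 byte sequences
# in the original UTF-8 layout (the one that reaches 2^31 - 1).
UTF8_LIMITS = [0x7F, 0x7FF, 0xFFFF, 0x1FFFFF, 0x3FFFFFF, 0x7FFFFFFF]


def utf8_length(cp: int) -> int:
    """Return how many bytes the original UTF-8 scheme needs for cp."""
    if cp < 0 or cp > UTF8_LIMITS[-1]:
        raise ValueError("code point outside the 31-bit UCS range")
    for nbytes, limit in enumerate(UTF8_LIMITS, start=1):
        if cp <= limit:
            return nbytes
    raise AssertionError("unreachable")


def plane(cp: int) -> int:
    """Return the plane number (each plane holds 0x10000 code points)."""
    return cp >> 16


if __name__ == "__main__":
    for cp in (0x41, 0x20000, 0x10FFFF, 0x7FFFFFFF):
        print(f"U+{cp:04X}: plane {plane(cp)}, {utf8_length(cp)} byte(s)")
```

So the five- and six-byte forms only matter for values above U+10FFFF,
which will never be assigned.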
roozbeh
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/