[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Linux console UTF-8 by default
Edward H. Trager wrote:
On Saturday 2004.01.10 20:48:31 +0330, Roozbeh Pournader wrote:
On Sat, 2004-01-10 at 20:36, Edward H. Trager wrote:
Is there any good reason why implementors would not support the
full range of Unicode -- i.e., UTF-8 up to six serialized bytes?
UTF-8 up to four bytes, you mean. See
<http://www.faqs.org/rfcs/rfc3629.html>.
I guess I was recalling (from http://www.cl.cam.ac.uk/~mgk25/unicode.html)
that six bytes allows encoding all possible
2^31 UCS code points, although
I suppose nothing above plane 1 has been defined. - Ed Trager
Plane 2 has tens of thousands of Chinese characters and Plane 14 has
variation selectors and language tags. However, nothing will ever be
defined above Plane 16. JTC1/SC2/WG2 made a firm commitment to that.
Jungshik
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/