Re: UTF-8 as the single common encoding everywhere
On Thu, Jun 07, 2001 at 04:15:32AM +0430, Roozbeh Pournader wrote:
>
>
> On 6 Jun 2001, H. Peter Anvin wrote:
>
> > There is only one UTF-8.
>
> But which is that? The one described in RFC 2279, the one in ISO
> 10646-1:2000, or the one in Unicode 3.1? These are different.
Yes, that is right. The UTF-8 in ISO 10646 and RFC 2279 covers
a 31-bit space (sequences of up to six bytes), while the UTF-8
in Unicode 3.1 covers only a 21-bit space (at most four bytes).
I think we should go with the RFC/ISO version.
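To make the difference concrete, here is a rough sketch in Python of
the RFC 2279 style encoding. The function names are my own, purely for
illustration, and this is not taken from any of the three documents.
It only shows how many bytes a value needs and why anything above the
21-bit range requires the five- and six-byte forms.

```python
# Upper bounds (exclusive) for sequences of 1..6 bytes in the
# RFC 2279 / ISO 10646 scheme. Illustrative sketch only.
LIMITS = [0x80, 0x800, 0x10000, 0x200000, 0x4000000, 0x80000000]


def utf8_length_rfc2279(cp: int) -> int:
    """Return the number of bytes needed for a value in the 31-bit space."""
    if cp < 0 or cp > 0x7FFFFFFF:
        raise ValueError("value is outside the 31-bit space")
    for n, limit in enumerate(LIMITS, start=1):
        if cp < limit:
            return n
    raise AssertionError("unreachable")


def encode_rfc2279(cp: int) -> bytes:
    """Encode one value with the 1..6 byte scheme (hypothetical helper)."""
    n = utf8_length_rfc2279(cp)
    if n == 1:
        return bytes([cp])
    out = bytearray()
    # Emit the n-1 continuation bytes, six payload bits each, low bits first.
    for _ in range(n - 1):
        out.insert(0, 0x80 | (cp & 0x3F))
        cp >>= 6
    # The lead byte carries n one-bits, a zero bit, then the remaining bits.
    lead = (0xFF << (8 - n)) & 0xFF
    out.insert(0, lead | cp)
    return bytes(out)


def fits_in_21_bits(cp: int) -> bool:
    """True if the value can be written in at most four bytes."""
    return utf8_length_rfc2279(cp) <= 4
```

With this sketch, encode_rfc2279(0x20AC) gives the usual three bytes
E2 82 AC, whereas a value such as 0x200000 already needs five bytes
and so has no representation in a UTF-8 restricted to 21 bits.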
Kind regards
Keld
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/