
Re: UTF-8 as the single common encoding everywhere



On Thu, Jun 07, 2001 at 04:15:32AM +0430, Roozbeh Pournader wrote:
> 
> 
> On 6 Jun 2001, H. Peter Anvin wrote:
> 
> > There is only one UTF-8.
> 
> But which is that? The one described in RFC 2279, the one in ISO
> 10646-1:2000, or the one in Unicode 3.1? These are different.

Yes, that is right. The UTF-8 in ISO 10646 and RFC 2279 covers
a 31-bit space (sequences of up to six bytes), while Unicode 3.1
UTF-8 covers only a 21-bit space (sequences of up to four bytes).
I think we should go with the RFC/ISO version.
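
To make the 31-bit versus 21-bit difference concrete, here is a minimal
sketch in Python of the RFC 2279 byte layout. The function names
payload_bits and encode_rfc2279 are made up for this illustration and do
not come from any library; the sketch does no surrogate or other
validity checks.

```python
def payload_bits(length):
    """Code-point bits carried by a UTF-8 sequence of `length` bytes (RFC 2279 layout)."""
    if length == 1:
        return 7  # 0xxxxxxx
    # Lead byte: `length` one-bits, a zero bit, then 7 - length payload bits.
    # Each continuation byte (10xxxxxx) adds 6 more payload bits.
    return (7 - length) + 6 * (length - 1)


def encode_rfc2279(cp):
    """Encode an integer in 0..0x7FFFFFFF using the original 1..6 byte scheme."""
    if not 0 <= cp <= 0x7FFFFFFF:
        raise ValueError("outside the 31-bit space")
    if cp < 0x80:
        return bytes([cp])
    # Pick the shortest sequence length whose payload can hold the value.
    for length in range(2, 7):
        if cp < (1 << payload_bits(length)):
            break
    out = []
    # Emit continuation bytes from the low-order end, 6 bits at a time.
    for _ in range(length - 1):
        out.append(0x80 | (cp & 0x3F))
        cp >>= 6
    # Lead byte marker: 0xC0, 0xE0, 0xF0, 0xF8, 0xFC for lengths 2..6.
    lead_marker = (0xFF << (8 - length)) & 0xFF
    out.append(lead_marker | cp)
    return bytes(reversed(out))


if __name__ == "__main__":
    for n in range(1, 7):
        print(n, "bytes ->", payload_bits(n), "bits")
    print(encode_rfc2279(0x10FFFF).hex())    # last code point in the Unicode 3.1 range
    print(encode_rfc2279(0x7FFFFFFF).hex())  # last value in the RFC 2279 / ISO range
```

Four-byte sequences carry 21 bits, which is all the Unicode 3.1 form
needs; the five- and six-byte forms are what extend the RFC/ISO version
to 31 bits.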

Kind regards
Keld
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/