[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Unicode in Emacs again



Kenichi Handa <handa@xxxxxxxxx> writes:

> Florian Weimer <fw@xxxxxxxxxxxxx> writes:
>> What does 'via surrogate pair' mean?  I guess the second line should
>> read:
>
>>>    00 xxxx xxxxxxxx xxxxxxxx   Unicode 20bit (U+10000 - U+FFFFF)
>
> Yes.   That's correct, and the third line shoud read as below:
>
>    01 0000 xxxxxxxx xxxxxxxx   Unicode 20bit (U+100000 - U+10FFFF)

I'm still not convinced it's correct.  My current understanding is
that it should be:

  00 xxxx xxxxxxxx xxxxxxxx   Unicode 20 bit       (U+000000 - U+0FFFFF)
  01 0000 xxxxxxxx xxxxxxxx   Unicode 20.08... bit (U+100000 - U+10FFFF)

I'm currently reading the emacs-unicode mailing list, and it seems a
few essential issues weren't on the horizon back then.  Shall I send a
comment to the emacs-unicode mailing list if I'm finished?
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/