[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF8_STRING web page



Followup to:  <20010511154605.A20251@xxxxxxxxxxxxxxxxxxxxxxxxxx>
By author:    David Starner <dstarner98@xxxxxxxxxxxxx>
In newsgroup: linux.utf8
>
> On Fri, May 11, 2001 at 10:25:41PM +0200, Juliusz Chroboczek wrote:
> > Following a number of requests for stable links to a description of
> > the UTF8_STRING atom, I have put together a draft web page on
> > 
> >   http://www.pps.jussieu.fr/~jch/software/UTF8_STRING/
> > 
> > This page doesn't currently contain anything new, but I will try to
> > keep the location stable.  I would be grateful if you could update the
> > location of any links to the UTF8_STRING draft.
> 
> Why do you say "When restricted to the BMP, ... UTF-8 carries at most
> a 50% overhead over UTF-16"? Outside the BMP, UTF-8 takes 4 bytes and
> UTF-16 takes 4 bytes, so there should be no need for the qualifier.
> 

Probably he was thinking that UTF-8 can take up to 6 bytes, but UTF-16
can't even represent those characters, so it's pretty meaningless at
that stage...

	-hpa
-- 
<hpa@xxxxxxxxxxxxx> at work, <hpa@xxxxxxxxx> in private!
"Unix gives you enough rope to shoot yourself in the foot."
http://www.zytor.com/~hpa/puzzle.txt
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/