[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: perl unicode support
Rich Felker wrote:
>
> On Tue, Mar 27, 2007 at 11:53:15PM -0400, ï¼³ï½?ï½?ï½?ï¼´ï½*ï½?ï½? wrote:
...
ï¼µï½?ï½?ï½?ï½?ï½?ã??ï½?ï½?ï½*ã??ï½?ï½?ï½?ï½?ã??ï½?ï½?ï½-ï½?ï½?ï½?ï½?ã??ï½*ï½?ï½?ï¼?ï¼~ã??
> > ï½?ï½?ï½?ï½*ï½?ï½?ï½fï½?ï½?ã??ï½?ï½?ã??ï½?ï½?ï½*ï½?ã??ï½?ï½?ï½?ï½*ï½?ï½?ï½?ã??
>
> Haha, was it your intent to use this huge japanese wide ascii? :)
> Sadly I don't think Daniel can read anything but Latin-1...
Well, actually _I_ can (slowly), but right, my mailer can't.
> Here's an ascii transliteration...
> ~Rich
>
> On Tue, Mar 27, 2007 at 11:53:15PM -0400, SrinTuar wrote:
...
> > The regular expression engine should be utf-8 aware.
Of course! That's why I've been arguing that byte-based regular
expression processing won't work with UTF-8. (E.g., if the match-
any-character symbol "." matches any byte, it will break multi-byte
sequences at places that aren't character boundaries.)
Daniel
--
Daniel Barclay
dsb@xxxxxxxxx
â?ìTL_"×Éè|¸§»'jYez÷¥°
Ü?+Þ?Ûiÿù??Yå?)îÆ?àþX§»Ï