[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: perl unicode support
Rich Felker wrote:
>
> For example, the unix "cut" program works automatically with UTF-8
> text as long as the delimiter is a single byte, and if you want
> multibyte delimiters, all you need to do is make it accept a multibyte
> delimeter character and then do a substring search instead of a byte
> search. There is no need to ever treat the input string as characters,
> and in fact doing so just makes it slow and bloated.
cut -c2-3 ...
Daniel
--
Daniel Barclay
dsb@xxxxxxxxx
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/