Re: grep is horribly slow in UTF-8 locales
Markus Kuhn wrote:
> b) relying entirely on ISO C's generic multi-byte functions, to make
> sure that even stateful monsters like the ISO 2022 encodings
> are supported equally.
mbrlen is used not because of ISO 2022 encodings (which are not
usable as locale encodings!), but because of the non-UTF-8 multibyte
encodings: EUC-JP, Big5, GB18030, etc.
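
To illustrate the point, here is a minimal sketch of how mbrlen lets code
step through a buffer one character at a time without knowing which
multibyte encoding the locale uses. It is not taken from grep; the helper
name count_chars is hypothetical, and the caller is assumed to have already
selected the locale with setlocale(LC_CTYPE, "").

```c
#include <stddef.h>
#include <string.h>
#include <wchar.h>

/* Hypothetical helper: count the characters in the first n bytes of s
   according to the current locale's encoding (EUC-JP, Big5, GB18030,
   UTF-8, ...). Returns (size_t)-1 on an invalid or truncated sequence. */
size_t count_chars(const char *s, size_t n)
{
    mbstate_t st;
    size_t count = 0;

    memset(&st, 0, sizeof st);      /* start in the initial conversion state */
    while (n > 0) {
        size_t len = mbrlen(s, n, &st);
        if (len == (size_t)-1 || len == (size_t)-2)
            return (size_t)-1;      /* invalid (-1) or incomplete (-2) sequence */
        if (len == 0)
            len = 1;                /* a NUL byte counts as one character */
        s += len;
        n -= len;
        count++;
    }
    return count;
}
```

The same loop works unchanged in every locale encoding, which is the reason
for going through the generic function; the price is one library call per
character.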
Bruno
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/