RE: Choosing right character representation for i18n issues
> From: Kai Großjohann
> Date: Sunday, 24 October 1999 14:43
>
> I've got a certain application in mind and I'm wondering whether using
> UTF-8 as the underlying character representation in that application
> will be the right choice. The application is full-text search. What
> I have in mind is to do the following: each document is converted into
> UTF-8 for indexing...
You would do well to read Thierry Sourbier's paper on that subject presented
at the last Unicode conference. The slides are online at
http://www.unicode.org/unicode/iuc/iuc15/b3/slides.ppt.
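
For the conversion step described in the quoted message (each document
converted into UTF-8 before indexing), here is a minimal sketch. It assumes
the source encoding of each document is already known; the function name
to_utf8 and the sample data are hypothetical, chosen only for illustration,
and nothing here is taken from the paper mentioned above.

```python
def to_utf8(raw: bytes, source_encoding: str) -> bytes:
    """Transcode raw document bytes from source_encoding to UTF-8.

    Assumption: the caller knows the document's encoding. Detecting an
    unknown encoding is a separate problem and is not attempted here.
    """
    # Decode to Unicode text first; this raises UnicodeDecodeError if the
    # bytes are not valid in the declared encoding.
    text = raw.decode(source_encoding)
    # Re-encode as UTF-8, the representation the indexer would work on.
    return text.encode("utf-8")


# Hypothetical sample: a Latin-1 document containing a non-ASCII letter.
latin1_doc = "Großjohann".encode("iso-8859-1")
utf8_doc = to_utf8(latin1_doc, "iso-8859-1")
```

Note that the byte length changes in the conversion: the "ß" occupies one
byte in ISO-8859-1 and two in UTF-8, so any byte offsets stored in an index
refer to the UTF-8 form, not to the original file.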
--
François Yergeau
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/