[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: Choosing right character representation for i18n issues



> De: Kai Großjohann
> Date: dimanche 24 octobre 1999 14:43
>
> I've got a certain application in mind and I'm wondering whether using
> UTF-8 as the underlying character representation in that application
> will be the right choice.  The application is full-text search.  What
> I have in mind is to do the following: each document is converted into
> UTF-8 for indexing...

You would do well to read Thierry Sourbier's paper on that subject presented
at the last Unicode conference.  The slides are online at
http://www.unicode.org/unicode/iuc/iuc15/b3/slides.ppt.

--
François Yergeau

-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/