[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Utf-8 support in C functions on Linux



Hello Francois,

As your questions are not Web-related at all, may I suggest
that you don't copy www-international@xxxxxx for the moment?
If you have some Web and Internationalization related questions,
please feel free to come back here.

Regards, Martin.

At 09:48 01/12/13 -0500, Richard, Francois M wrote:
I have been posting quite regularly questions about utf-8 support/Locale on
Linux (and Solaris). Before asking one more, I would like to thank people
who contributed to theses discussions. Your feedback and replies have been
always very interesting and most of the time very valuable ;)... Thanks for
taking some of your time to answer.

We were doing some testing with a piece of C code on Linux (Locale sensitive
thanks to SetLocale() and with a system Locale set first to en_US.utf8 and
then to sv_SV.utf8) and it looks like magically strcoll() was sorting the
utf-8 file read in input(two characters only: çand z). So in en_US.utf8, çcame first, then z. And in sv_SV.uft8, z came first, then ç


Does it mean strcoll() properly handle utf-8 data??? I would be very
surprised. But how to explain the proper sorting results we got?

Is there somewhere an extensive list indicating which C char functions do
handle utf-8 properly and which ones do not (and as a result need to be
replaced with wide C functions to correctly manipulate utf-8 data)? That
would save us a lot of time since interpreting our test results is not in
fact that obvious.

Fran輟is


-- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/