Re: How to detect the encoding of a string?



From: Danilo Segan <dsegan@xxxxxxx>

> > If you would like to experiment with your own ZIP application,
> > try
> > http://www.thranio.gr/sxolikes-giortes/telikes/omilies/apoxairetisthrio-logos-mathith.zip
> > The filename is encoded in CP737 (a la iconv). All open-source ZIP
> > tools (=unzip, file-roller, ark) fail to detect the encoding.
> > WinZip is able to detect the encoding.
> 
> My guess is that WinZip is running on a Greek Windows system, and that
> WinZip uses the old IBM encodings for i18n names there, assuming CP737
> on a Greek system.
> 
> Can you confirm or dispute my assumption (e.g. by trying on a non-Greek
> Windows system, or just confirming that this was actually attempted on
> a non-Greek system)?

I opened the ZIP file with the WinXP built-in ZIP handler and it extracted the file as
"Ǻª«ÿá¿£½á¬½p¿áª¬ óµÜª¬ úÿ?P½p.doc"  (trash; it may look different when you read this
mail, as my mail program is pretty braindead regarding encodings).
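
For anyone who wants to experiment: garbage like this typically comes from decoding
the stored filename bytes with the wrong code page. Below is a minimal Python sketch of
undoing such a mis-decode, assuming the reader guessed CP437 while the bytes were really
CP737. The function name and the sample filename are made up for illustration, and nothing
here detects the encoding; it only re-decodes once you already suspect which one was used.

```python
def redecode_zip_name(name, wrong="cp437", right="cp737"):
    """Undo a wrong decode: map the text back to bytes, then decode again.

    'wrong' is the code page the reader assumed, 'right' is the one we
    suspect the archive was really written with. Both are assumptions.
    """
    return name.encode(wrong).decode(right)


# Simulate a reader that assumed CP437 for a name stored as CP737.
original = "ΛΟΓΟΣ.doc"  # hypothetical Greek filename
mangled = original.encode("cp737").decode("cp437")
recovered = redecode_zip_name(mangled)
```

This only works when the wrong decode was lossless, that is, when every byte mapped to
some character in the assumed code page. A name that already contains "?" replacement
characters cannot be recovered this way.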

Regards,
David



--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/