[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Unicode, character ambiguities
> Saying about round-trip compatibility, yes, round-trip compatibility
> for EUC-JP, EUC-KR, Big5, GB2312, GBK are guaranteed, i.e., Unicode
> is a superset of these encodings (character sets). However,
> (1) there are no authorative mapping tables between these encodings
> and Unicode and there are various private mapping tables. This
> can cause portability problem around round-trap compatibility.
On Thu, Jan 10, 2002 at 06:18:10PM +0900, Tomohiro KUBOTA wrote:
> For reference of glyph, I am using
> http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=f9b1
> and so on. Otherwise, displayed glyphs depend on system and
> we cannot discuss about same glyph.
By the way, I notice this page contains "mappings to major standards"
including Big5 and GB 2312. Also, JIS, and I assume there's established
mappings between JIS and EUC-JP, and probably for the other languages,
too.
"The mappings to major standards have been exhaustively proofed and are
a normative part of Unicode." (http://www.unicode.org/charts/unihan.html)
"A normative part of Unicode" sounds very authoritative; am I missing
something?
--
Glenn Maynard
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/