[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Unicode, character ambiguities



Hi,

At Thu, 10 Jan 2002 04:46:42 -0500,
Glenn Maynard wrote:

> By the way, I notice this page contains "mappings to major standards"
> including Big5 and GB 2312.  Also, JIS, and I assume there's established
> mappings between JIS and EUC-JP, and probably for the other languages,
> too.
> 
> "The mappings to major standards have been exhaustively proofed and are
> a normative part of Unicode." (http://www.unicode.org/charts/unihan.html)
> 
> "A normative part of Unicode" sounds very authoritative; am I missing
> something?

CJK standards such as JIS X 0208, JIS X 0212, GB 2312, KS X 1001,
and Big5 contains not only Han Ideographs but also various characters
such as Hiragana, Katakana, Hangul, Latin Alphabets, Greeks, Cyrillics,
punctuations, other symbols (like square, triangle, circle, dagger, ...).

For Han Ideographs, a unique authorative mapping table is available.
For other parts , no authorative mapping tables are available.

---
Tomohiro KUBOTA <kubota@xxxxxxxxxx>
http://www.debian.or.jp/~kubota/
"Introduction to I18N"  http://www.debian.org/doc/manuals/intro-i18n/
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/