[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Unicode, character ambiguities
On Thursday 10 January 2002 02:17 am, you wrote:
> On Wed, Jan 09, 2002 at 03:30:57AM -0500, Glenn Maynard wrote:
> > My suggestion, in the case of Ogg tags, was to add a LANG
> > (renamed to UTF8_LANG) tag, indicating the font language the tags
> > should be displayed in (unless overridden). This was also added
> > to the proposal. Japanese users could tell their viewer to
> > ignore this tag and always use a Japanese font for CJK text.
>
> Hmm. Looks like Unicode language tags are a much better solution.
Unicode language tags are heavily deprecated. Language tagging is
markup, and there is no point pretending you have plain text when you
mark languages.
If you want tagging in plain text, use a standard. As far as I can
tell, the best available standard for such things is XML, which
defines Unicode as its preferred character set.
I see no reason to encode language in Ogg tags. Users should be able
to choose a Unicode fontset that suits their needs for displaying all
languages.
--
Edward Cherlin
edward@xxxxxxxxxxxxxxxx
Does your Web site work?
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/