[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: xterm utf8controls



> Example: Imagine, you have a shell script that tests whether a submitted
> UTF-8 message is not longer than 20 lines. With correct UTF-8 usage, "wc
> -l" does not have to be modified to process UTF-8 files, because all it
> does is counting LF characters (bytes) in the file.
> 
Unless somebody is using LS or PS rather than LF in their Unicode files :-)

This also presupposes (as UNIX itself does, in general, which I believe
to be a Good Thing) that "plain text" is "preformatted" -- as distinct from
the Microsoft idea of plain text, in which a "line" is really a "paragraph",
and assumes that all "plain text" is fed through some sort of "rendering
engine" for viewing by humans.

- Frank
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/