Re: xterm utf8controls
> Example: Imagine you have a shell script that tests whether a submitted
> UTF-8 message is no longer than 20 lines. With correct UTF-8 usage,
> "wc -l" does not have to be modified to process UTF-8 files, because all
> it does is count LF characters (bytes) in the file.
>
Unless somebody is using LS (U+2028) or PS (U+2029) rather than LF in their
Unicode files :-)
This also presupposes (as UNIX itself generally does, which I believe to be
a Good Thing) that "plain text" is "preformatted" -- as distinct from the
Microsoft idea of plain text, in which a "line" is really a "paragraph" and
all "plain text" is assumed to be fed through some sort of "rendering
engine" before humans view it.
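
To make the LS/PS caveat concrete, here is a minimal Python sketch. It does
not call the real wc; it only mimics "wc -l" by counting 0x0A bytes, and the
function names and sample messages are made up for illustration.

```python
# Hypothetical illustration: byte-level LF counting vs. Unicode-aware counting.
LS = "\u2028"  # LINE SEPARATOR
PS = "\u2029"  # PARAGRAPH SEPARATOR


def count_lf_bytes(data: bytes) -> int:
    """Count 0x0A bytes, which is roughly what `wc -l` does."""
    return data.count(b"\n")


def count_unicode_lines(data: bytes) -> int:
    """Decode as UTF-8 and count LF, LS and PS as line terminators."""
    text = data.decode("utf-8")
    return sum(text.count(sep) for sep in ("\n", LS, PS))


# Three lines terminated by LF. Every byte of a multi-byte UTF-8 sequence
# is >= 0x80, so none of them can be mistaken for LF (0x0A).
lf_message = "grüße\nπ ≈ 3.14\n日本語\n".encode("utf-8")

# The same three lines, but terminated by LS/PS instead of LF.
ls_message = f"grüße{LS}π ≈ 3.14{PS}日本語{LS}".encode("utf-8")

print(count_lf_bytes(lf_message), count_unicode_lines(lf_message))  # 3 3
print(count_lf_bytes(ls_message), count_unicode_lines(ls_message))  # 0 3
```

The byte counter is correct for the LF-terminated message without knowing
anything about UTF-8, but it reports zero lines for the LS/PS-terminated one,
so a 20-line limit check built on it would let that message through.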
- Frank
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/