[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Displaying malformed UTF-8 sequences in an editor
Bram Moolenaar wrote on 2000-07-24 10:34 UTC:
> Try out the new Vim version 6.0c. It keeps malformed sequences. Displaying
> them isn't working well though. I could use some suggestions on how to do
> that. Perhaps it's best to display each malformed byte with a special
> character?
I would expect an editor to treat bytes of malformed UTF-8 sequences
just like an ASCII editor treats upper half ISO 8859 characters. One
very common convention is to represent them as a backslash followed by
three octal digits as in \377. "Less" writes <9C> in inverse, which is
probably nicer. Something like that (hex is typically far more useful
than octal).
Markus
--
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org, WWW: <http://www.cl.cam.ac.uk/~mgk25/>
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/