Re: Automatic encoding guessing
"H. Peter Anvin" wrote:
>
> Followup to: <20011023131533.A2406@xxxxxxxxxxxxxxxxxxxxxxxxxx>
> By author: David Starner <dstarner98@xxxxxxxxxxxxx>
> In newsgroup: linux.utf8
> >
> > On Tue, Oct 23, 2001 at 11:05:45AM -0700, H. Peter Anvin wrote:
> > > > - ISO 8859 files should be free of C1 and most C0 codes (except
> > > > for the usual LF/TAB).
> > >
> > > I have also had Emacs 20 garble data because of the above assumption
> > > :(
> >
> > What were you editing? Most C0 codes (other than CR/LF/TAB/FF/BS/VT)
> > and the C1 codes are basically binary garbage; an ISO-8859-* document
> > that contains them is really some kind of rich text or binary format.
> >
>
> Files with control codes as markup. More common than you seem to
> think.
For something like that, I could certainly accept that Emacs could not
guess the encoding. But I would rather see it ask about unusual cases
instead of guessing wrong.
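
To make the "ask instead of guessing wrong" idea concrete, here is a minimal sketch in Python (not Emacs Lisp, and not how Emacs actually detects coding systems). The function name guess_encoding, the return labels, and the exact set of tolerated C0 codes (taken from the CR/LF/TAB/FF/BS/VT list quoted above) are assumptions for illustration only. It treats unusual C0 codes, or C1-range bytes in a non-UTF-8 file, as a reason to ask the user.

```python
# Hypothetical sketch: names and return labels are illustrative only.

# C0 codes tolerated in plain text: TAB, LF, CR, FF, BS, VT.
ALLOWED_C0 = frozenset(b"\t\n\r\f\b\v")


def guess_encoding(data: bytes) -> str:
    """Return 'utf-8', 'iso-8859', or 'ask' (meaning: prompt the user)."""
    # Unusual C0 control codes suggest markup or binary data, so do not guess.
    if any(b < 0x20 and b not in ALLOWED_C0 for b in data):
        return "ask"

    # Valid UTF-8 (which includes pure ASCII) is accepted as such.
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        pass

    # In an 8-bit file, bytes 0x80-0x9F would be C1 codes under ISO 8859.
    if any(0x80 <= b <= 0x9F for b in data):
        return "ask"

    # Which ISO 8859 part applies cannot be decided from the bytes alone.
    return "iso-8859"


if __name__ == "__main__":
    print(guess_encoding("café\n".encode("latin-1")))
    print(guess_encoding(b"\x1b[1mbold\x1b[0m\n"))
```

A file that uses control codes as markup, like the ESC sequences in the second call, lands in the "ask" branch rather than being silently decoded under a wrong assumption.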
--
D. Dale Gulledge, Sr. Programmer,
dsplat@xxxxxxxxxxxxxxxx
C, C++, Perl, Unix (AIX, Linux), Oracle, Java,
Internationalization (i18n), Awk.
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/