How to read mail with &#nnnn
Sometimes I receive mail sent as
Content-Type: text/html;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
with "&#nnnn;" codes for those characters that can't
be represented in iso-8859-1. The following
example is a piece of Vietnamese text. I know
very little about encodings, but it looks to me
like the text consists of ASCII, "=XX"
(quoted-printable), and "&#nnnn;" (decimal
Unicode code points).
<P class=3Dnormal>B=E0i n=E0y đ<FONT face=3D"Times New =
Roman">ă</FONT>ng kh=E1 l=E2u tr=EAn tờ=20
b=E1o bạn. Đỗ th=F4ng Minh thật sự kh=F4ng =
xa lạ g=EC với ch=FAng t=F4ị Anh từ Nhật khi=20
đ<FONT face=3D"Times New Roman">ến Hoa thịnh =
Đốn thường đến nh=E0</FONT> ch=FAng =
t=F4i=20
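
The three layers described above can be undone in order. Below is a minimal
Python sketch, assuming the body really is quoted-printable iso-8859-1 with
decimal "&#nnnn;" references. The function name decode_body and the sample
string are illustrative and not taken from the original mail.

```python
import html
import quopri


def decode_body(raw: bytes, charset: str = "iso-8859-1") -> str:
    """Decode a quoted-printable body that also carries &#nnnn; references."""
    # Step 1: undo quoted-printable ("=E0" becomes byte 0xE0; a trailing "="
    # at the end of a line is a soft line break and is removed).
    octets = quopri.decodestring(raw)
    # Step 2: interpret the resulting bytes in the declared charset.
    text = octets.decode(charset)
    # Step 3: expand character references such as "&#273;" (U+0111).
    # html.unescape also expands named references like "&amp;".
    return html.unescape(text)


# Illustrative fragment in the same style as the mail body.
sample = b"B=E0i n=E0y &#273;&#259;ng kh=E1 l=E2u"
print(decode_body(sample))
```

This only decodes characters; any HTML tags such as <P> or <FONT> are left in
place and would still need to be rendered or stripped separately.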
The mail was sent by MS Outlook Express and can be
read with the same program.
I've set things up to read and write utf-8, but my
setup (Emacs 21.2, Gnus 5.9.0) can't display the
above. Is this "&#nnnn;" thing a generally
recognized format?
This text/html section is preceded by a text/plain
section. The difference between them is that the
plain section contains "?" where the html section
contains "&#nnnn;".
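
Since the "?" in the plain section has already lost the original characters,
only the html section can be recovered. As a rough sketch of how a reader
could pick that section out of the two alternatives, here is a Python example
using the standard email package. The function name html_part_text is
hypothetical, and it assumes a well-formed multipart/alternative message.

```python
import html
from email import message_from_bytes, policy


def html_part_text(raw_message: bytes) -> str:
    """Return the text/html alternative with character references expanded."""
    # policy.default makes the parser return an EmailMessage, which
    # provides get_body() and get_content().
    msg = message_from_bytes(raw_message, policy=policy.default)
    part = msg.get_body(preferencelist=("html",))
    if part is None:
        raise ValueError("no text/html part found")
    # get_content() undoes the transfer encoding (e.g. quoted-printable)
    # and decodes the bytes using the part's declared charset.
    return html.unescape(part.get_content())
```

The result still contains the HTML markup; only the transfer encoding, the
charset, and the "&#nnnn;" references have been resolved.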
Any recommendations will be appreciated.
--
Lam Dang
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/