[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

How to read mail with &#nnnn



Sometimes I receive mail in 

  Content-Type: text/html;
	charset="iso-8859-1"
  Content-Transfer-Encoding: quoted-printable

with &#nnnn codes for those characters that can't
be represented in iso-8859-1.  The following
example is a piece of Vietnamese text.  I know
very little about encodings, but it looks to me
like the text consists of ASCII, "=XX" (quoted
printable), and "&#nnnn;" (decimal Unicode).

  <P class=3Dnormal>B=E0i n=E0y &#273;<FONT face=3D"Times New =
  Roman">&#259;</FONT>ng kh=E1 l=E2u tr=EAn t&#7901;=20
  b=E1o b&#7841;n. &#272;&#7895; th=F4ng Minh th&#7853;t s&#7921; kh=F4ng =
  xa l&#7841; g=EC v&#7899;i ch=FAng t=F4ị Anh t&#7915; Nh&#7853;t khi=20
  &#273;<FONT face=3D"Times New Roman">&#7871;n Hoa th&#7883;nh =
  &#272;&#7889;n th&#432;&#7901;ng &#273;&#7871;n nh=E0</FONT> ch=FAng =
  t=F4i=20

The mail is sent by MS Outlook Express and can be
read with same.

I've set things up to read and write utf-8, but my
setup (emacs 21.2, gnus 5.9.0) can't read the
above.  Is this "&#nnnn;" thing a generally
recognized format?

This text/html section is preceded by a text/plain
section.  The difference between them is that the
plain section contains "?" where the html section
contains "&#nnnn;".


Any recommendations will be appreciated.

-- 
Lam Dang
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/