[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: multilingual man pages
> David Starner writes:
>
>> What has to be done with less? If LESSCHARSET=utf-8, it should handle
>> utf-8 correct already.
LESSCHARSET is long obsolete and should not be used. Less-358 tests
LANG/LC_CTYPE/LC_ALL for the UTF-8 substring and activates its UTF-8 mode
accordingly. Should be changed to nl_langinfo(CODESET) eventually. Robert
Brady posted here ages ago a patch to fix a less-358 bug with
bold/underlined non-ASCII characters in UTF-8 mode, which I understand
both SuSE 7.3 and Red Hat <whatever> have applied in their latest
versions. Less has not been updated in over a year ...
It's all in the FAQ:
http://www.cl.cam.ac.uk/~mgk25/unicode.html#apps
On Red Hat 7.1, you had to change a line in /etc/man.config to
NROFF /usr/bin/nroff -mandoc
in order to get man pages working in UTF-8 locales. SuSE
7.2 used the mandb package that caches preformatted man pages in a
locale-independent way and thus man couldn't be used there in a mixed
Latin-1/UTF-8 environment.
Groff works fine now in UTF-8 locales and can produce the entire
PostScript standard repertoire in UTF-8 plaintext. Try man groff_char for
a good demo.
Markus
--
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org, WWW: <http://www.cl.cam.ac.uk/~mgk25/>
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/