Hi Charles, From: Charles Plessy Subject: Encoding a manpage in unicode ? Date: Sat, 31 Mar 2007 12:35:24 +0900 > I wanted to encode some of the manpages I wrote in Unicode, > but although xsltproc converts them fine, nroff is not able to pipe > something correct to less, and french accents or chinese characters > are not correctly displayed. Is there something to do, or is nroff > simply not unicode compliant? Few months ago I reported some wrongly-encoded manpages and created patches to fix them. Please read my mail[1] to debian-i18m that reported the issue and see bug reports described in the mail, such as Bug#391061. Also, if you can read Japanese, see a discussion[2] on Japanese manpages in Debian-JP's mailing list. Briefly, since ancient manpages do not have encoding information in itself, each parent directory of translated manpages has a default input manpage encoding, which is defined in src/encodings.c (i.e. hard-coded) of the man-db program (e.g. Japanese manpages under /usr/share/man/ja are all assumed to be EUC-JP-encoded files). So, althogh UTF-8 is now popular, it cannot be used as a manpage encoding for languages that have used different encodings, like Japanese that have used EUC-JP and European languages that have used latin-1. This issue is very troublesome and I hope it will be squashed in lenny... [1] http://lists.debian.org/debian-i18n/2006/11/msg00119.html [2] http://lists.debian.or.jp/debian-users/200610/msg00008.html Many thanks, -nori
Attachment:
pgpG1Wpr9_sX8.pgp
Description: PGP signature