[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: manpage character cleanup for UTF-8 compatibility



On Sun, Apr 06, 2003 at 09:03:31PM +0100, Colin Watson wrote:
> On Tue, Mar 25, 2003 at 04:01:51PM -0800, Vineet Kumar wrote:
> > Using a UTF-8 locale, I've been finding many manpages using incorrect
> > characters.  Groff converts many of these characters to reasonable
> > characters in ASCII locales, but some things break in UTF-8 locales.
> 
> One other thought has occurred to me while working on fixing certain
> parts of man-db's locale support. Sooner or later, when groff 2 is
> released (but not beforehand!), we're going to have to move towards
> having all man pages encoded in UTF-8. For most languages this probably
> isn't too bad: you just use de_DE.UTF-8 rather than de, or whatever
> (although I'm not sure how that'd work for languages with multiple
> regional variants). It's going to be a royal pain for English, though,
> because currently we just put things directly in /usr/share/man, meaning
> the C locale, and there's no C.UTF-8, probably for good reasons.
> en_US.UTF-8 would be a poor choice because we also need en_GB.UTF-8 and
> so on.

AIUI, only ASCII is valid in the C locale anyway. Setting the top bit
is an error.

-- 
  .''`.  ** Debian GNU/Linux ** | Andrew Suffield
 : :' :  http://www.debian.org/ | Dept. of Computing,
 `. `'                          | Imperial College,
   `-             -><-          | London, UK

Attachment: pgpqgIEdjbGax.pgp
Description: PGP signature


Reply to: