[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#440420: [PROPOSAL] Manual page encoding



On Tue, Sep 04, 2007 at 01:35:48PM +0100, Colin Watson wrote:
> On Tue, Sep 04, 2007 at 02:04:32PM +0200, Jens Seidel wrote:
> > Is it save to use UTF-8 characters if a very similar character exists in
> > ASCII or can be expressed using groff macros? Think about the many
> > dashes which exist in typography. Is it OK to use a UTF-8 hyphen sign
> > instead of \(hy (same for en-dash, em-dash, ...) especially as the
> > ordinary minus "-" is very similar in the output?
> 
> > Of course there exist transliterations of all these characters I'm
> > currently talking about but it would probably make the live easier to
> > restrict to ASCII if possible, right?
> 
> I do appreciate that there are a few gotchas here. I think it is unduly
 
> So I'd really rather just support plain
> UTF-8 input for alphanumerics, which I think will actually get used.
> 
> Do you think we will need explicit language in policy for this? For the

Ah, no. But it should be documented somewhere and I wondered about this
after reading again your proposed patch (and the further info).

> This seems like a
> reasonable thing to document after man-db 2.5.0, and would cover things
> like UTF-8 hyphen characters.

Right. Without documentation every maintainer could now start fine tuning
man pages using all the stuff provided by Unicode ...
 
> In general, I think it's worthwhile for policy to make comments on
> encoding for purposes of interoperability and standardisation, but I'd
> be inclined to draw the line at filling it up with instructions on how
> to use groff correctly. Does this sound reasonable?

Yes, it does.

Thanks,
Jens



Reply to: