
Bug#567781: Conversion of english pages to Unicode, via HTML entities.



On Mon, May 16, 2011 at 07:34:59PM +0200, Simon Paillard wrote:
> On Sun, May 15, 2011 at 10:24:48PM +0900, Charles Plessy wrote:
> > 
> > would it be welcome if I would start to replace iso-8859-1 characters
> > by HTML entities using smart-change for the english language, in order
> > to ease conversion to Unicode ?  As of today, there would be this
> > number of files changed in the following directories.
> [..]
> 
> No, I would even advise the opposite: convert the remaining entities to the
> encoding used by each language.

Entities can be removed after the conversion, and I can help with this as well.

I would like the English pages to be converted to Unicode, and offered my help
a couple of months ago.  I proposed to first go to the common denominator of
iso-8859-1 and Unicode, which is ASCII plus entities, then to switch
encoding, and finally to remove the entities.
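
To make the three steps concrete, here is a minimal sketch in Python.  It is
only an illustration, not the smart-change tooling itself: the function names
latin1_to_entities and entities_to_unicode are hypothetical, and it assumes
the input files are plain iso-8859-1.

```python
import html
import re
from html.entities import codepoint2name

# Matches named, decimal and hexadecimal character references.
_ENTITY = re.compile(r"&(?:#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")


def latin1_to_entities(raw: bytes) -> str:
    """Step 1: decode iso-8859-1 bytes and turn every non-ASCII
    character into an HTML entity, leaving pure ASCII behind.
    C1 control bytes (0x80-0x9F) are not treated specially here."""
    out = []
    for ch in raw.decode("iso-8859-1"):
        cp = ord(ch)
        if cp < 128:
            out.append(ch)
        elif cp in codepoint2name:
            out.append("&" + codepoint2name[cp] + ";")
        else:
            out.append("&#%d;" % cp)
    return "".join(out)


def entities_to_unicode(text: str) -> str:
    """Step 3: replace entities that stand for non-ASCII characters
    with the characters themselves.  Entities that decode to ASCII
    (such as &amp; or &lt;) are kept, since they are markup-significant."""
    def repl(match):
        decoded = html.unescape(match.group(0))
        if len(decoded) == 1 and ord(decoded) > 127:
            return decoded
        return match.group(0)
    return _ENTITY.sub(repl, text)


raw = "caf\xe9 &amp; cr\xe8me".encode("iso-8859-1")
ascii_text = latin1_to_entities(raw)

# Step 2: an ASCII-only file has the same bytes in both encodings,
# so switching the declared encoding does not change the content.
same_bytes = ascii_text.encode("iso-8859-1") == ascii_text.encode("utf-8")

final_text = entities_to_unicode(ascii_text)
```

Because the intermediate text is pure ASCII, each file can go through step 1
on its own schedule, and the encoding switch in step 2 does not alter any
file's bytes.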

I sent this to http://bugs.debian.org/567781#77 and I thought it was accepted
by the WWW team after discussion on IRC:

http://meetbot.debian.net/debian-www/2011/debian-www.2011-02-15-21.30.html 

The advantage of this proposal is that the work can be distributed over
time and among people.

What are the other plans?  If the plan is a massive overnight transition,
then given my timezone, you can probably count me out…

Have a nice day,

-- 
Charles Plessy
Debian Med packaging team,
http://www.debian.org/devel/debian-med
Tsurumi, Kanagawa, Japan


