
Re: copypage.pl



Bartosz Feński aka fEnIo:

> I didn't know that it reads .wmlrc file.

It will assume iso-8859-1 if it cannot read the .wmlrc file. Maybe
that should be changed.

> Ok... can you tell me which entities will be converted?

There is a list of the named entities in the script (look for
"@entities"), and then there is a compatibility list for the 8859
encodings, which says which of these entities can be converted back
(iso_8859_2_compat for iso-8859-2). Codepoints that have a 1 will be
converted back.
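
As a rough illustration of that lookup (not the code in copypage.pl,
which is Perl), here is a small Python sketch. The names ENTITIES,
ISO_8859_2_COMPAT and convert_entities are hypothetical, the tables are
cut down to four entries, and it assumes that "converted back" means
replacing the entity with the literal character:

```python
import re

# Hypothetical miniature of the named-entity list; the real "@entities"
# list in the script is much longer and may be laid out differently.
ENTITIES = {"eacute": 0xE9, "oslash": 0xF8, "aring": 0xE5, "szlig": 0xDF}

# Hypothetical compatibility list for iso-8859-2:
# 1 = the character exists in that encoding, so the entity is converted back.
ISO_8859_2_COMPAT = {"eacute": 1, "oslash": 0, "aring": 0, "szlig": 1}


def convert_entities(text, compat):
    """Replace &name; with the literal character where compat[name] is 1."""
    def repl(match):
        name = match.group(1)
        if name in ENTITIES and compat.get(name) == 1:
            return chr(ENTITIES[name])
        return match.group(0)  # unknown or incompatible: keep the entity
    return re.sub(r"&([A-Za-z0-9]+);", repl, text)


print(convert_entities("caf&eacute; &oslash;l &amp;", ISO_8859_2_COMPAT))
```

Entities without a 1 in the list, and entities that are not in the
list at all, are left untouched.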

> May I leave them non-converted?

Currently it will always convert the page, and it will convert any
non-entity characters as well, depending on your output encoding. This
is to help the non-iso-8859 translators.
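
To give an idea of what converting non-entity characters for a given
output encoding can look like, here is a Python sketch. It is not how
copypage.pl does it: the function name recode is hypothetical, the
input is assumed to be iso-8859-1, and Python's codecs and
html.entities table stand in for the script's own lists.

```python
from html.entities import codepoint2name


def recode(raw, out_encoding):
    """Re-encode iso-8859-1 bytes, using entities where the target lacks a character."""
    text = raw.decode("iso-8859-1")  # assumption: the source page is iso-8859-1
    out = []
    for ch in text:
        try:
            ch.encode(out_encoding)  # representable in the target encoding?
            out.append(ch)
        except UnicodeEncodeError:
            # Fall back to a named entity, or a numeric one if no name exists.
            name = codepoint2name.get(ord(ch))
            out.append("&%s;" % name if name else "&#%d;" % ord(ch))
    return "".join(out).encode(out_encoding)


print(recode(b"sm\xf8r och caf\xe9", "iso-8859-2"))
```

Here the "ø" has no place in iso-8859-2 and becomes an entity, while
the "é" is kept as a literal character.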

To make it do a binary copy, you will need to change the code in the
copypage subroutine: look for the comments "Decode" and "Encode" and
comment out the code there (the four s/// lines and the if clause). But
please note that this will leave you with iso-8859-1 pages, and some
pages may contain characters that are not valid in iso-8859-2.

-- 
\\//
Peter - http://www.softwolves.pp.se/
  I do not read or respond to mail with HTML attachments.


