Bug#472988: (no subject)

Subject: html2ps: UTF-8 Web pages result in strange characters, recode to latin1 works
Package: html2ps
Version: 1.0b5-5
Severity: normal

I used wget to download a set of UTF-8 pages and then ran html2ps on index.html.  The output contained strange characters, which made it unusable.  Running 'find * -name '*.html' -print0 | xargs -0 recode utf8..latin1' on the downloaded files fixed the problem (the -0 is needed so that xargs reads the NUL-separated names produced by -print0).  Since UTF-8 is now the default encoding on Debian, html2ps should handle UTF-8 input directly.
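
For anyone who hits the same problem without recode installed, the workaround can probably be reproduced with iconv. The following is only a sketch under assumptions: the function name convert_tree_to_latin1 is made up for illustration, and it assumes every *.html file in the tree really is UTF-8 and that the local iconv accepts the codeset names UTF-8 and ISO-8859-1. Files containing characters that latin1 cannot represent are left untouched and reported on stderr.

```shell
#!/bin/sh
# Hypothetical helper (not part of html2ps): convert every *.html file
# under a directory from UTF-8 to ISO-8859-1 (latin1) in place.
convert_tree_to_latin1() {
    dir=${1:-.}
    # -exec ... {} + passes file names directly as arguments, so names
    # with spaces or newlines are handled without -print0 / xargs -0.
    find "$dir" -type f -name '*.html' -exec sh -c '
        for f in "$@"; do
            tmp="$f.latin1.tmp"
            # Write to a temporary file first; only replace the original
            # if iconv converted the whole file successfully.
            if iconv -f UTF-8 -t ISO-8859-1 "$f" > "$tmp"; then
                mv "$tmp" "$f"
            else
                rm -f "$tmp"
                echo "skipped (conversion failed): $f" >&2
            fi
        done
    ' sh {} +
}
```

Calling 'convert_tree_to_latin1 ./mirror' (where ./mirror stands for the wget download directory) before running html2ps should have the same effect as the recode pipeline above. It rewrites the files in place, so it is best run on a copy of the download.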

-- System Information:
Debian Release: lenny/sid
  APT prefers testing
  APT policy: (500, 'testing'), (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 2.6.24-1-686 (SMP w/1 CPU core)
Locale: LANG=en_CA.UTF-8, LC_CTYPE=en_CA.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages html2ps depends on:
ii  libhtml-parser-perl    3.56-1            A collection of modules that parse
ii  libpaper-utils         1.1.23            library for handling paper charact
ii  libwww-perl            5.808-1           WWW client/server library for Perl
ii  perl                   5.8.8-12          Larry Wall's Practical Extraction 
ii  perlmagick             7: Perl interface to the libMagick gr

Versions of packages html2ps recommends:
ii  gs-gpl                   8.56.dfsg.1-1.1 The GPL Ghostscript PostScript int

-- no debconf information
