Bug#410012: html2ps: Unnecessary repetition of author metadata

To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: Bug#410012: html2ps: Unnecessary repetition of author metadata
From: Trent Buck <trentbuck@gmail.com>
Date: Wed, 07 Feb 2007 13:32:03 +1100
Message-id: <[🔎] 20070207023203.15680.66682.reportbug@twb.ath.cx>
Reply-to: Trent Buck <trentbuck@gmail.com>, 410012@bugs.debian.org

Package: html2ps
Version: 1.0b5-1
Severity: wishlist

Consider the following transcript:

    $ cat [123].xhtml
    <html xmlns="http://www.w3.org/1999/xhtml";><head><title>1</title><meta name="Author" content="Foo" /></head><body /></html>
    <html xmlns="http://www.w3.org/1999/xhtml";><head><title>2</title><meta name="Author" content="Foo" /></head><body /></html>
    <html xmlns="http://www.w3.org/1999/xhtml";><head><title>3</title><meta name="Author" content="Bar" /></head><body /></html>
    $ html2ps [123].xhtml | ps2pdf - tmp.pdf
    $ pdfinfo tmp.pdf | grep ^Author:
    Author:         Foo + Foo + Bar

html2ps generates author metadata by simply concatenating the author
metadata of each source document (interspersed with " + ").  When
multiple documents are by the same author, this is unnecessarily
repetetive.

Consider the case when a large document is split into 80 separate HTML
documents.  Instead of "Author: Trent Buck", the resulting PDF will
contain "Author: Trent Buck + Trent Buck + ... + Trent Buck".

html2ps should accumulate a list of author metadata values as it reads
in each source document, and only add an element to the list if there
is not already an identical element in the list.

Reply to:

Prev by Date: Bug#410011: html2ps: Honour PAPERSIZE environment variable
Next by Date: Processing of cppopt_0.0.2-6_i386.changes
Previous by thread: Bug#410011: marked as done (html2ps: Honour PAPERSIZE environment variable)
Next by thread: Processing of cppopt_0.0.2-6_i386.changes
Index(es):
- Date
- Thread