[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#410012: html2ps: Unnecessary repetition of author metadata



Package: html2ps
Version: 1.0b5-1
Severity: wishlist

Consider the following transcript:

    $ cat [123].xhtml
    <html xmlns="http://www.w3.org/1999/xhtml";><head><title>1</title><meta name="Author" content="Foo" /></head><body /></html>
    <html xmlns="http://www.w3.org/1999/xhtml";><head><title>2</title><meta name="Author" content="Foo" /></head><body /></html>
    <html xmlns="http://www.w3.org/1999/xhtml";><head><title>3</title><meta name="Author" content="Bar" /></head><body /></html>
    $ html2ps [123].xhtml | ps2pdf - tmp.pdf
    $ pdfinfo tmp.pdf | grep ^Author:
    Author:         Foo + Foo + Bar

html2ps generates author metadata by simply concatenating the author
metadata of each source document (interspersed with " + ").  When
multiple documents are by the same author, this is unnecessarily
repetetive.

Consider the case when a large document is split into 80 separate HTML
documents.  Instead of "Author: Trent Buck", the resulting PDF will
contain "Author: Trent Buck + Trent Buck + ... + Trent Buck".

html2ps should accumulate a list of author metadata values as it reads
in each source document, and only add an element to the list if there
is not already an identical element in the list.



Reply to: