Rene Engelhard wrote:
Greg Kochanski wrote:Package: openoffice.org-common Version: 2.0.4.dfsg.2-7 Severity: normal If you make an OO Impress document that contains an image with a superposed text box, then export it either to PDF or to HTML, the resulting document does not contain the text. (pdf2text, grep, etc find nothing, nor does Google.)Because upstream claimed in the issue I filed (since you didn't want to do that....) that it works (http://www.openoffice.org/issues/show_bug.cgi?id=77679) so I just tried it myself: That's not true. pdftotext creates a textfile with Foo and Bar in it, as it should with a slide cotaining Foo and Bar and a image (a rectangle)
Well, try http://kochanski.org/gpk/papers/2006/200607google-annotated.pdf which was produced just recently from http://kochanski.org/gpk/papers/2006/200607google-annotated.odp . That has text boxes, and the PDF gives exactly no text: $ pdftotext 200607google-annotated.pdf $ Possibly there is some difference in the settings used in the PDF production? I did "tagged PDF", but otherwise just used the defaults. I'm pretty sure that untagged PDF has the same problem. Possibly you used a geometric figure, rather than a JPEG image as I did?
For HTML, you seem to be right.