Am Mittwoch 31 März 2010 schrieb Michael Schuerig: > On Wednesday 31 March 2010, Carsten Pfeiffer wrote: > > Am Dienstag, 30. März 2010 schrieb Michael Schuerig: > > > Now, I'm wondering, is this something I ought to report as a bug > > > against strigi or is the problem with Nepomuk for not logging > > > abnormal termination of child processes? Or is it pdftotext for > > > apparently producing invalid UTF-8 from a PDF (iconv doesn't > > > complain about it, though)? > > > > All of the above ;-) > > > > I'd say that > > - nepopmuk or strigi should notice that it crashed on a file and put > > it into some blacklist until its mtime changes > > - strigi should keep on indexing the other files instead of > > restarting - pdftotext as the originator of the file ought to be > > fixed > > Done. > > https://sourceforge.net/tracker/?func=detail&aid=2979889&group_id=17100 > 0&atid=856302 https://bugs.kde.org/show_bug.cgi?id=232814 > > I'm not completely certain that pdftotext really does anything wrong. See also these two of my Nepomuk related bug reports: https://bugs.kde.org/show_bug.cgi?id=232395 https://bugs.kde.org/show_bug.cgi?id=232398 It seems you are also stumbling over an UTF-8 issue, but it seems a different one than me. But I am not sure. -- Martin 'Helios' Steigerwald - http://www.Lichtvoll.de GPG: 03B0 0D6C 0040 0710 4AFA B82F 991B EAAC A599 84C7
Description: This is a digitally signed message part.