Re: Acroread: accelerating the search through a PDF



Hi there.
I believe I have some solutions to your problems. First of all,
you need to check whether your documents are in some kind of
structured format. If they are, say DocBook XML or something similar,
you may be able to find a quick solution to the searching problem. If
the documents are structured, you can probably parse them by entity
type. Of course, this depends on how well they are marked up. As
I've stated earlier, the key item here is to generate rapidly
searchable indexes that can be queried against. I'm assuming that
since you deal with highly technical data, it is more or less in a
structured form. You could even generate SQL statements and possibly
use SQLite if you don't want the overhead of a full DB. Anyway, I'm
more than willing to help in any way with this project of yours. Let
me know what you think.
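
To make the index idea concrete, here is a rough sketch in Python using
the standard sqlite3 module. It assumes the text of each page has
already been extracted somehow (for example with pdftotext from
poppler-utils); the table name "postings", the helper names
build_index and search, and the sample documents are all made up for
illustration, not taken from your setup.

```python
import re
import sqlite3


def build_index(conn, pages):
    """Store one row per distinct word per page.

    pages is an iterable of (document_name, page_number, page_text).
    The table and column names are hypothetical.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS postings ("
        "word TEXT NOT NULL, doc TEXT NOT NULL, page INTEGER NOT NULL, "
        "PRIMARY KEY (word, doc, page))"
    )
    for doc, page, text in pages:
        # Lowercase and split into alphanumeric tokens; a set drops repeats.
        words = set(re.findall(r"[a-z0-9]+", text.lower()))
        conn.executemany(
            "INSERT OR IGNORE INTO postings (word, doc, page) VALUES (?, ?, ?)",
            [(w, doc, page) for w in words],
        )
    conn.commit()


def search(conn, word):
    """Return (doc, page) pairs containing the word, sorted by doc and page."""
    rows = conn.execute(
        "SELECT doc, page FROM postings WHERE word = ? ORDER BY doc, page",
        (word.lower(),),
    )
    return rows.fetchall()


# Sample data standing in for text extracted from real PDFs.
conn = sqlite3.connect(":memory:")
build_index(conn, [
    ("manual.pdf", 1, "Torque settings for the main rotor"),
    ("manual.pdf", 2, "Rotor blade inspection intervals"),
    ("specs.pdf", 1, "Electrical specifications"),
])
hits = search(conn, "rotor")
```

The primary key on (word, doc, page) doubles as the lookup index, so a
query by word does not have to scan every page. Once you know which
document and page a term is on, you can jump straight there in
whichever reader you prefer.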
--Erik

On 5/29/10, Ron Johnson <ron.l.johnson@cox.net> wrote:
> On 05/29/2010 02:34 PM, Merciadri Luca wrote:
>> Ron Johnson wrote:
> [snip]
>>>
>>> Have you tried other PDF readers?  Searched for Linux-based PDF indexers?
>> As I said in another topic, I am totally okay for free stuff (if it was
>> not the case, I would not be using Debian: thinking unfree but using
>> free is cowardice), but the fact is that I have not found a reader whose
>> range of compatibility with the PDF standard is as high as in acroread.
>> Acroread is slow, boring, sometimes buggy, but I need to use it as long
>> as I do not find a PDF reader which has such a big compatibility range.
>
> Nothing says that you must only use one reader at a time. ;)
>
> If poppler, for example, doesn't render *exactly* but searches
> /rapidly/, then you could search using poppler and "read" using
> Acroread.
>
> Alternatively, install poppler-utils for its pdftohtml.  Certainly
> it won't be perfect, but a browser might be faster than Acroread.
>
> --
> Dissent is patriotic, remember?
>