Re: Full paper-to-bibliography toolchain

To: Mark Voorhies <mvoorhie@yahoo.com>, kanzure@gmail.com
Cc: debian-science@lists.debian.org
Subject: Re: Full paper-to-bibliography toolchain
From: Bryan Bishop <kanzure@gmail.com>
Date: Fri, 3 Apr 2009 17:16:14 -0500
Message-id: <[🔎] 55ad6af70904031516r5dd1c395h9225f64ae24db370@mail.gmail.com>
In-reply-to: <[🔎] 200904031018.12934.mvoorhie@yahoo.com>
References: <55ad6af70903141941s6ae5f129q2bc7554b521a87cf@mail.gmail.com> <[🔎] 200904031018.12934.mvoorhie@yahoo.com>

On Fri, Apr 3, 2009 at 12:18 PM, Mark Voorhies <mvoorhie@yahoo.com> wrote:
> On Saturday 14 March 2009 7:41 pm Bryan Bishop wrote:
>> Hi all,
>>
>> This email comes about because of the recent thread about bibliography
>> management. In particular, I've always had my eye out for what sort of
>> software should (or should not) exist for scientific papers.
>
> cb2bib can extract BibTeX references from a set of pdf files with or without
> user assistance (e.g., in the supervised mode, cb2bib guesses journal,
> volume, title, etc. and provides a window where the user can select
> (pdftotext generated) text and assign it to appropriate fields).

I looked over that a few weeks ago, but I'm not entirely sure about
it. Can you please explain whether or not it performs the following?
The website is not clear.

(1) Given a PDF that essentially consists of a collection of images
(scanned data), will it segment the page, extract text, and figure out
what the title of the paper is and the citation information (etc.), or
will it extract the references?

(2) In unsupervised mode, does it automatically extract references and
guess to which fields the information belongs to?

(3) Does it extract BibTeX encoded in PDF files, or does it extract
the PDF-encoded content (i.e. which may or may not preserve BibTeX
markup)?

Thank you! :-)

- Bryan
http://heybryan.org/
1 512 203 0507

Reply to:

References:
- Re: Full paper-to-bibliography toolchain
  - From: Mark Voorhies <mvoorhie@yahoo.com>

Prev by Date: Re: Full paper-to-bibliography toolchain
Next by Date: Re: debian-science_0.5_i386.changes ACCEPTED
Previous by thread: Re: Full paper-to-bibliography toolchain
Next by thread: Re: Bug#522560: ITP: bist -- chemical drawing tool
Index(es):
- Date
- Thread