Re: PDF rendering/extraction involving indic scripts

Hi Mahendra,

Quoting Mahendra Bhandwalkar (2017-01-11 05:13:31)
> is there is any tool in Linux that will read lines from marathi 
> language pdf file and store in mysql Table?

I am unaware of any unified tool specifically doing that.  I recommend 
using the classic Unix approach of combining multiple single-purpose 
tools - one to extract the text from the PDF, one to split it into 
lines, and one to inject them into MySQL - and using shell pipes to tie 
the commands together.
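
As a rough sketch of that pipeline (not a tested recipe): the middle 
step could be a small filter that reads extracted text on stdin and 
writes one SQL INSERT per non-blank line on stdout.  Python is used 
here purely for illustration.  The table name pdf_lines and its 
columns line_no and content are hypothetical, and the quoting assumes 
MySQL's default sql_mode, where backslash is an escape character.

```python
#!/usr/bin/env python3
"""Turn lines of text on stdin into SQL INSERT statements on stdout."""
import sys

# Hypothetical table; it must already exist, e.g. with columns
# line_no INT and content TEXT, using the utf8mb4 character set.
TABLE = "pdf_lines"


def sql_quote(text):
    """Quote a string as a MySQL literal (assumes default sql_mode)."""
    # Escape backslashes first, then single quotes.
    return "'" + text.replace("\\", "\\\\").replace("'", "\\'") + "'"


def lines_to_sql(lines, table=TABLE):
    """Yield one INSERT statement per non-blank input line."""
    for number, line in enumerate(lines, start=1):
        # strip() also drops the form feeds some extractors emit
        # between pages.
        text = line.strip()
        if text:
            yield "INSERT INTO {} (line_no, content) VALUES ({}, {});".format(
                table, number, sql_quote(text)
            )


if __name__ == "__main__":
    # Force UTF-8 regardless of the locale (needs Python 3.7+).
    sys.stdin.reconfigure(encoding="utf-8")
    sys.stdout.reconfigure(encoding="utf-8")
    for statement in lines_to_sql(sys.stdin):
        print(statement)
```

Saved as lines_to_sql.py (a made-up name), it could sit between 
pdftotext from poppler-utils and the mysql client, with input.pdf and 
mydb as placeholders:

    pdftotext -enc UTF-8 input.pdf - | python3 lines_to_sql.py \
      | mysql --default-character-set=utf8mb4 mydb

Whether the extracted text is usable depends on how the PDF encodes 
its fonts; with Indic scripts the output may come out garbled or in 
the wrong order, so check it by eye before loading anything.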

Personally I'd use Perl for the middle part (if needed at all) and 
maybe¹ for the last part too.

 - Jonas

¹ I dislike SQL so would more likely store as RDF in 4store instead.

 * Jonas Smedegaard - idealist & Internet-arkitekt
 * Tlf.: +45 40843136  Website: http://dr.jones.dk/

 [x] quote me freely  [ ] ask before reusing  [ ] keep private
