Re: PDF rendering/extraction involving indic scripts
Quoting Mahendra Bhandwalkar (2017-01-11 05:13:31)
> is there is any tool in Linux that will read lines from marathi
> language pdf file and store in mysql Table?
I am unaware of any unified tool specifically doing that. I recommend
to use the classic Unix approach of compining multiple single-purpose
tools - one to extract the text string, one to split into lines, and one
to inject into MySQL. And use shell pipes to tie the commands together.
Personally I'd use Perl for the middle part (if needed at all) and
maybe¹ for the last part too.
¹ I dislike SQL so would more likely store as RDF in 4store instead.
* Jonas Smedegaard - idealist & Internet-arkitekt
* Tlf.: +45 40843136 Website: http://dr.jones.dk/
[x] quote me freely [ ] ask before reusing [ ] keep private