Quoting Jonas Smedegaard (2017-01-30 22:35:13) > Quoting Jonas Smedegaard (2017-01-30 18:28:11) >> Quoting Shrinivasan T (2017-01-30 17:52:29) >>> I am looking for a solution to convert HTML to PDF with custom Tamil >>> language fonts and custom paper size. > [...] >> You might also try pandoc (unlikely to work without fine-tuning, but >> if it works then you can do powerful things like scraping a web page >> and apply a LaTeX template to produce professional-grade output - >> like I did with http://source.jones.dk/eut.git/ to produce >> http://eut.biks.dk/ - an 60+ pages research study edited on a wiki >> and finalized as PDF books optimized for print and "ebook-style" use. > > This - using XeLaTeX and XeTeX internally - seems to work: > > pandoc --standalone --latex-engine xelatex -V lang='' -V papersize=a5 -V mainfont="Uni Ila.Sundaram-10" -V margin-left=5mm -V margin-right=5mm -V margin-top=10mm -V marginbottom=15mm --output manaosai-xetex-ebook.pdf https://ia800203.us.archive.org/31/items/ManaosaiShortStories/Manaosai-short-stories.html Above is my favorite, and I would appreciate feedback from others on the quality of Tamil font rendering (e.g. kerning and whether my suspicion about unsupported characters is correct). This seems to work too: cutycapt --user-style-string="body{font-family:'Uni Ila.Sundaram-10'}" --url=https://ia800203.us.archive.org/31/items/ManaosaiShortStories/Manaosai-short-stories --out=cutycapt.pdf ...but like wkhtmltopdf is limited - e.g. I cannot set page size. python-pisa is currently broken: http://bugs.debian.org/852363 - and likely won't handle Tamil anyway: http://bugs.debian.org/610390 - Jonas -- * Jonas Smedegaard - idealist & Internet-arkitekt * Tlf.: +45 40843136 Website: http://dr.jones.dk/ [x] quote me freely [ ] ask before reusing [ ] keep private
Attachment:
signature.asc
Description: signature