[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: HTML2TXT



nemanuel@student.dei.uc.pt (Nuno Emanuel F. Carvalho) writes:

>  Is there any package/application to convert HTML format to plain text ?
> 
Package: unhtml
Status: install ok installed
Priority: optional
Section: web
Installed-Size: 26
Maintainer: Paul Seelig <pseelig@debian.org>
Version: 2.2-4
Depends: libc6
Description: Removing the markup tags from a HTML file
 This program removes all HTML tags from a HTML file and directs it's output
 to stdout. It can be used as a filter for getting the text content of a HTML
 file without the need of firing up a web browser.

BTW: The version from unstable (compiles fine on stable) now finally
converts HTML entities correctly (hopefully).

                                     Ate logo, P. *8^)
-- 
   ------------ Paul Seelig <pseelig@mail.uni-mainz.de> -------------
   African Music Archive - Institute for Ethnology and Africa Studies
   Johannes Gutenberg-University   -  Forum 6  -  55099 Mainz/Germany
   ------------------- http://ntama.uni-mainz.de --------------------


Reply to: