Bug#309264: O: unhtml -- Remove the markup tags from an HTML file
On 20.05.2005, at 12:49, Jeroen van Wolffelaar wrote:
What is the added value of this package to Debian? Also, why can't
script/program be included in some other package that does
html-processing? Or, what about lynx -dump -stdin with some extra
options to drop the footnotes on links etc? It'll also reformat for
certain textwidths etc, making it IMHO much more useful.
At least it strips only HTML tags, not all XML tags it encounters in
a stream. And it does not strip the contents within <script /> tags.
If you count this as an additional value. This program seems to be
very lightweight, without a interpreter overhead. If you only need
the HTML tags stripped without a pretty formatting like the output of
a Lynx dump this would be for you.