need to collect a list of files for HTML docs
Hi,
One of my packages (libboost-dev) has a lot of documentation as HTML
files, complete with inlined graphics and the like. However, these
files are all mixed in with the source code.
I need a maintainable way to get a list of these files.
Surely some tool exists that will parse a set of HTML files, following
relative links recursively, and give me back a list of the files
referenced by <a href=..>, <img>, and the like. In short: I need
a list of all the files that make up the documentation, starting
from "index.html".
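
To make the requirement concrete, here is a rough Python sketch of the
crawl I have in mind. It is an illustration only, not an existing tool:
the names LinkCollector, local_target and collect_doc_files are made up
for this example. It assumes that every interesting link lives in an
href= or src= attribute, and that only relative links to files that
exist on disk should be followed. It uses nothing outside the Python
standard library.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urlsplit, unquote


class LinkCollector(HTMLParser):
    """Collect the values of href= and src= attributes from one HTML file."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Assumption: <a>, <img>, <link>, <script>, <frame> etc. all carry
        # their target in either href= or src=.
        for name, value in attrs:
            if name in ("href", "src") and value:
                self.links.append(value)


def local_target(base_dir, link):
    """Map a relative link to a local path; return None for anything else."""
    parts = urlsplit(link)
    if parts.scheme or parts.netloc or not parts.path or parts.path.startswith("/"):
        return None  # external URL, pure "#fragment", or absolute path
    return os.path.normpath(os.path.join(base_dir, unquote(parts.path)))


def collect_doc_files(start):
    """Return a sorted list of existing files reachable from `start`."""
    seen = set()
    todo = [os.path.normpath(start)]
    while todo:
        path = todo.pop()
        if path in seen or not os.path.isfile(path):
            continue  # already handled, or a dangling link
        seen.add(path)
        if not path.lower().endswith((".html", ".htm")):
            continue  # images, stylesheets etc. are listed but not parsed
        parser = LinkCollector()
        with open(path, encoding="utf-8", errors="replace") as f:
            parser.feed(f.read())
        parser.close()
        for link in parser.links:
            target = local_target(os.path.dirname(path), link)
            if target is not None:
                todo.append(target)
    return sorted(seen)
```

Calling collect_doc_files("index.html") from the top of the documentation
tree would return index.html plus every local file it links to, directly
or indirectly. The sketch does not stop links such as "../foo.html" from
leading outside the starting directory, and it ignores files referenced
only from CSS or scripts, so a real tool would need to handle both.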
Suggestions?
Thanks,
-S
--
by Rocket to the Moon,
by Airplane to the Rocket,
by Taxi to the Airport,
by Frontdoor to the Taxi,
by throwing back the blanket and laying down the legs ...
- They Might Be Giants