
Re: wget asp pages recursively?



On Sat, Aug 31, 2002 at 05:19:19AM -0400, Michael D. Crawford wrote:
> I'd like to download a sequence of pages which are produced by someone's 
> asp application so that I may read them while I am offline.
> 
> Is there a parameter to wget that will allow me to do this?
> 
> The URL for the first page is something like
> 
> http://www.something.com/junk.asp&thepageIwant=1
> 
> I can use the "--html-extension" to cause the page I download to have a 
> .html extension, so my web browsers know what to do with the file.  
> However, I don't seem to be able to get wget to follow the link within that 
> page to the next page, because the link is given as a parameter to an asp 
> application.  That is, there is HTML like this:
> 
> <p>Click the following to go to the
> <a href="http://www.something.com/junk.asp&thepageIwant=2">next 
> page</a>.</p>
> 
> What I need is for wget to understand that stuff following an "&" in a URL 
> indicates that it's a distinctly different page, and it should go 
> recursively retrieve it.  The --recursive option doesn't seem to help.
> 
> Any help you can give me is appreciated.

I use a sequence I learned off the Linux Journal site:
    wget -m -L -t 5 -w 5 'http://somplace.com/some.asp&page=1'

When I've run this against asp or cgi pages, it has worked just
fine: the -m option turns on recursion with infinite depth, plus
time-stamping.
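
In case it helps, here is a rough sketch of how that command could be wrapped
in a small script so it can be previewed before anything is downloaded. The
URL is the placeholder from above, and the DRY_RUN variable and run helper
are names I made up for illustration, not part of wget. Note that "&" has to
be quoted, otherwise the shell puts wget in the background and cuts the URL
off at that point. Also note that -L follows relative links only; the link
in your example is absolute, so you may need to drop -L for your site.

```shell
#!/bin/sh
# Hypothetical start URL, copied from the command above. The single quotes
# stop the shell from treating "&" as its background operator.
url='http://somplace.com/some.asp&page=1'

# Preview by default; set DRY_RUN=0 to actually start the download.
# (DRY_RUN and run are illustrative names, not wget features.)
DRY_RUN=${DRY_RUN:-1}

# run: print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# -m    mirror: recursion, time-stamping, infinite depth
# -L    follow relative links only
# -t 5  try each file up to 5 times
# -w 5  wait 5 seconds between retrievals
run wget -m -L -t 5 -w 5 "$url"
```

The preview prints the URL without its quotes, so if you copy that line back
into a shell, put the quotes around the URL again.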

--Matthew


