[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: lynx and google.com



On Thu 6 May 2004 13:03, William Ballard wrote:
> On Thu, May 06, 2004 at 12:50:20PM -0400, Rick Pasotto wrote:
> > On Thu, May 06, 2004 at 09:23:35AM -0700, William Ballard wrote:
> > > Has anybody tried 'lynx google.com' lately?
> > >
> > > You get HTTP/1.0 400 Bad Request.  It only started happening
> > > about 2 weeks ago.
> >
> > 'links google.com' works. The problem may be with lynx.
>
> Yeah, but lynx not supporting tables is sometimes useful to get a
> site to serve up a different version of the site, if I'm going to
> dump it and parse it.  lynx worked up until last week.  Do you think
> it's using HTTP/1.0 and google stopped supporting it?  Haven't
> bothered tracing it.
>
> Incidentally, I use this script to get around google's text ads:

Maybe they caught on to your little trick and put an end to it? :) Or at 
least tried to, since you seem to be spoofing the UA.

>
> wget -O- -U 'Mozilla/4.0 (compatible; Lynx 2.7.2)' \
>  "http://www.google.com/search?q=$*"; 2>/dev/null | \
>   sed -e 's/Froogle//g' -e 's/<img src=[^>]*>//g' \
>   -e 's/.*\(<div>.*\)/\1/' \
>   -e 's/href=\//href=http:\/\/google.com\//g' \
                       ^
Does this line actually work? To me it looks like you're missing an 
escape before the second '/' before the second 'href'.

>   -e 's/\([^>]\)<br>/\1/g' > ~/search.html
> /x/b/moz/cur/firefox -remote "openURL(/home/tom/search.html)"
>
> I used to use 'lynx -dump' instead of wget and then I didn't need the
> <div> sed clause, because google wouldn't serve up the text ads if
> you didn't have tables.
>
> Try my little script, keep a small terminal open beside Moz and use
> this script for your google searches.  It makes the results take up
> the whole screen again and strips out the text ads!

-- 
David P James
Ottawa, Ontario
http://david.jamesnet.ca
ICQ: #42891899, Jabber: davidpjames@jabber.org

If you've lost something, you had to lose it, not loose it.

Attachment: pgp0tFBs9swRb.pgp
Description: signature


Reply to: