[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: lynx and google.com



On Thu, May 06, 2004 at 12:50:20PM -0400, Rick Pasotto wrote:
> On Thu, May 06, 2004 at 09:23:35AM -0700, William Ballard wrote:
> > Has anybody tried 'lynx google.com' lately?
> > 
> > You get HTTP/1.0 400 Bad Request.  It only started happening about 2 
> > weeks ago.
> 
> 'links google.com' works. The problem may be with lynx.

Yeah, but lynx not supporting tables is sometimes useful to get a site 
to serve up a different version of the site, if I'm going to dump it and 
parse it.  lynx worked up until last week.  Do you think it's using 
HTTP/1.0 and google stopped supporting it?  Haven't bothered tracing it.

Incidentally, I use this script to get around google's text ads:

wget -O- -U 'Mozilla/4.0 (compatible; Lynx 2.7.2)' \
 "http://www.google.com/search?q=$*"; 2>/dev/null | \
  sed -e 's/Froogle//g' -e 's/<img src=[^>]*>//g' \
  -e 's/.*\(<div>.*\)/\1/' \
  -e 's/href=\//href=http:\/\/google.com\//g' \
  -e 's/\([^>]\)<br>/\1/g' > ~/search.html
/x/b/moz/cur/firefox -remote "openURL(/home/tom/search.html)"

I used to use 'lynx -dump' instead of wget and then I didn't need the 
<div> sed clause, because google wouldn't serve up the text ads if you 
didn't have tables.

Try my little script, keep a small terminal open beside Moz and use this 
script for your google searches.  It makes the results take up the whole 
screen again and strips out the text ads!



Reply to: