[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#190637: marked as done (plucker: Error parsing NYTimes pages)



Your message dated Thu, 1 Jun 2006 18:20:52 +0200
with message-id <20060601162052.GA14809@acer.maison.bogus>
and subject line plucker: Error parsing NYTimes pages
has caused the attached Bug report to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what I am
talking about this indicates a serious mail system misconfiguration
somewhere.  Please contact me immediately.)

Debian bug tracking system administrator
(administrator, Debian Bugs database)

--- Begin Message ---
Package: plucker
Version: 1.2-4
Severity: normal

Plucker is unable to retreive content from The New York Times website.


    [karsten@superego:.plucker]$ plucker-build --verbosity=2 -H nytime.html > NYTime
    sDB
    Error:  Runtime error parsing document http://www.nytimes.com/pages/national/index.html: unexpected char in declaration: '<'


-- System Information
Debian Release: testing/unstable
Kernel Version: Linux superego 2.4.20-686 #1 Mon Jan 13 22:22:30 EST 2003 i686 unknown unknown GNU/Linux

Versions of the packages plucker depends on:
ii  python2.1      2.1.3-4        An interactive object-oriented scripting lan


--- End Message ---
--- Begin Message ---
> Package: plucker
> Version: 1.2-4
> Severity: normal
> 
> Plucker is unable to retreive content from The New York Times website.

I just tried with plucker 1.8-11 and could not reproduce this bug. So
either the New York Times website is "corrected" or the bug has been
corrected in plucker.

I now have:

$ plucker-build --maxdepth=1 -c http://www.nytimes.com/pages/national/index.html

Pluckerdir is '/home/rousseau/.plucker'...
---- 0 collected, 1 to do ----
Processing http://www.nytimes.com/pages/national/index.html...
  Retrieved ok.
  Parsed ok; 32 images.
---- 1 collected, 32 to do ----
Processing http://graphics8.nytimes.com/ads/timesse.....mesSelect76.gif...
  Retrieved ok.
  Parsed ok.
---- 2 collected, 31 to do ----
Processing http://graphics8.nytimes.com/ads/etrade/88X31_logo.gif...
  Retrieved ok.
  Parsed ok.
---- 3 collected, 30 to do ----
Processing http://graphics8.nytimes.com/images/misc/nytlogo153x23.gif...
  Retrieved ok.
  Parsed ok.
---- 4 collected, 29 to do ----
Processing http://graphics8.nytimes.com/images/2006...../01glass190.jpg...
  Retrieved ok.
  Parsed ok.
---- 5 collected, 28 to do ----
Processing http://graphics8.nytimes.com/images/2006.....1truancy.75.jpg...
  Retrieved ok.
  Parsed ok.
---- 6 collected, 27 to do ----
Processing http://graphics8.nytimes.com/images/2006.....s/01pets.75.jpg...
  Retrieved ok.
  Parsed ok.
---- 7 collected, 26 to do ----
Processing http://graphics8.nytimes.com/images/2006.....s/SAULNY190.jpg...
  Retrieved ok.
  Parsed ok.
---- 8 collected, 25 to do ----
Processing http://graphics8.nytimes.com/images/2006.....f-video.190.jpg...
  Retrieved ok.
  Parsed ok.
---- 9 collected, 24 to do ----
Processing http://graphics8.nytimes.com/adx/images/....._120x90_blk.gif...
  Retrieved ok.
  Parsed ok.
---- 10 collected, 23 to do ----
Processing http://graphics8.nytimes.com/adx/images/.....new_336x280.gif...
  Retrieved ok.
  Parsed ok.
---- 11 collected, 22 to do ----
Processing http://graphics8.nytimes.com/ads/amex/ch.....n_88x31_20k.gif...
  Retrieved ok.
  Parsed ok.
---- 12 collected, 21 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....mages/mm_1b.gif...
  Retrieved ok.
  Parsed ok.
---- 13 collected, 20 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....mages/mm_3b.gif...
  Retrieved ok.
  Parsed ok.
---- 14 collected, 19 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....utomobiles2.gif...
  Retrieved ok.
  Parsed ok.
---- 15 collected, 18 to do ----
Processing http://graphics8.nytimes.com/ads/images/misc/spacer.gif...
  Retrieved ok.
  Parsed ok.
---- 16 collected, 17 to do ----
Processing http://graphics8.nytimes.com/ads/images/misc/spacer.gif...
  Retrieved ok.
  Parsed ok.
---- 17 collected, 16 to do ----
Processing http://graphics8.nytimes.com/ads/images/misc/spacer.gif...
  Already retrieved and parsed.
---- 17 collected, 15 to do ----
Processing http://graphics8.nytimes.com/ads/images/misc/spacer.gif...
  Retrieved ok.
  Parsed ok.
---- 18 collected, 14 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....ages/bullet.gif...
  Retrieved ok.
  Parsed ok.
---- 19 collected, 13 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....ages/bullet.gif...
  Already retrieved and parsed.
---- 19 collected, 12 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....ages/bullet.gif...
  Already retrieved and parsed.
---- 19 collected, 11 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....mages/mm_15.gif...
  Retrieved ok.
  Parsed ok.
---- 20 collected, 10 to do ----
Processing http://graphics8.nytimes.com/marketing/2.....mages/mm_17.gif...
  Retrieved ok.
  Parsed ok.
---- 21 collected, 9 to do ----
Processing http://media.nyadmcncserve-05y06a.com/im.....1017010_1001647...
  Retrieved ok.
  Moved from 'http://media.nyadmcncserve-05y06a.com/image?spacedesc=1032311_1014155_1x1_1017010_1001647' to 'http://media.nyadmcncserve-05y06a.com/ta/145/502/ClickHere.gif'.
  Parsed ok.
---- 22 collected, 8 to do ----
Processing http://graphics8.nytimes.com/ads/timesse.....elect_86x40.gif...
  Retrieved ok.
  Parsed ok.
---- 23 collected, 7 to do ----
Processing http://graphics8.nytimes.com/ads/house/autos_86x401.gif...
  Retrieved ok.
  Parsed ok.
---- 24 collected, 6 to do ----
Processing http://graphics8.nytimes.com/ads/house/n.....e_86x40_blk.gif...
  Retrieved ok.
  Parsed ok.
---- 25 collected, 5 to do ----
Processing http://graphics8.nytimes.com/ads/house/jobs_86x401.gif...
  Retrieved ok.
  Parsed ok.
---- 26 collected, 4 to do ----
Processing http://graphics8.nytimes.com/ads/house/8.....rketplace_2.gif...
  Retrieved ok.
  Parsed ok.
---- 27 collected, 3 to do ----
Processing http://www.nytimes.com/ads/blank.gif...
  Retrieved ok.
  Parsed ok.
---- 28 collected, 2 to do ----
Processing http://www.nytimes.com/adx/bin/clientsid.....Q263GT-G3OQ3BOG...
  Retrieved ok.
  Parsed ok.
---- 29 collected, 1 to do ----
Processing http://up.nytimes.com/?d=0//&t=3&s=0&ui=.....index%2ehtml%3f...
  Retrieved ok.
Error:  Runtime error parsing document http://up.nytimes.com/?d=0//&t=3&s=0&ui=&r=&u=www%2enytimes%2ecom%2fpages%2fnational%2findex%2ehtml%3f: Error while opening image http://up.nytimes.com/?d=0//&t=3&s=0&ui=&r=&u=www%2enytimes%2ecom%2fpages%2fnational%2findex%2ehtml%3f with netpbm
  Parsing failed.
---- all 29 pages retrieved and parsed ----

Writing out collected data...
Writing to cache dir /home/rousseau/.plucker/cache
Converting http://graphics8.nytimes.com/ads/amex/ch.....ight=31&depth=1...
Converting http://graphics8.nytimes.com/adx/images/.....ight=90&depth=1...
Converting http://graphics8.nytimes.com/ads/house/n.....ight=40&depth=1...
Converting http://graphics8.nytimes.com/ads/house/8.....ight=40&depth=1...
Converting http://graphics8.nytimes.com/ads/house/j.....ight=40&depth=1...
Converting http://graphics8.nytimes.com/images/misc.....ight=23&depth=1...
Converting http://graphics8.nytimes.com/images/2006.....ght=126&depth=1...
Converting http://graphics8.nytimes.com/adx/images/.....ght=280&depth=1...
Converting http://graphics8.nytimes.com/ads/house/a.....ight=40&depth=1...
Converting http://graphics8.nytimes.com/images/2006.....ght=126&depth=1...
Converting http://media.nyadmcncserve-05y06a.com/ta.....eight=1&depth=1...
Converting http://graphics8.nytimes.com/marketing/2.....ight=15&depth=1...
Converting http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1...
Converting http://www.nytimes.com/adx/bin/clientsid.....Q263GT-G3OQ3BOG...
Converting http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1...
Converting http://graphics8.nytimes.com/ads/images/.....eight=1&depth=1...
Converting http://www.nytimes.com/pages/national/index.html...
Converting http://graphics8.nytimes.com/ads/timesse.....ight=10&depth=1...
Converting http://graphics8.nytimes.com/images/2006.....ght=269&depth=1...
Converting http://graphics8.nytimes.com/ads/timesse.....ight=40&depth=1...
Converting http://graphics8.nytimes.com/images/2006.....ight=75&depth=1...
Converting http://graphics8.nytimes.com/images/2006.....ight=75&depth=1...
Converting http://www.nytimes.com/ads/blank.gif?wid.....eight=1&depth=1...
Converting http://graphics8.nytimes.com/ads/etrade/.....ight=31&depth=1...
Converting http://graphics8.nytimes.com/ads/images/.....ight=94&depth=1...
Converting http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1...
Converting http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1...
Converting http://graphics8.nytimes.com/marketing/2.....ight=11&depth=1...
Converting http://graphics8.nytimes.com/ads/images/.....eight=5&depth=1...
Wrote 1 <= plucker:/~special~/index
Wrote 2 <= http://www.nytimes.com/pages/national/index.html
Wrote 3 <= plucker:/~special~/pluckerlinks
Wrote 5 <= plucker:/~special~/metadata
Wrote 11 <= http://graphics8.nytimes.com/ads/amex/ch.....ight=31&depth=1
Wrote 12 <= http://graphics8.nytimes.com/ads/etrade/.....ight=31&depth=1
Wrote 13 <= http://graphics8.nytimes.com/ads/house/8.....ight=40&depth=1
Wrote 14 <= http://graphics8.nytimes.com/ads/house/a.....ight=40&depth=1
Wrote 15 <= http://graphics8.nytimes.com/ads/house/j.....ight=40&depth=1
Wrote 16 <= http://graphics8.nytimes.com/ads/house/n.....ight=40&depth=1
Wrote 17 <= http://graphics8.nytimes.com/ads/images/.....ight=94&depth=1
Wrote 18 <= http://graphics8.nytimes.com/ads/images/.....eight=1&depth=1
Wrote 19 <= http://graphics8.nytimes.com/ads/images/.....eight=5&depth=1
Wrote 20 <= http://graphics8.nytimes.com/ads/timesse.....ight=10&depth=1
Wrote 21 <= http://graphics8.nytimes.com/ads/timesse.....ight=40&depth=1
Wrote 22 <= http://graphics8.nytimes.com/adx/images/.....ght=280&depth=1
Wrote 23 <= http://graphics8.nytimes.com/adx/images/.....ight=90&depth=1
Wrote 24 <= http://graphics8.nytimes.com/images/2006.....ght=126&depth=1
Wrote 25 <= http://graphics8.nytimes.com/images/2006.....ght=126&depth=1
Wrote 26 <= http://graphics8.nytimes.com/images/2006.....ght=269&depth=1
Wrote 27 <= http://graphics8.nytimes.com/images/2006.....ight=75&depth=1
Wrote 28 <= http://graphics8.nytimes.com/images/2006.....ight=75&depth=1
Wrote 29 <= http://graphics8.nytimes.com/images/misc.....ight=23&depth=1
Wrote 30 <= http://graphics8.nytimes.com/marketing/2.....ight=11&depth=1
Wrote 31 <= http://graphics8.nytimes.com/marketing/2.....ight=15&depth=1
Wrote 32 <= http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1
Wrote 33 <= http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1
Wrote 34 <= http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1
Wrote 35 <= http://graphics8.nytimes.com/marketing/2.....ight=21&depth=1
Wrote 36 <= http://media.nyadmcncserve-05y06a.com/ta.....eight=1&depth=1
Wrote 37 <= http://www.nytimes.com/ads/blank.gif?wid.....eight=1&depth=1
Wrote 38 <= http://www.nytimes.com/adx/bin/clientsid.....Q263GT-G3OQ3BOG
Wrote 171 <= plucker:/~special~/links1
Done!

-- 
 Dr. Ludovic Rousseau                        Ludovic.Rousseau@free.fr
 -- Normaliser Unix c'est comme pasteuriser le camembert, L.R. --

--- End Message ---

Reply to: