[debian-www, please CC the bugreport on replies; thanks] -=| Andreas B. Mundt, Wed, Feb 10, 2010 at 06:16:03PM +0100 |=- > Package: libwww-perl > Version: 5.834-1 > Severity: important > > we use GET to download a wikipage and further process the data to > prepare the manual of Debian Edu. The command: > GET "http://wiki.debian.org/DebianEdu/Documentation/Lenny/AllInOne?action=raw" > works fine in Lenny, but stopped working in squeeze where "You are not > allowed to access this!" is returned. If you remove "?action=raw" from > the URL anything is fine. Is this inteded and we have to provide a > header? This is very strange. There is a slight difference in the User-Agent request header (as reported by tcpdump -s 1500 -X): On Lenny (works) ================ User-Agent: lwp-request/0.810 On Sid (breaks) =============== User-Agent: lwp-request/5.834 libwww-perl/5.834 So it seems there is some filter on wiki.debian.org that refuses some user agents. HTTP/1.1 (section 14.43) explicitly allows multiple entries in this header: The field can contain multiple product tokens (section 3.8) and comments identifying the agent and any subproducts which form a significant part of the user agent Cc-ing debian-www for comment.
Attachment:
signature.asc
Description: Digital signature