Bug#969553: urlcheck.py script tries to parse compressed GIMP image files
Laura Arjona Reina <larjona@debian.org> writes:
> Package: www.debian.org
> User: www.debian.org@packages.debian.org
> Usertag: scripts
> Severity: normal
>
> Hi
>
> the "urlcheck" scripts generate this log for the /logos folder:
>
> Looking into http://www.debian.org/logos/openlogo.xcf.gz
> Error reading page: http://www.debian.org/logos/openlogo.xcf.gz
> Looking into http://www.debian.org/logos/officiallogo.xcf.gz
> Error reading page: http://www.debian.org/logos/officiallogo.xcf.gz
> Looking into http://www.debian.org/logos/officiallogo-nd.xcf.gz
> Error reading page: http://www.debian.org/logos/officiallogo-nd.xcf.gz
>
> I guess this means it tries to parse the xcf.gz files and probably we
> need to update the script to skip such files (compressed images).
>
> Is anybody familiar with Python who can help?
>
> The code of the script is here:
>
> https://salsa.debian.org/webmaster-team/cron/-/tree/master/urlcheck
>
> (I guess the main script, urlcheck.py, is probably where the fix should
> be made).
>
> The script is called by 3 cron jobs:
>
> 17 3 * * * cd /srv/www.debian.org/cron/urlcheck && ./run.urlcheck
> 36 12 * * * cd /srv/www.debian.org/cron/urlcheck && ./make.bad_link.pages
> 5 13 * * * cd /srv/www.debian.org/cron/urlcheck && ./cleanup.logs
>
> and the daily logs are here:
> https://www-master.debian.org/build-logs/urlcheck/
> (check logos folder).
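Hi, I have attached a simple patch.
It is not the best approach, because it makes urlcheck ignore everything
under /logos/ rather than only the compressed images, but it works.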
--- run.urlcheck.orig 2020-09-05 10:59:55.275539752 +0900
+++ run.urlcheck 2020-09-05 11:02:39.847539762 +0900
@@ -6,7 +6,7 @@
--ignore News/weekly/oldurl --ignore /Lists-Archives --ignore /cgi-bin/fom \
--ignore debian.org/fom --ignore /releases/ --ignore /international/ --ignore /security/ \
--ignore /devel/ --ignore /News/ --ignore /doc/ --ignore /distrib/ \
- --ignore /ports/ --ignore /intl/ \
+ --ignore /ports/ --ignore /intl/ --ignore /logos/ \
http://www.debian.org/ >& logs/web.$date &
./urlcheck.py --require www.debian.org/international http://www.debian.org/international/ \
>& logs/web.$date.intl &
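
A narrower alternative would be to skip only the files that cannot contain
links, as the report suggests. The following is just a sketch: I have not
checked how urlcheck.py is structured, so the names should_skip and
SKIP_SUFFIXES are hypothetical, and the suffix list is an assumption that
would need adjusting.

```python
from urllib.parse import urlparse

# Hypothetical suffix list (an assumption, not taken from urlcheck.py):
# compressed archives and raster/GIMP images that contain no links to follow.
SKIP_SUFFIXES = (".gz", ".bz2", ".xz", ".zip", ".xcf", ".png", ".jpg", ".jpeg", ".gif")


def should_skip(url):
    """Return True if the URL path ends in a suffix we do not want to parse."""
    # Look only at the path, so a query string or fragment does not hide the suffix.
    path = urlparse(url).path.lower()
    return path.endswith(SKIP_SUFFIXES)


if __name__ == "__main__":
    for url in (
        "http://www.debian.org/logos/openlogo.xcf.gz",
        "http://www.debian.org/logos/",
    ):
        print(url, "skip" if should_skip(url) else "parse")
```

With a check like this placed before the page is fetched for parsing, the
other pages under /logos/ would still be checked.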
Sincerely, Byung-Hee
--
^고맙습니다 _救濟蒼生_ 감사합니다_^))//