
Bug#969553: urlcheck.py script tries to parse compressed GIMP image files



Laura Arjona Reina <larjona@debian.org> writes:

> Package: www.debian.org
> User: www.debian.org@packages.debian.org
> Usertag: scripts
> Severity: normal
>
> Hi
>
> The "urlcheck" scripts generate this log for the /logos folder:
>
> Looking into http://www.debian.org/logos/openlogo.xcf.gz
>   Error reading page: http://www.debian.org/logos/openlogo.xcf.gz
> Looking into http://www.debian.org/logos/officiallogo.xcf.gz
>   Error reading page: http://www.debian.org/logos/officiallogo.xcf.gz
> Looking into http://www.debian.org/logos/officiallogo-nd.xcf.gz
>   Error reading page: http://www.debian.org/logos/officiallogo-nd.xcf.gz
>
> I guess this means it tries to parse the .xcf.gz files, so we probably
> need to update the script to skip such files (compressed images).
>
> Is anybody familiar with Python who can help?
>
> The code of the script is here:
>
> https://salsa.debian.org/webmaster-team/cron/-/tree/master/urlcheck
>
> (I guess the main script, urlcheck.py, is where the fix should be
> made.)
>
> The script is called by 3 cron jobs:
>
> 17  3 * * *     cd /srv/www.debian.org/cron/urlcheck && ./run.urlcheck
> 36 12 * * *     cd /srv/www.debian.org/cron/urlcheck && ./make.bad_link.pages
> 5  13 * * *     cd /srv/www.debian.org/cron/urlcheck && ./cleanup.logs
>
> and the daily logs are here:
> https://www-master.debian.org/build-logs/urlcheck/
> (check logos folder).

Hi, I have attached a simple patch.
It is not the best approach, but it works: it makes run.urlcheck ignore
the whole /logos/ directory.

--- run.urlcheck.orig	2020-09-05 10:59:55.275539752 +0900
+++ run.urlcheck	2020-09-05 11:02:39.847539762 +0900
@@ -6,7 +6,7 @@
 	--ignore News/weekly/oldurl --ignore /Lists-Archives --ignore /cgi-bin/fom \
 	--ignore debian.org/fom --ignore /releases/ --ignore /international/ --ignore /security/ \
 	--ignore /devel/ --ignore /News/ --ignore /doc/ --ignore /distrib/ \
-   --ignore /ports/ --ignore /intl/ \
+   --ignore /ports/ --ignore /intl/ --ignore /logos/ \
 	http://www.debian.org/ >& logs/web.$date &
 ./urlcheck.py --require www.debian.org/international http://www.debian.org/international/ \
 	>& logs/web.$date.intl &
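
A narrower alternative to ignoring all of /logos/ might be to skip URLs
by file suffix inside the checker itself. The following is only a sketch
under assumptions: should_skip and SKIP_SUFFIXES are hypothetical names,
not taken from urlcheck.py, and the suffix list is a guess at what
counts as a non-HTML file.

```python
from urllib.parse import urlsplit

# Hypothetical list of suffixes that should not be parsed as HTML.
SKIP_SUFFIXES = (".gz", ".bz2", ".xz", ".zip", ".xcf")


def should_skip(url):
    """Return True if the URL path ends in one of SKIP_SUFFIXES."""
    # Only look at the path, so query strings do not affect the result.
    path = urlsplit(url).path.lower()
    return path.endswith(SKIP_SUFFIXES)
```

With a check like this, should_skip("http://www.debian.org/logos/openlogo.xcf.gz")
is true, while other pages under /logos/ would still be checked.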

Sincerely, Byung-Hee

-- 
^고맙습니다 _救濟蒼生_ 감사합니다_^))//
