
Bug#909080: dl10n-spider stops parsing mails when there is a month without mails



Package: debian-i18n
Severity: normal

Dear all

It seems that the script that parses the l10n-language mailing list archives and creates the databases in l10n.debian.org/coordination/00data/status.XX stops (without errors) when it reaches a month in which no mails were sent to the list, apparently assuming that it has arrived at the current month/year.

See for example:

https://l10n.debian.org/coordination/turkish/tr.by_date.html

I believe these pages are generated using the data here:

https://l10n.debian.org/coordination/00data/

The status.tr file there shows that parsing stopped at year 2016, month 6:

Package:
Date: 2018-09-18
Year: 2016
Month: 6
Message: 0
Page: 1

This behavior can be easily reproduced in this way:

sudo apt install dl10n
git clone https://salsa.debian.org/l10n-team/dl10n.git
cd dl10n/
mkdir data
touch data/status.tr
./dl10n-spider --year 2016 --month 01 --message 0 tr
(it begins to parse and stops in 2016/06)

./dl10n-spider --year 2016 --month 07 --message 0 tr
(it continues to parse and stops in 2017/05)

./dl10n-spider --year 2017 --month 07 --message 0 tr
(it continues to parse and stops in 2017/11, etc.)

If you look at https://lists.debian.org/debian-l10n-turkish/ you can see that the script stops when it reaches a year/month without mails, but can continue if we provide a starting point after that empty month.
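
To illustrate the difference between the observed and the expected behaviour, here is a minimal sketch in Python. It is not the code of Spider.pm (which is Perl) and it is not a patch: the function months_to_visit, the has_mail callback and the stop_on_empty flag are hypothetical names made up for this illustration only.

```python
from datetime import date


def months_to_visit(start_year, start_month, has_mail, today=None,
                    stop_on_empty=False):
    """Yield the (year, month) pairs that would be parsed.

    has_mail is a hypothetical callback (year, month) -> bool that says
    whether the list archive contains any mail for that month.
    stop_on_empty=True mimics the behaviour described in this report;
    the default mimics the behaviour I would expect.
    """
    today = today or date.today()
    year, month = start_year, start_month
    # Only the current year/month should end the walk.
    while (year, month) <= (today.year, today.month):
        if has_mail(year, month):
            yield (year, month)
        elif stop_on_empty:
            # Observed: an empty month is treated as "nothing more to do".
            return
        # Expected: an empty month is skipped and the walk goes on.
        month += 1
        if month > 12:
            year, month = year + 1, 1


if __name__ == "__main__":
    # Assumed example data: only June 2016 has no mails.
    empty = {(2016, 6)}

    def has_mail(year, month):
        return (year, month) not in empty

    today = date(2016, 8, 15)
    print(list(months_to_visit(2016, 1, has_mail, today, stop_on_empty=True)))
    print(list(months_to_visit(2016, 1, has_mail, today)))
```

With June 2016 marked as empty and "today" set to 2016-08-15, the first call visits only months 1 to 5 of 2016, while the second one also visits months 7 and 8.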

I've had a look at the code and I think that we should modify the file https://salsa.debian.org/l10n-team/dl10n/blob/master/lib/Debian/L10n/Spider.pm , sub spider($$$$$@) (line 173 onwards). I'm afraid I cannot provide a patch: the code is difficult for me to understand and I'm not sure what I should change or how.

I hope somebody else can look at this...

Thanks

--
Laura Arjona Reina
https://wiki.debian.org/LauraArjona

