[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

RE: help needed to manage s390x host for ci.debian.net



I am wondering whether we have downgraded the machines to 5.10.0-20 kernel to get rid of the kernel bug which is known to cause issue in user processes at random - described in the cover letter here: https://lists.debian.org/debian-s390/2023/02/msg00019.html

 

The following patch fixes this issue:

https://lists.debian.org/debian-s390/2023/02/msg00019.html

The patch fixes the problem introduced in commit 75309018a24d ("s390: add support for TIF_NOTIFY_SIGNAL")

 

Can we check if the patch solves any of the CI issues?

 

-Dipak

 

 

On 16.02.23 17:49, Paul Gevers wrote:
> As you can see e.g. here [1,2] it comes and goes (albeit sometimes the
> queue was empty). I don't think its very different, I just never got out
> of the s390x host what I was expecting. Long time I blamed it on the
> "stealing" that happens on a shared host, but I think there's more.
>
> https://ci.debian.net/munin/ci-worker-s390x-01/ci-worker-s390x-01/debci_packages_processed.html

So a pet peeve of mine are unitless graphs. Can we please annotate what
the unit is? If we're looking at "Packages processed by architecture"[1]
(which again, isn't a workable unit), then we see that s390x does not
have a bad average, nor an overly bad max - given that it's with one
worker instead of like 14 for amd64 and 10 for arm64?

The average/max for the week is double for amd64 vs. s390x, so what does
the queue size mean? Is there still obsolete work in the queue as well
or does every item have the same value? (There's no way right now it can
catch up with that many items in the queue. Although that again depends
on the unit...)

Kind regards
Philipp Kern

[1]
https://ci.debian.net/munin/debian.net/ci-master.debian.net/debci_total_packages_processed.html


Reply to: