[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: kernel crashes after soft lockups in xen domU



reassign 758622 linux-image-3.16.0-4-amd64
thanks

Hi Ben,

thanks for your response.

Am 2014-11-05 21:40, schrieb Ben Hutchings:
On Wed, 2014-11-05 at 17:56 +0100, Jonas Meurer wrote:
[...]
So the question is: why does the VM run stable on xen1 while it
crashes all the time on xen2. If I compare xen1 and xen2, only
real difference is mainboard (Supermicro X8 on xen1; Supermicro
X9 on xen2) and CPU (Xeon L5939 on xen1; E5-2609 on xen2)

As a next step I'll put the harddisks into another X8/Xeon L5639
server system and try to reproduce the crashes there. My bet is
that this system will not crash anymore. In other words, I guess
that this very bug is only triggered with the X9 + E-2609
combination.

> Can I do anything additional to help debugging the bug? Shall I report
> it
> to Xen upstream or send it to lkml?

Still the same question. Shall I send the bugreport to upstream?
Unfortunately nobody from Debian Linux kernel and/or Xen team seems
to care :-/
[...]

Sorry you haven't had a response from us so far.  This seems to be
fairly clearly a Linux/Xen interaction and I don't know enough about Xen
to suggest how to debug it.

As it involves a relatively old kernel version, I don't think Linux
upstream developers will want to hear about this unless you can also
reproduce it with a more recent version.  Linux 3.16 is available (in
testing and wheezy-backports) if you would like to try that.

I tried linux-image-3.16 from wheezy-backports as VM kernel in the
meantime. Sorry to report back that the bug is still reproducible
with this kernel. I'm reassigning it to the jessie kernel for that
reason.
The kernel backtrace was slightly different, but the behaviour was
the same: After putting the webserver on test VM under heavy load
with siege and pylot, the load exploded until the machine crashed.

Now I replaced the hardware again with a Supermicro S8 board and a
Intel Xeon L5639 CPU - and you know what: the bug disappeared.

I'll have to put the system back into production mode now, so
further debugging will be complicated.

To sum up the situation:

-> a setup with Debian Wheezy Dom0 and Debian Wheezy or Jessie VM
-> the VM runs an apache webserver with mysql backend, nothing more
-> the VM crashes under load if Dom0 CPU is Intel Xeon E5-2609
-> the VM doesn't crash under load if Dom0 CPU is Intel Xeon 5639
-> tested on four completely different hardware setups, all
   components except harddisks replaced several times

Kind regards,
 jonas


Reply to: