Bug#897893: Strange kernel panics on linux-image-4.9.0-6-amd64 with mlx4_en driver



Control: tag -1 moreinfo

On Fri, 2018-05-04 at 15:35 +0300, Eugene Budanov wrote:
> Package: linux-image-4.9.0-6-amd64
> Version: 4.9.82-1+deb9u3
> 
> Hi!
> 
> Here's a short problem description.
> 
> We have some Supermicro servers with the same configuration for all
> machines (hardware, kernels, packages, etc). A month ago, or maybe a
> bit later, all of these machines began crashing into kernel panic. I
> can't find any pattern of failure at all. But it happens very often.
> Some machines may drop into kernel panic a couple times a day! But
> usually machines crash about every 3 to 6 days. All of these machines
> have intensive network and i/o operations.
> 
> I saved dmesg log from one of these machines after the crash (see the
> attachment).
> 
> As far as I see, every machine probably has problems with mlx4_en or
> GRO. Also I see list_add double add => list_del corruption. Can I do
> anything to get more detailed logs? What additional information do
> you need for better problem diagnostics?

The WARNING messages show that there are out-of-tree modules (i.e. not
part of the kernel package) loaded.  What are those?
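
If it helps to pull those names out of the saved dmesg log: on a
"Modules linked in:" line, a module that carries taint flags is followed
by those flags in parentheses, and "O" marks an out-of-tree module. A
minimal Python sketch, assuming that format; the function name and the
sample line are made up for illustration and are not taken from the
attached log:

```python
import re

# One entry of a "Modules linked in:" line: a module name, optionally
# followed by taint/state flags in parentheses, e.g. "foo(OE)" or "bar".
_MODULE_ENTRY = re.compile(r"^([A-Za-z0-9_]+)(?:\(([A-Z+-]*)\))?$")


def out_of_tree_modules(log_text):
    """Return names of modules flagged 'O' on any 'Modules linked in:' line."""
    found = []
    for line in log_text.splitlines():
        _, sep, rest = line.partition("Modules linked in:")
        if not sep:
            continue  # not a module-list line
        for token in rest.split():
            m = _MODULE_ENTRY.match(token)
            # group(2) is None when the module has no flags at all
            if m and "O" in (m.group(2) or "") and m.group(1) not in found:
                found.append(m.group(1))
    return found


# Illustrative input only; "example_oot" and "other_oot" are hypothetical names.
sample = ("[ 1234.567890] Modules linked in: example_oot(OE) mlx4_en "
          "mlx4_core ext4 other_oot(O)")
print(out_of_tree_modules(sample))
```

On a running system the same flags should also be visible per module in
/sys/module/<name>/taint.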

Ben.

-- 
Ben Hutchings
Every program is either trivial or else contains at least one bug
