[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#637190: linux-image-2.6.32-5-amd64: Random kernel panics & general protection faults



On Tue, 2011-08-09 at 15:56 +0200, Simon Morvan wrote:
> Le 09/08/2011 15:09, Ben Hutchings a écrit :
> > On Tue, 2011-08-09 at 14:55 +0200, Simon Morvan wrote:
> >> Le 09/08/2011 14:50, Ben Hutchings a écrit :
> >>> On Tue, 2011-08-09 at 12:22 +0200, Simon Morvan wrote:
> >>>> We're getting random crash of the system (panics, GPF). Stack trace is always different.
> >>> Can you check that the power supply is sufficient for all these disks?
> >> Do you have recommendations ? I haven't found so much information on how
> >> to estimate the power need. Currently this is a 600W power supply (FWIW:
> >> Cooler Master Silent Pro M - 600W)
> > Many motherboards have a voltage monitoring chip, which you should be
> > able to read with the 'sensors' command from the 'lm-sensors' package.
> > This should show whether the actual voltages are being pulled down
> > because the power supply is overloaded.  You would need to actually make
> > all the hard drives active while checking this.
> it8720-isa-0228
> Adapter: ISA adapter
> Vcore:       +1.33 V  (min =  +0.78 V, max =  +1.50 V)
> Vdram:       +1.50 V  (min =  +1.42 V, max =  +1.57 V)
> +3.3V:       +3.30 V  (min =  +3.14 V, max =  +3.47 V)
> *+5V:         +4.92 V  (min =  +4.76 V, max =  +5.24 V)*
> +12V:       +12.36 V  (min = +11.41 V, max = +12.62 V)
> in5:         +2.70 V  (min =  +0.00 V, max =  +4.08 V)
> 5VSB:        +4.92 V  (min =  +4.76 V, max =  +5.24 V)
> Vbat:        +3.25 V
> CPU Fan:       0 RPM  (min =    0 RPM)
> Sys Fan:       0 RPM  (min =    0 RPM)
> Sys Fan:       0 RPM  (min =    0 RPM)
> fan5:          0 RPM  (min =    0 RPM)
> temp1:       +44.0°C  (low  = +127.0°C, high = +127.0°C)  sensor = 
> thermistor
> CPU Temp:    +59.0°C  (low  = +127.0°C, high = +127.0°C)  sensor = 
> thermal diode
> temp3:       +54.0°C  (low  = +127.0°C, high = +127.0°C)  sensor = 
> thermistor
> cpu0_vid:   +0.513 V
> 
> Assuming the sensors.conf is OK (which is not 100% sure for that 
> Gigabyte GA-890GPA-UD3H mobo, AFAIK), do you think the 4.92V for the 5V 
> is too low ?

No, that's entirely normal.  Aside from Vcore, the BIOS appears to set
warning limits of 5% below and above the nominal values, and all
voltages are within those limits.  The temperatures also look fine.

> I was hdparm'ing some disks while, compiling a kernel and a raid5 sync 
> was inprogress at that time (and it freezed, of course)

So, new theory required.

Given you said you're not using ECC memory, can you test it with
memtest86+ for a few hours?

Ben.

Attachment: signature.asc
Description: This is a digitally signed message part


Reply to: