[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#569012: linux-image-2.6.32-1-amd64: Freezes and memory corruption



On Fri, Feb 12, 2010 at 05:47:05AM +0000, Ben Hutchings wrote:
> On Fri, 2010-02-12 at 08:46 +0400, Stepan Golosunov wrote:
> > 12.02.2010 в 02:42:29 +0000 Ben Hutchings написал:
> > > On Tue, 2010-02-09 at 17:26 +0400, Stepan Golosunov wrote:
> > > > Package: linux-2.6
> > > > Version: 2.6.32-6
> > > > Severity: critical
> > > > 
> > > > After running 2.6.32 (as well as 2.6.31 and 2.6.30) for several minutes,
> > > > computer freezes completely. Network or alt-sysrq-b do not work.
> > > > 
> > > > Further investigation showed that data is being corrupted while reading
> > > > or writing files.
> > > > Corruption on reading is easily reproduced with debsums (it fails on
> > > > random files), corruption on writing happened at least once (under
> > > > 2.6.32-trunk-amd64) and looks like:
> > > [...]
> > > 
> > > Given that the first advertised feature of the P5Q Turbo motherboard is
> > > 'Easy Overclocking', I have to ask: are you overclocking it?
> > 
> > No. Unless it overclocks something automatically.
> 
> According to the manual there is an 'Ai Tweaker' menu in the BIOS setup
> program.  Check that all settings are 'Auto', except the last one. The
> 'CPU Margin Enhancement' setting has different options.  I found some
> reports that the 'Optimized' option can result in instability and
> recommending the 'Performance' option.

The problem is reproducible with any 'CPU Margin Enhancement' setting.

However, while playing with init=/bin/bash, I was unable to reproduce
the bug before running "modprobe processor". After that I disabled
'Intel(R) C-STATE Tech' in the BIOS, and now debsums did not produce
any random failure for several times and the system is still running.

dmesg output when loading "processor" with enabled 'Intel(R) C-STATE Tech':
[  476.918852] ACPI: SSDT 00000000cff880d0 00235 (v01 DpgPmm  P001Ist 00000011 INTL 20060113)
[  476.919311] ACPI: SSDT 00000000cff889d0 004B2 (v01  PmRef  P001Cst 00003001 INTL 20060113)
[  476.919855] Monitor-Mwait will be used to enter C-1 state
[  476.919874] Monitor-Mwait will be used to enter C-2 state
[  476.919888] Monitor-Mwait will be used to enter C-3 state
[  476.919892] Marking TSC unstable due to TSC halts in idle
[  476.919982] processor LNXCPU:00: registered as cooling_device0
[  476.920367] ACPI: SSDT 00000000cff88310 00235 (v01 DpgPmm  P002Ist 00000012 INTL 20060113)
[  476.920707] ACPI: SSDT 00000000cff88e90 00085 (v01  PmRef  P002Cst 00003000 INTL 20060113)
[  476.921274] Switching to clocksource hpet
[  476.921434] processor LNXCPU:01: registered as cooling_device1
[  476.921824] ACPI: SSDT 00000000cff88550 00235 (v01 DpgPmm  P003Ist 00000012 INTL 20060113)
[  476.922172] ACPI: SSDT 00000000cff88f20 00085 (v01  PmRef  P003Cst 00003000 INTL 20060113)
[  476.922866] processor LNXCPU:02: registered as cooling_device2
[  476.923252] ACPI: SSDT 00000000cff88790 00235 (v01 DpgPmm  P004Ist 00000012 INTL 20060113)
[  476.923608] ACPI: SSDT 00000000cff88fb0 00085 (v01  PmRef  P004Cst 00003000 INTL 20060113)
[  476.924277] processor LNXCPU:03: registered as cooling_device3

with disabled 'Intel(R) C-STATE Tech':
[    9.440094] ACPI: SSDT 00000000cff880d0 00235 (v01 DpgPmm  P001Ist 00000011 INTL 20060113)
[    9.440625] processor LNXCPU:00: registered as cooling_device0
[    9.441014] ACPI: SSDT 00000000cff88310 00235 (v01 DpgPmm  P002Ist 00000012 INTL 20060113)
[    9.441511] processor LNXCPU:01: registered as cooling_device1
[    9.441888] ACPI: SSDT 00000000cff88550 00235 (v01 DpgPmm  P003Ist 00000012 INTL 20060113)
[    9.442394] processor LNXCPU:02: registered as cooling_device2
[    9.442778] ACPI: SSDT 00000000cff88790 00235 (v01 DpgPmm  P004Ist 00000012 INTL 20060113)
[    9.443276] processor LNXCPU:03: registered as cooling_device3

Also, BIOS tells that this setting enables/disables C2/C3/C4 states,
while dmesg tells about C-1/C-2/C-3.



Reply to: