
Bug#678443: Hard lockups due to "lockup-detector" (NMIs) on multi-Pentium-3 SMP systems on all kernel builds since 2.6.38



Hello,

thanks for your reply. Due to a lot of work "at work", I have not yet managed to report the bug, but I will do so soon.

Today I want to post my current uptime and interrupt state one last time, as I may have to power down the system in a few days for maintenance (and in any case I want to put an end to the compelled uptime watching related to this bug). Besides the uninterrupted uptime, the complete system and all running tasks have worked absolutely flawlessly over this period. Well, that is what I expect from a Linux operating system as long as no very risky software is running, but it also confirms that the hardware really has no problems and that my problems were only related to the "lockup detector". Even the number of shared interrupts and their dependence on the APIC system and on correct driver implementations does not hurt. The last kernel messages were logged on 17 July, and those were only link down/up messages due to a switch reboot...


netfinity5000:~$ uptime
 17:14:06 up 46 days, 14 min,  2 users,  load average: 0,05, 0,06, 0,05


netfinity5000:~$ cat /proc/interrupts
           CPU0       CPU1
  0:         49          0   IO-APIC-edge      timer
  1:          3          0   IO-APIC-edge      i8042
  6:          3          0   IO-APIC-edge      floppy
  7:          1          0   IO-APIC-edge      parport0
  8:          0          0   IO-APIC-edge      rtc0
  9:          0          0   IO-APIC-fasteoi   acpi
 12:          1          3   IO-APIC-edge      i8042
 14:         42         74   IO-APIC-edge      ata_generic
 15:          0          0   IO-APIC-edge      ata_generic
 16:         49         48   IO-APIC-fasteoi   aic7xxx, aic7xxx
 17:  154500925  154495377   IO-APIC-fasteoi   eth0
 18:    2657528    2728937   IO-APIC-fasteoi   megaraid, ohci_hcd:usb2
 19:   69807511   69703638   IO-APIC-fasteoi   eth1
 22:   91578533   91635430   IO-APIC-fasteoi   ehci_hcd:usb1, ohci_hcd:usb3, ohci_hcd:usb4, eth2, eth3
NMI:          1          1   Non-maskable interrupts
LOC:  262393426  323398808   Local timer interrupts
SPU:          0          0   Spurious interrupts
PMI:          0          0   Performance monitoring interrupts
IWI:          0          0   IRQ work interrupts
RTR:          2          0   APIC ICR read retries
RES:    6791711    6755464   Rescheduling interrupts
CAL:    1231644    1607457   Function call interrupts
TLB:     859984     805603   TLB shootdowns
TRM:          0          0   Thermal event interrupts
THR:          0          0   Threshold APIC interrupts
MCE:          0          0   Machine check exceptions
MCP:      13251      13251   Machine check polls
ERR:          0
MIS:          0
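
For anyone who wants to watch the NMI line on a similar machine without reading the whole table each time, the per-CPU counters can be pulled out with a few lines of Python. This is only a minimal sketch, assuming the /proc/interrupts layout shown above; the function name parse_interrupts and the SAMPLE text are made up for illustration.

```python
def parse_interrupts(text):
    """Return {label: [count per CPU]} from /proc/interrupts-style text."""
    lines = text.splitlines()
    if not lines:
        return {}
    ncpu = len(lines[0].split())  # header row: CPU0 CPU1 ...
    counts = {}
    for line in lines[1:]:
        # The label is everything before the first colon ("17", "NMI", ...).
        label, sep, rest = line.partition(":")
        if not sep:
            continue
        per_cpu = []
        for field in rest.split()[:ncpu]:
            if not field.isdigit():
                break  # reached the description column
            per_cpu.append(int(field))
        counts[label.strip()] = per_cpu
    return counts


# Hypothetical excerpt in the same layout, used when /proc is unavailable.
SAMPLE = """\
           CPU0       CPU1
  0:         49          0   IO-APIC-edge      timer
 17:  154500925  154495377   IO-APIC-fasteoi   eth0
NMI:          1          1   Non-maskable interrupts
ERR:          0
"""

if __name__ == "__main__":
    try:
        with open("/proc/interrupts") as f:
            text = f.read()
    except OSError:
        text = SAMPLE
    print("NMI per CPU:", parse_interrupts(text).get("NMI"))
```

Running it twice some time apart and comparing the NMI counts shows whether NMIs are still being delivered; in the table above they stay at 1 per CPU.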


netfinity5000:~$ free
             total       used       free     shared    buffers     cached
Mem:       2074804    1340228     734576          0     294672     805404
-/+ buffers/cache:     240152    1834652
Swap:      1943860          0    1943860


Best regards,

Hans-Juergen Mauser

