IDE lost interrupt followed by lock-up
Hi, I've had sporadic problems with errors such as the following:
Aug 2 05:09:10 victoria kernel: [616687.192331] ata1: lost interrupt (Status 0x50)
Aug 2 05:09:10 victoria kernel: [616687.192467] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Aug 2 05:09:10 victoria kernel: [616687.192510] ata1.00: failed command: WRITE DMA EXT
Aug 2 05:09:10 victoria kernel: [616687.192579] ata1.00: cmd 35/00:08:22:5e:00/00:00:14:00:00/e0 tag 0 dma 4096 out
Aug 2 05:09:10 victoria kernel: [616687.192594] res 40/00:00:01:4f:c2/00:00:00:00:00/10 Emask 0x4 (timeout)
Aug 2 05:09:10 victoria kernel: [616687.192628] ata1.00: status: { DRDY }
Aug 2 05:09:10 victoria kernel: [616687.192763] ata1: soft resetting link
Aug 2 05:09:10 victoria kernel: [616687.392871] ata1.00: configured for UDMA/33
Aug 2 05:09:10 victoria kernel: [616687.409520] ata1.01: configured for UDMA/33
Aug 2 05:18:57 victoria kernel: [617275.303057] ata1: lost interrupt (Status 0x50)
Aug 2 05:18:58 victoria kernel: [617275.303196] ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Aug 2 05:18:58 victoria kernel: [617275.303239] ata1.00: failed command: READ DMA EXT
Aug 2 05:18:58 victoria kernel: [617275.303309] ata1.00: cmd 25/00:08:d2:2a:79/00:00:13:00:00/e0 tag 0 dma 4096 in
Aug 2 05:18:58 victoria kernel: [617275.303324] res 40/00:00:01:4f:c2/00:00:00:00:00/10 Emask 0x4 (timeout)
Aug 2 05:18:58 victoria kernel: [617275.303357] ata1.00: status: { DRDY }
Aug 2 05:18:58 victoria kernel: [617275.303492] ata1: soft resetting link
Aug 2 05:18:58 victoria kernel: [617275.499876] ata1.00: configured for UDMA/33
Aug 2 05:18:58 victoria kernel: [617275.515681] ata1.01: configured for UDMA/33
The machine then locks up, and the power has to be disconnected before
the power button will restart it.
I haven't found where ata1.00 and ata1.01 are defined as the primary and
secondary IDE controllers, although I assume ata1.00 is the primary
controller and ata1.01 is the secondary controller.
This problem appears to be related to the PC being in a cool environment
(e.g. 10 degrees Celsius). It has occurred almost daily for a while, then
not at all for nearly a week.
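One way to check the apparent link with temperature would be to pull the
"lost interrupt" timestamps out of the syslog and count them per day, so
the counts can be set against room temperature readings. The following is
only a rough sketch: the function names and the regular expression are my
own, the pattern assumes the syslog line format shown above, and the sample
lines are copied from the log in this post.

```python
import re
from collections import Counter

# Assumed line format (taken from the log excerpt in this post):
#   "Aug 2 05:09:10 victoria kernel: [616687.192331] ata1: lost interrupt ..."
LOST_IRQ = re.compile(
    r"^(?P<month>\w{3})\s+(?P<day>\d{1,2})\s+(?P<time>\d{2}:\d{2}:\d{2})\s+"
    r"\S+\s+kernel:\s+\[\s*(?P<uptime>\d+\.\d+)\]\s+"
    r"(?P<port>ata\d+): lost interrupt"
)


def lost_interrupts(lines):
    """Return (month, day, time, port, uptime_seconds) for each lost-interrupt line."""
    events = []
    for line in lines:
        m = LOST_IRQ.match(line)
        if m:
            events.append(
                (m["month"], int(m["day"]), m["time"], m["port"], float(m["uptime"]))
            )
    return events


def per_day(events):
    """Count lost-interrupt events per (month, day)."""
    return Counter((month, day) for month, day, *_ in events)


# Sample lines copied from the log in this post.
sample = [
    "Aug 2 05:09:10 victoria kernel: [616687.192331] ata1: lost interrupt (Status 0x50)",
    "Aug 2 05:09:10 victoria kernel: [616687.192763] ata1: soft resetting link",
    "Aug 2 05:18:57 victoria kernel: [617275.303057] ata1: lost interrupt (Status 0x50)",
]

events = lost_interrupts(sample)
counts = per_day(events)
for (month, day), n in sorted(counts.items()):
    print(month, day, n)
```

To run it over a real log, the `sample` list could be replaced with the
lines of the syslog file (the path varies by distribution).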
The machine is old (1998 era 440BX with PII-266) but has otherwise been
reliable.
# lspci
00:00.0 Host bridge: Intel Corporation 440BX/ZX/DX - 82443BX/ZX/DX Host bridge (rev 02)
00:01.0 PCI bridge: Intel Corporation 440BX/ZX/DX - 82443BX/ZX/DX AGP bridge (rev 02)
00:07.0 ISA bridge: Intel Corporation 82371AB/EB/MB PIIX4 ISA (rev 02)
00:07.1 IDE interface: Intel Corporation 82371AB/EB/MB PIIX4 IDE (rev 01)
00:07.2 USB Controller: Intel Corporation 82371AB/EB/MB PIIX4 USB (rev 01)
00:07.3 Bridge: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 02)
00:08.0 SCSI storage controller: Adaptec (formerly DPT) SmartCache/Raid I-IV Controller (rev 02)
00:09.0 Multimedia audio controller: Creative Labs SB Audigy (rev 04)
00:09.1 Input device controller: Creative Labs SB Audigy Game Port (rev 04)
00:09.2 FireWire (IEEE 1394): Creative Labs SB Audigy FireWire Port (rev 04)
00:0a.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)
01:00.0 VGA compatible controller: ATI Technologies Inc RV280 [Radeon 9200 SE] (rev 01)
01:00.1 Display controller: ATI Technologies Inc RV280 [Radeon 9200 SE] (Secondary) (rev 01)
Any suggestions for pinpointing this problem are welcome.
Arthur.