[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#808563: really should be serious bug: data loss



On 21/12/2015 17:20, 積丹尼 Dan Jacobson wrote:
ata1.00: exception Emask 0x60 SAct 0x1c SErr 0x800 action 0x6 frozen
ata1.00: irq_stat 0x20000000, host bus error
ata1: SError: { HostInt }
ata1.00: failed command: WRITE FPDMA QUEUED
ata1.00: cmd 61/00:10:00:a8:10/08:00:26:00:00/40 tag 2 ncq 1048576 out
          res 40/00:20:00:48:11/00:00:26:00:00/40 Emask 0x60 (host bus error)
ata1.00: status: { DRDY }
ata1.00: failed command: WRITE FPDMA QUEUED
ata1.00: cmd 61/00:18:00:b0:10/08:00:26:00:00/40 tag 3 ncq 1048576 out
          res 40/00:20:00:48:11/00:00:26:00:00/40 Emask 0x60 (host bus error)
ata1.00: status: { DRDY }
ata1.00: failed command: WRITE FPDMA QUEUED
ata1.00: cmd 61/98:20:00:48:11/03:00:26:00:00/40 tag 4 ncq 471040 out
          res 40/00:20:00:48:11/00:00:26:00:00/40 Emask 0x60 (host bus error)
ata1.00: status: { DRDY }
ata1: hard resetting link

I have a RAID1 (mdadm) with two SSD Samsung 850 EVO and these days i was affected by the same problem with linux-image-4.3.0-1-686-pae (4.3.3-2). During these errors the system is somehow not responsive for about 30-60 seconds but at the end the operation completes and all keep working without error or data loss. Mdadm doesn't notice nothing about these errors: the RAID doesn't get degraded. But it's quite annoying because happens frequently: about one time every few minutes.

An excerpt of my errors:
---------------------------------
dic 22 09:14:26 barone kernel: ata1.00: exception Emask 0x0 SAct 0x700 SErr 0x0 action 0x6 frozen
dic 22 09:14:26 barone kernel: ata1.00: failed command: WRITE FPDMA QUEUED
dic 22 09:14:26 barone kernel: ata1.00: cmd 61/20:40:00:b8:76/07:00:01:00:00/40 tag 8 ncq 933888 out
                                        res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
dic 22 09:14:26 barone kernel: ata1.00: status: { DRDY }
dic 22 09:14:26 barone kernel: ata1.00: failed command: WRITE FPDMA QUEUED
dic 22 09:14:26 barone kernel: ata1.00: cmd 61/e0:48:20:bf:76/08:00:01:00:00/40 tag 9 ncq 1163264 out
                                        res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
dic 22 09:14:26 barone kernel: ata1.00: status: { DRDY }
dic 22 09:14:26 barone kernel: ata1.00: failed command: WRITE FPDMA QUEUED
dic 22 09:14:26 barone kernel: ata1.00: cmd 61/98:50:00:18:77/0e:00:01:00:00/40 tag 10 ncq 1912832 out
                                        res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
dic 22 09:14:26 barone kernel: ata1.00: status: { DRDY }
dic 22 09:14:26 barone kernel: ata1: hard resetting link
---------------------------------

Like you i have an 686-pae 32 bit kernel. Have you got an SSD disk or a rotational disk?

This morning i've tried linux-image-4.4.0-rc5-686 (4.4~rc5-1~exp1) from experimental and everything works fine without errors. Tomorrow i'll try the pae version from experimental to see if it works too.

Cesare.


Reply to: