[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: HDD failing? :'(



Hi Ben,

Here is the "context". Hope it is enough/correct! 


>From /var/log/syslog.0

  Oct 13 13:36:33 nias -- MARK --
  Oct 13 13:47:40 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
  Oct 13 13:47:40 nias kernel: hdc: bad status at DMA end, dstat=8480
  Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00 
  Oct 13 13:47:41 nias kernel: Current sd0b:00: sns = 70  3
  Oct 13 13:47:41 nias kernel: ASC= 2 ASCQ= 0
  Oct 13 13:47:41 nias kernel: Raw sense data:0x70 0x00 0x03 0x00 0x00 0x00 0x00 0x0a 0x08 0x00 0x00 0x00 0x02 0x00 0x00 0x00 0x00 0x00 
  Oct 13 13:47:41 nias kernel:  I/O error: dev 0b:00, sector 64
  Oct 13 13:47:41 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
  Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: RRIP_1991A
  
  .....
  
  Oct 13 20:26:53 nias kernel: hda: bad status at DMA end, dstat=8400
  Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
  Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
  Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
  Oct 13 20:26:53 nias kernel: 
  Oct 13 20:26:53 nias kernel: hda: drive not ready for command
  Oct 13 20:26:53 nias kernel: hda: status timeout: status=0xd0 { Busy }
  Oct 13 20:26:53 nias kernel: 
  Oct 13 20:26:53 nias kernel: hda: drive not ready for command
  Oct 13 20:27:08 nias kernel: ide0: reset: success
  Oct 13 20:27:08 nias kernel: blk: queue c02de6a8, I/O limit 4095Mb (mask 0xffffffff)
  Oct 13 20:27:21 nias kernel: hda: irq timeout: status=0xd0 { Busy }
  Oct 13 20:27:21 nias kernel: 
  Oct 13 20:28:16 nias kernel: hda: DMA disabled
  Oct 13 20:28:16 nias kernel: ide0: reset: success
  Oct 13 20:30:36 nias kernel: hda: status timeout: status=0xd0 { Busy }
  Oct 13 20:30:36 nias kernel: 
  Oct 13 20:30:36 nias kernel: hda: no DRQ after issuing WRITE
  Oct 13 20:30:38 nias kernel: ide0: reset: success
  Oct 13 20:30:58 nias kernel: hda: irq timeout: status=0xd0 { Busy }
  Oct 13 20:30:58 nias kernel: 
  Oct 13 20:31:03 nias kernel: ide0: reset: success
  
  ( Here is where I run the Apple Hardware Tests in the CD )

And from var/log/kern.log

  Oct 12 21:25:53 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
  Oct 12 21:25:53 nias kernel: hdc: bad status at DMA end, dstat=8480
  Oct 12 21:25:53 nias kernel:  I/O error: dev 0b:00, sector 0
  Oct 12 21:25:53 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
  Oct 12 21:25:53 nias kernel: hdc: bad status at DMA end, dstat=8480
  Oct 12 21:25:53 nias kernel:  I/O error: dev 0b:00, sector 0

  ( Funny that the DVD errors also happened the day before :-? )

  Oct 13 13:47:40 nias kernel: hdc: timeout waiting ^I^I^Ifor dbdma command stop
  Oct 13 13:47:40 nias kernel: hdc: bad status at DMA end, dstat=8480
  Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00 
  Oct 13 13:47:41 nias kernel: Current sd0b:00: sns = 70  3
  Oct 13 13:47:41 nias kernel: ASC= 2 ASCQ= 0
  Oct 13 13:47:41 nias kernel: Raw sense data:0x70 0x00 0x03 0x00 0x00 0x00 0x00 0x0a 0x08 0x00 0x00 0x00 0x02 0x00 0x00 0x00 0x00 0x00 
  Oct 13 13:47:41 nias kernel:  I/O error: dev 0b:00, sector 64
  Oct 13 13:47:41 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
  Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 13:48:36 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 15:12:19 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:12:19 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 15:18:49 nias kernel: attempt to access beyond end of device
  Oct 13 15:18:49 nias kernel: 0b:00: rw=0, want=34, limit=2
  Oct 13 15:18:49 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
  Oct 13 15:21:25 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:21:25 nias kernel: ISOFS: changing to secondary root
  Oct 13 15:23:22 nias kernel: attempt to access beyond end of device
  Oct 13 15:23:22 nias kernel: 0b:00: rw=0, want=34, limit=2
  Oct 13 15:23:22 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
  Oct 13 15:23:24 nias kernel: attempt to access beyond end of device
  Oct 13 15:23:24 nias kernel: 0b:00: rw=0, want=34, limit=2
  Oct 13 15:23:24 nias kernel: isofs_read_super: bread failed, dev=0b:00, iso_blknum=16, block=16
  Oct 13 15:24:03 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:24:03 nias kernel: ISOFS: changing to secondary root
  Oct 13 15:34:06 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:34:06 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 15:35:06 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:35:06 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 15:43:15 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:43:15 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 15:48:10 nias kernel: sr0: CDROM (ioctl) reports ILLEGAL REQUEST.
  Oct 13 15:48:10 nias kernel: cdrom: open failed.
  Oct 13 15:48:29 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:48:29 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 15:55:37 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 15:55:37 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:05:21 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:05:21 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:09:39 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:09:39 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:14:54 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:14:54 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:20:47 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:20:47 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:25:49 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:25:49 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:31:06 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:31:06 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:36:30 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:36:30 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:41:41 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:41:41 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:46:58 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:46:58 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:50:22 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:50:22 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 16:54:12 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 16:54:12 nias kernel: ISO 9660 Extensions: RRIP_1991A
  Oct 13 19:29:18 nias kernel: sr0: CDROM (ioctl) reports ILLEGAL REQUEST.
  Oct 13 19:29:18 nias kernel: cdrom: open failed.
  Oct 13 19:29:31 nias kernel: sr0: CDROM (ioctl) reports ILLEGAL REQUEST.
  Oct 13 19:29:31 nias kernel: cdrom: open failed.
  Oct 13 19:29:56 nias kernel: ISO 9660 Extensions: Microsoft Joliet Level 3
  Oct 13 19:29:56 nias kernel: ISO 9660 Extensions: RRIP_1991A
  
  .....
  
  Oct 13 20:26:53 nias kernel: hda: bad status at DMA end, dstat=8400
  Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
  Oct 13 20:26:53 nias kernel: hda: timeout waiting for DMA
  Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
  Oct 13 20:26:53 nias kernel: 
  Oct 13 20:26:53 nias kernel: hda: drive not ready for command
  Oct 13 20:26:53 nias kernel: hda: status timeout: status=0xd0 { Busy }
  Oct 13 20:26:53 nias kernel: 
  Oct 13 20:26:53 nias kernel: hda: drive not ready for command
  Oct 13 20:27:08 nias kernel: ide0: reset: success
  Oct 13 20:27:08 nias kernel: blk: queue c02de6a8, I/O limit 4095Mb (mask 0xffffffff)
  Oct 13 20:27:21 nias kernel: hda: irq timeout: status=0xd0 { Busy }
  Oct 13 20:27:21 nias kernel: 
  Oct 13 20:28:16 nias kernel: hda: DMA disabled
  Oct 13 20:28:16 nias kernel: ide0: reset: success
  Oct 13 20:30:36 nias kernel: hda: status timeout: status=0xd0 { Busy }
  Oct 13 20:30:36 nias kernel: 
  Oct 13 20:30:36 nias kernel: hda: no DRQ after issuing WRITE
  Oct 13 20:30:38 nias kernel: ide0: reset: success
  Oct 13 20:30:58 nias kernel: hda: irq timeout: status=0xd0 { Busy }
  Oct 13 20:30:58 nias kernel: 
  Oct 13 20:31:03 nias kernel: ide0: reset: success
  
  ( And here is the same reboot point )
  

So basically, there are two things... stuff happening on HDC (the
superdrive) at around 13:XX and stuff on HDA, my HDD, at around 20:XX.

I am also sending the smartclt -a /dev/hda4 results, in case someone
wiser than me knows how to read more stuff than I do in them! :)

  smartctl version 5.1-18 Copyright (C) 2002-3 Bruce Allen
  Home page is http://smartmontools.sourceforge.net/
  
  === START OF INFORMATION SECTION ===
  Device Model:     FUJITSU MHS2060AT
  Serial Number:    NL24T3114CF3
  Firmware Version: 8105
  Device is:        Not in smartctl database [for details use: -P showall]
  ATA Version is:   6
  ATA Standard is:  ATA/ATAPI-6 T13 1410D revision 3a
  Local Time is:    Tue Oct 14 10:49:41 2003 EST
  SMART support is: Available - device has SMART capability.
  SMART support is: Enabled
  
  === START OF READ SMART DATA SECTION ===
  SMART overall-health self-assessment test result: PASSED
  
  General SMART Values:
  Off-line data collection status: (0x02)	Offline data collection activity was
  					completed without error.
  					Auto Off-line Data Collection: Disabled.
  Self-test execution status:      (   0)	The previous self-test routine completed
  					without error or no self-test has ever 
  					been run.
  Total time to complete off-line 
  data collection: 		 ( 492) seconds.
  Offline data collection
  capabilities: 			 (0x7b) SMART execute Offline immediate.
  					Automatic timer ON/OFF support.
  					Suspend Offline collection upon new
  					command.
  					Offline surface scan supported.
  					Self-test supported.
  					Conveyance Self-test supported.
  					Selective Self-test supported.
  SMART capabilities:            (0x0003)	Saves SMART data before entering
  					power-saving mode.
  					Supports SMART auto save timer.
  Error logging capability:        (0x01)	Error logging supported.
  					No General Purpose Logging support.
  Short self-test routine 
  recommended polling time: 	 (   2) minutes.
  Extended self-test routine
  recommended polling time: 	 (  83) minutes.
  Conveyance self-test routine
  recommended polling time: 	 (   2) minutes.
  
  SMART Attributes Data Structure revision number: 16
  Vendor Specific SMART Attributes with Thresholds:
  ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate     0x000f   100   100   046    Pre-fail  Always       -       191114
    2 Throughput_Performance  0x0005   100   100   030    Pre-fail  Offline      -       292
    3 Spin_Up_Time            0x0003   100   100   025    Pre-fail  Always       -       25601
    4 Start_Stop_Count        0x0032   099   099   000    Old_age   Always       -       382
    5 Reallocated_Sector_Ct   0x0033   080   080   024    Pre-fail  Always       -       497
    7 Seek_Error_Rate         0x000f   100   100   047    Pre-fail  Always       -       776
    8 Seek_Time_Performance   0x0005   100   100   019    Pre-fail  Offline      -       0
    9 Power_On_Hours          0x0032   068   068   000    Old_age   Always       -       17366468
   10 Spin_Retry_Count        0x0013   100   100   020    Pre-fail  Always       -       0
   12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       251
  192 Power-Off_Retract_Count 0x0032   099   099   000    Old_age   Always       -       28
  193 Load_Cycle_Count        0x0032   049   049   000    Old_age   Always       -       189502
  194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       39 (Lifetime Min/Max 21/52)
  195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       9645
  196 Reallocated_Event_Count 0x0032   080   080   000    Old_age   Always       -       480
  197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
  198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
  199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
  200 Multi_Zone_Error_Rate   0x000f   084   071   060    Pre-fail  Always       -       2898836
  203 Run_Out_Cancel          0x0002   100   100   000    Old_age   Always       -       3732310457836
  
  SMART Error Log Version: 1
  No Errors Logged
  
  SMART Self-test log structure revision number 1
  No self-tests have been logged
  



Thanks in advance!


-- 
J. Javier Maestro
<jjmaestro@computer.org>
http://rigel.homelinux.com


On Oct Mon 13 2003 20:28, Benjamin Herrenschmidt wrote:
> 
> > 
> > as:~# grep -i error /var/log/syslog
> > Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00
> > Oct 13 13:47:41 nias kernel:  I/O error: dev 0b:00, sector 64
> > Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
> > 
> > nias:~# grep -i error /var/log/kern.log
> > Oct 12 21:25:53 nias kernel:  I/O error: dev 0b:00, sector 0
> > Oct 12 21:25:53 nias kernel:  I/O error: dev 0b:00, sector 0
> > Oct 13 13:47:41 nias kernel: scsi0: ERROR on channel 0, id 0, lun 0, CDB: 0x03 00 00 00 40 00
> > Oct 13 13:47:41 nias kernel:  I/O error: dev 0b:00, sector 64
> > Oct 13 20:26:53 nias kernel: hda: status error: status=0x58 { DriveReady SeekComplete DataRequest }
> 
> can you give some context around the HD errors ?
> 
> Ben.



Reply to: