Re: ibook g4 hard drive failures


On Mon, Oct 31, 2005 at 11:56:02AM +0100, Emanuele Olivetti wrote:
> What about using a live cd and smartmontools? I had a similar
> problem. Just 'smartctl -a /dev/hda' is enough to get lots of info
> about the status of the hd and its problems.

good idea.

> In my case a few sectors died, but they were (probably) in the partition
> map, so my partition map disappeared and my partitions were lost. It
> was not a big issue, since I had to repartition around then anyway and
> everything was backed up. After some tries (a few hours?) the bad sectors
> were remapped by the drive itself, and now my disk seems totally clean
> (no more bad sectors detected). Since no disk utility can detect any
> anomalies, I think my disk is safe now.

looks like mine has something worse. for the record, here is the output
of 'smartctl -a /dev/hda'.

cheers, paul

smartctl version 5.32 Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Device Model:     TOSHIBA MK3025GAS
Serial Number:    54B76586T
Firmware Version: KA300B
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   6
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Mon Oct 31 11:34:03 2005 UTC
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		 ( 149) seconds.
Offline data collection
capabilities: 			 (0x1b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					No General Purpose Logging support.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  30) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
  1 Raw_Read_Error_Rate     0x000b   100   100   050    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0005   100   100   050    Pre-fail  Offline      -       0
  3 Spin_Up_Time            0x0027   100   100   001    Pre-fail  Always       -       769
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       9011
  5 Reallocated_Sector_Ct   0x0033   001   001   050    Pre-fail  Always   FAILING_NOW 1023
  7 Seek_Error_Rate         0x000b   100   100   050    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   100   100   050    Pre-fail  Offline      -       0
  9 Power_On_Hours          0x0032   088   088   000    Old_age   Always       -       4919
 10 Spin_Retry_Count        0x0033   253   100   030    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       1098
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       69
193 Load_Cycle_Count        0x0032   093   093   000    Old_age   Always       -       75583
194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       40 (Lifetime Min/Max 15/60)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       247
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   253   000    Old_age   Always       -       0
220 Disk_Shift              0x0002   100   100   000    Old_age   Always       -       150
222 Loaded_Hours            0x0032   091   091   000    Old_age   Always       -       3898
223 Load_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
224 Load_Friction           0x0022   100   100   000    Old_age   Always       -       0
226 Load-in_Time            0x0026   100   100   000    Old_age   Always       -       129
240 Head_Flying_Hours       0x0001   100   100   001    Pre-fail  Offline      -       0
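
For anyone who wants to scan a table like the one above without reading it by eye, here is a minimal Python sketch. It assumes the ten-column layout shown (ID, name, flag, value, worst, threshold, type, updated, when-failed, raw value) and treats an attribute as failing when its normalized value is at or below a non-zero threshold, which is my reading of how smartctl decides FAILING_NOW. The names SmartAttr, parse_attr_line and failing_attrs are made up for the example; they are not part of smartmontools.

```python
from typing import List, NamedTuple, Optional


class SmartAttr(NamedTuple):
    attr_id: int
    name: str
    value: int        # normalized current value
    worst: int        # normalized worst value seen so far
    thresh: int       # failure threshold (0 means "never fails")
    when_failed: str  # "-" or e.g. "FAILING_NOW"
    raw: str          # raw value, kept as text (may contain extra notes)


def parse_attr_line(line: str) -> Optional[SmartAttr]:
    """Parse one row of the attribute table; return None for other lines."""
    parts = line.split(None, 9)  # the raw value may contain spaces
    if len(parts) < 10 or not parts[0].isdigit():
        return None
    return SmartAttr(
        attr_id=int(parts[0]),
        name=parts[1],
        value=int(parts[3]),
        worst=int(parts[4]),
        thresh=int(parts[5]),
        when_failed=parts[8],
        raw=parts[9].strip(),
    )


def failing_attrs(lines: List[str]) -> List[SmartAttr]:
    """Attributes whose normalized value is at or below a non-zero threshold."""
    attrs = [a for a in map(parse_attr_line, lines) if a is not None]
    return [a for a in attrs if a.thresh > 0 and a.value <= a.thresh]


# Three rows copied from the table above.
SAMPLE = [
    "  5 Reallocated_Sector_Ct   0x0033   001   001   050    Pre-fail  Always   FAILING_NOW 1023",
    " 10 Spin_Retry_Count        0x0033   253   100   030    Pre-fail  Always       -       0",
    "194 Temperature_Celsius     0x0022   100   100   000    Old_age   Always       -       40 (Lifetime Min/Max 15/60)",
]

if __name__ == "__main__":
    for attr in failing_attrs(SAMPLE):
        print(attr.attr_id, attr.name, attr.value, "<=", attr.thresh, "raw", attr.raw)
```

On these rows only attribute 5 is flagged: its value of 1 is far below the threshold of 50, which matches the FAILING_NOW marker in the table.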

Warning: device does not support General Purpose Logging
SMART Error Log Version: 1
ATA Error Count: 389 (device log contains only the most recent five errors)
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
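
The 49.710-day wrap quoted above is what one would expect from a 32-bit millisecond counter. That is an assumption on my part; the output does not say how the value is stored. A small Python sketch of the arithmetic, with a hypothetical formatter that only approximates the DDd+hh:mm:SS.sss layout:

```python
# Assumption: Powered_Up_Time is an unsigned 32-bit count of milliseconds.
WRAP_MS = 2 ** 32
MS_PER_DAY = 24 * 60 * 60 * 1000


def wrap_period_days() -> float:
    """Days until a 32-bit millisecond counter rolls over."""
    return WRAP_MS / MS_PER_DAY


def format_powered_up(ms: int) -> str:
    """Format milliseconds roughly as DDd+hh:mm:SS.sss, wrapping at 2**32 ms."""
    ms %= WRAP_MS
    days, rem = divmod(ms, MS_PER_DAY)
    hours, rem = divmod(rem, 60 * 60 * 1000)
    minutes, rem = divmod(rem, 60 * 1000)
    seconds, millis = divmod(rem, 1000)
    prefix = f"{days}d+" if days else ""  # day part omitted when zero
    return f"{prefix}{hours:02d}:{minutes:02d}:{seconds:02d}.{millis:03d}"


if __name__ == "__main__":
    print(f"{wrap_period_days():.3f} days")
    print(format_powered_up(895742))
```

Here 895742 ms is 14 minutes, 55 seconds and 742 milliseconds, the Powered_Up_Time of the IDENTIFY DEVICE command listed under error 389.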

Error 389 occurred at disk power-on lifetime: 4918 hours (204 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  -- -- -- -- -- -- --
  04 11 00 85 5a 67 e3

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ec 00 00 9c 5b 67 e0 00      00:14:55.742  IDENTIFY DEVICE
  35 00 d8 c5 5a 67 e0 00      00:13:58.386  WRITE DMA EXT
  35 00 d0 f5 59 67 e0 00      00:13:58.382  WRITE DMA EXT
  35 00 b8 3d 58 67 e0 00      00:13:58.277  WRITE DMA EXT
  35 00 e8 55 57 67 e0 00      00:13:58.173  WRITE DMA EXT

Error 388 occurred at disk power-on lifetime: 4918 hours (204 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  -- -- -- -- -- -- --
  04 51 00 74 5b 67 e0  Error: ABRT at LBA = 0x00675b74 = 6773620

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  35 00 b0 c5 57 67 e0 00      00:26:53.269  WRITE DMA EXT
  35 00 b0 c5 57 67 e0 00      00:26:35.999  WRITE DMA EXT
  10 00 3f 00 00 00 e0 00      00:26:35.996  RECALIBRATE [OBS-4]
  35 00 b0 c5 57 67 e0 00      00:26:27.783  WRITE DMA EXT
  35 00 b0 c5 57 67 e0 00      00:26:21.904  WRITE DMA EXT

Error 387 occurred at disk power-on lifetime: 4918 hours (204 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  -- -- -- -- -- -- --
  04 51 00 74 5b 67 e0  Error: ABRT at LBA = 0x00675b74 = 6773620

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  35 00 b0 c5 57 67 e0 00      00:26:35.999  WRITE DMA EXT
  10 00 3f 00 00 00 e0 00      00:26:35.996  RECALIBRATE [OBS-4]
  35 00 b0 c5 57 67 e0 00      00:26:27.783  WRITE DMA EXT
  35 00 b0 c5 57 67 e0 00      00:26:21.904  WRITE DMA EXT
  35 00 e0 e5 56 67 e0 00      00:26:21.812  WRITE DMA EXT

Error 386 occurred at disk power-on lifetime: 4918 hours (204 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  -- -- -- -- -- -- --
  04 51 00 74 5b 67 e0  Error: ABRT at LBA = 0x00675b74 = 6773620

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  35 00 b0 c5 57 67 e0 00      00:26:21.904  WRITE DMA EXT
  35 00 e0 e5 56 67 e0 00      00:26:21.812  WRITE DMA EXT
  35 00 e0 e5 56 67 e0 00      00:26:12.553  WRITE DMA EXT
  25 00 08 00 00 00 e0 00      00:25:52.921  READ DMA EXT
  25 00 08 38 3e 7e e0 00      00:25:52.921  READ DMA EXT

Error 385 occurred at disk power-on lifetime: 4918 hours (204 days + 22 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  -- -- -- -- -- -- --
  04 51 00 c4 57 67 e0  Error: ABRT at LBA = 0x006757c4 = 6772676

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  35 00 e0 e5 56 67 e0 00      00:26:12.553  WRITE DMA EXT
  25 00 08 00 00 00 e0 00      00:25:52.921  READ DMA EXT
  25 00 08 38 3e 7e e0 00      00:25:52.921  READ DMA EXT
  25 00 08 30 3e 7e e0 00      00:25:52.921  READ DMA EXT
  25 00 08 20 3e 7e e0 00      00:25:52.921  READ DMA EXT
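
The LBA values that smartctl prints in the entries above can be rebuilt from the register dump. This Python sketch assumes the seven bytes after each error are in the order ER ST SC SN CL CH DH (the abbreviations from the legend earlier in the log) and uses the standard 28-bit layout: the low four bits of DH, then CH, CL and SN. The function names are made up for the example.

```python
ABRT_BIT = 0x04  # bit 2 of the ATA error register: command aborted


def lba28_from_registers(sn: int, cl: int, ch: int, dh: int) -> int:
    """Combine the 28-bit LBA from its four register bytes."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


def decode_error_row(row: str) -> dict:
    """Decode a row such as '04 51 00 74 5b 67 e0' (assumed ER ST SC SN CL CH DH)."""
    er, st, sc, sn, cl, ch, dh = (int(tok, 16) for tok in row.split()[:7])
    return {
        "aborted": bool(er & ABRT_BIT),
        "status": st,
        "sector_count": sc,
        "lba": lba28_from_registers(sn, cl, ch, dh),
    }


if __name__ == "__main__":
    for row in ("04 51 00 74 5b 67 e0", "04 51 00 c4 57 67 e0"):
        info = decode_error_row(row)
        print(row, "->", hex(info["lba"]), info["lba"], "ABRT" if info["aborted"] else "")
```

The two rows decode to 0x675b74 = 6773620 and 0x6757c4 = 6772676, the same addresses smartctl reports for errors 386 to 388 and for error 385, so the aborted writes are all hitting a small region of the disk.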

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

Device does not support Selective Self Tests/Logging

