On Thu, Sep 13, 2012 at 07:48:46AM -0400, Kamaraju S Kusumanchi wrote:
> 1) The man page of smartctl is not much helpful in figuring out what various
> fields in the output of "smartctl -a" stand for. For example, what does
> fields such as Raw_Read_Error_Rate, Seek_Error_Rate, Hardware_ECC_Recovered,
> Multi_Zone_Error_Rate mean? Is there any page that describes all these
> fields in good detail?
Actually, the man page has useful information, just perhaps not in a
useful format. Data, rather than information.
In particular, it details that values in the ATRRIBUTES table are
normalised (by the drive firmware) to a VALUE between 1 and 254. Also
noted is a THREShold value (between 0 and 254). If an attribute is *less
than or equal to* its threshold, it is deemed to have failed. The man
page also notes that if this is a "Pre-Fail" attribute (rather than an
"Old_age" attribute), then this indicates drive failure is imminent.
>
> 2) Can someone please tell me if this hard drive is dying. The following is
> the difference between two smartctl outputs that are a week apart.
Bearing the above in mind
>
> $diff smartctl_20120905 smartctl_20120912
> 14c14
> < Local Time is: Wed Sep 5 21:55:18 2012 EDT
> ---
> > Local Time is: Wed Sep 12 20:03:10 2012 EDT
> 58c58
ID ATTRIBUTE FLAG VALUE WORST THRESH Type
> < 1 Raw_Read_Error_Rate 0x000f 100 100 046 Pre-fail Always
> > 1 Raw_Read_Error_Rate 0x000f 100 100 046 Pre-fail Always
100 > 46, so this is fine
> < 4 Start_Stop_Count 0x0032 099 099 000 Old_age Always
> > 4 Start_Stop_Count 0x0032 099 099 000 Old_age Always
> < 7 Seek_Error_Rate 0x000e 100 100 000 Old_age Always
> > 7 Seek_Error_Rate 0x000e 100 100 000 Old_age Always
> < 9 Power_On_Seconds 0x0032 023 023 000 Old_age Always
> > 9 Power_On_Seconds 0x0032 023 023 000 Old_age Always
> < 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always
> > 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always
These are also all steady and above their threshold. Note, however, that
Power_On_Seconds is a little low. It's "Old_age" so when that reaches 0,
you don't need to worry, but you can probably expect that that's what
the drive manufacturer believes is the typical life of the drive.
> < 193 Load_Cycle_Count 0x0032 069 069 000 Old_age Always
> - 635379
> < 194 Temperature_Celsius 0x0022 100 090 000 Old_age Always
> - 51 (Min/Max 8/62)
Here we see that the temperature IS fine, but HAS BEEN slightly towards
failure (in other words, it probably got warm). This is nothing too
worry about.
> < 195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always
> - 9258
> ---
> > 193 Load_Cycle_Count 0x0032 069 069 000 Old_age Always
> - 635384
> > 194 Temperature_Celsius 0x0022 100 090 000 Old_age Always
> - 49 (Min/Max 8/62)
> > 195 Hardware_ECC_Recovered 0x001a 100 100 000 Old_age Always
> - 15601
> 76,77c76,77
> < 200 Multi_Zone_Error_Rate 0x000e 100 100 000 Old_age Always
> - 1038
> < 203 Run_Out_Cancel 0x0002 100 100 000 Old_age Always
> - 3732311179847
> ---
> > 200 Multi_Zone_Error_Rate 0x000e 100 100 000 Old_age Always
> - 5133
> > 203 Run_Out_Cancel 0x0002 100 100 000 Old_age Always
> - 2632791622157
And again, these are all well above their threshold.
>
> This is a hard drive in Dell Inspiron E1505 laptop. Any help is greatly
> appreciated.
>
According to SMART, your hard drive is fine.
Attachment:
signature.asc
Description: Digital signature