
Re: smartctl test error



On Wed, 18 Aug 2010 10:56:07 +0530, vishnuvardhan wrote:

> *I have run the following command and it is showing the following error
> for all the disks [ hda, hda1, hda2, hda5 ]* : 
> # smartctl -l selftest /dev/hda

Not errors "per se". It's a report of the disk activity.

(...)
 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
                                       ^^^^^^^^^^^^^^^^^^^
That is important. It means your disk is not dying "right now" :-)

> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE UPDATED  WHEN_FAILED RAW_VALUE
>    1 Raw_Read_Error_Rate     0x000b   100   100   062    Pre-fail Always       -       0
>    2 Throughput_Performance  0x0005   105   105   040    Pre-fail Offline      -       4573
>    3 Spin_Up_Time            0x0007   253   253   033    Pre-fail Always       -       1
>    4 Start_Stop_Count        0x0012   093   093   000    Old_age Always       -       11195
>    5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail Always       -       0
>    7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail Always       -       0
>    8 Seek_Time_Performance   0x0005   122   122   040    Pre-fail Offline      -       39
>    9 Power_On_Hours          0x0012   065   065   000    Old_age Always       -       15650
>   10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail Always       -       0
>   12 Power_Cycle_Count       0x0032   097   097   000    Old_age Always       -       6195
> 191 G-Sense_Error_Rate      0x000a   099   099   000    Old_age Always       -       3
> 192 Power-Off_Retract_Count 0x0032   097   097   000    Old_age Always       -       247202527
> 193 Load_Cycle_Count        0x0012   089   089   000    Old_age Always       -       114999
> 194 Temperature_Celsius     0x0002   157   157   000    Old_age Always       -       35 (Lifetime Min/Max 17/50) 
> 196 Reallocated_Event_Count 0x0032   100   100   000    Old_age Always       -       132
> 197 Current_Pending_Sector  0x0022   100   100   000    Old_age Always       -       0
> 198 Offline_Uncorrectable   0x0008   100   100   000    Old_age Offline      -       0
> 199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age Always       -       0

The data in the above table is also important. Unless you see the raw 
values of the "Pre-fail" attributes growing over time (or their normalized 
VALUE dropping toward THRESH), you should not worry. Needless to say, 
keeping a good backup is a must these days, regardless of the health 
status of the disks.
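
If you want to keep an eye on those attributes without reading the table by 
hand, something along these lines might help. It is only a rough sketch: the 
sample rows are copied from your report, while the regular expression and the 
helper names (parse_attributes, prefail_margins) are my own invention and not 
part of smartmontools. It relies on the usual rule that an attribute counts as 
failed once its normalized VALUE is less than or equal to THRESH.

```python
import re

# A few rows copied from the report quoted above (quote markers removed).
SAMPLE = """\
  1 Raw_Read_Error_Rate     0x000b   100   100   062    Pre-fail Always       -       0
  5 Reallocated_Sector_Ct   0x0033   100   100   005    Pre-fail Always       -       0
  9 Power_On_Hours          0x0012   065   065   000    Old_age Always       -       15650
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail Always       -       0
"""

# Hypothetical pattern for one attribute row:
# ID, name, flag, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]+\s+(\d+)\s+(\d+)\s+(\d+)"
    r"\s+(\S+)\s+\S+\s+\S+\s+(.*)$"
)


def parse_attributes(text):
    """Turn attribute rows into dictionaries; lines that do not match are skipped."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            ident, name, value, worst, thresh, kind, raw = match.groups()
            rows.append({
                "id": int(ident),
                "name": name,
                "value": int(value),
                "worst": int(worst),
                "thresh": int(thresh),
                "type": kind,
                "raw": raw.strip(),
            })
    return rows


def prefail_margins(rows):
    """Map each Pre-fail attribute name to VALUE - THRESH."""
    return {r["name"]: r["value"] - r["thresh"]
            for r in rows if r["type"] == "Pre-fail"}


rows = parse_attributes(SAMPLE)
margins = prefail_margins(rows)
for name, margin in margins.items():
    # A margin of zero or less means VALUE has reached THRESH.
    status = "FAILING" if margin <= 0 else "ok"
    print(f"{name:24} margin={margin:4d}  {status}")
```

Saving the output from time to time and comparing the margins is the simplest 
way to notice a trend; a margin that keeps shrinking deserves attention well 
before it reaches zero.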

> Error 191 occurred at disk power-on lifetime: 10995 hours (458 days + 3
> hours)

(...)

> Error 189 occurred at disk power-on lifetime: 10995 hours (458 days + 3
> hours)

(...)

> Error 188 occurred at disk power-on lifetime: 10995 hours (458 days + 3
> hours)

(...)
 
> Error 187 occurred at disk power-on lifetime: 10995 hours (458 days + 3
> hours)

Read those as "logged events", not warnings. Modern hard disks record such 
events and try to correct the errors on the fly (e.g., bad sectors are 
remapped whenever possible). 
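
One thing worth checking is how old those entries are. The small sketch below 
compares the power-on lifetime printed in each error entry with the 
Power_On_Hours raw value from your attribute table (15650). It assumes that raw 
value is expressed in hours on this drive, which is common but not guaranteed; 
the error_ages helper is just a name I made up for illustration.

```python
import re

# RAW_VALUE of attribute 9 (Power_On_Hours) in the table quoted earlier.
# Assumption: this drive reports the raw value in hours.
POWER_ON_HOURS = 15650

# Error entries copied from the quoted report (quote markers removed, lines rejoined).
LOG = """\
Error 191 occurred at disk power-on lifetime: 10995 hours (458 days + 3 hours)
Error 189 occurred at disk power-on lifetime: 10995 hours (458 days + 3 hours)
Error 188 occurred at disk power-on lifetime: 10995 hours (458 days + 3 hours)
Error 187 occurred at disk power-on lifetime: 10995 hours (458 days + 3 hours)
"""

ENTRY = re.compile(r"Error (\d+) occurred at disk power-on lifetime: (\d+) hours")


def error_ages(log_text, power_on_hours):
    """Return (error number, power-on hours elapsed since it was logged) pairs."""
    return [(int(number), power_on_hours - int(hours))
            for number, hours in ENTRY.findall(log_text)]


ages = error_ages(LOG, POWER_ON_HOURS)
for number, age in ages:
    print(f"Error {number}: logged {age} power-on hours ago")
```

With the figures from your report, all four entries date from 4655 power-on 
hours before the report was taken, so they describe an old incident rather 
than something happening now.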

> *Can somebody please guide me how to proceed or the above errors are
> normal ?*

There are some "how to read smartctl results" articles out there. For example:

Linux Harddisk Monitoring with SmartMonTools (smartctl)
http://www.captain.at/howto-linux-smartmontools-smartctl.php

Monitoring Hard Disks with SMART
http://www.linuxjournal.com/magazine/monitoring-hard-disks-smart

BTW, I keep the smartmontools daemon (smartd) running to monitor the disks' status.

Greetings,

-- 
Camaleón

