
Re: output of smartctl -a



Frédéric Massot wrote:
rudu wrote:
Hello list,
Could someone help me analyse the output of:
# smartctl -a /dev/hda
(file attached)
?

I can't make head or tail of it...

In the "SMART Attributes" table, the "VALUE" column shows the current value, the "WORST" column the lowest value ever reached, and "THRESH" the lower limit that must not be crossed.

On the UDMA_CRC_Error_Count line, the worst value reached is "001" against a threshold of "000"; the current value is "200".

Conclusion: you have had serious problems with transfers in DMA or UDMA mode. The five error log entries show the same thing: problems reading and writing in DMA mode.
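
The reading of the table described above can be sketched in a few lines of Python. This is only an illustration, not part of smartmontools: the names SAMPLE, parse_attributes and closest_to_threshold are made up for the example, and the three rows are copied from the attached output. It ranks attributes by how close WORST has come to THRESH.

```python
# Three rows copied from the attached smartctl output (illustrative subset).
SAMPLE = """\
  1 Raw_Read_Error_Rate     0x000e   057   053   025    Old_age   Always       -       32473688
 10 Spin_Retry_Count        0x0012   100   099   097    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x003e   200   001   000    Old_age   Always       -       36496
"""


def parse_attributes(text):
    """Parse attribute rows (hypothetical helper, assumes the 10-column layout)."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Skip headers and anything that does not look like an attribute row.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        rows.append({
            "id": int(fields[0]),
            "name": fields[1],
            "value": int(fields[3]),
            "worst": int(fields[4]),
            "thresh": int(fields[5]),
            "raw": fields[9],
        })
    return rows


def closest_to_threshold(rows):
    """Sort rows so the smallest WORST - THRESH margin comes first."""
    return sorted(rows, key=lambda r: r["worst"] - r["thresh"])


rows = closest_to_threshold(parse_attributes(SAMPLE))
for r in rows:
    margin = r["worst"] - r["thresh"]
    print(f'{r["name"]:<24} value={r["value"]:3d} worst={r["worst"]:3d} '
          f'thresh={r["thresh"]:3d} margin={margin}')
```

With these rows, UDMA_CRC_Error_Count comes out first, since its worst value came within 1 of the threshold.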

Thanks to Grégory and Frédéric for their comments.
A few more details:
The machine dates from 2001. I moved it to Debian Testing around 2004, and it ran daily without trouble.

But random crashes are becoming more frequent...
I can no longer upgrade my system without rebooting once or twice because of a complete freeze. And if I let the machine rest for ten minutes, everything then goes smoothly...
Yet the CPU temperature stays around 50-55°C.
I ran memtest86+ last night; no errors detected.
The power supply was replaced only a few months ago.
That is why I suspect the hard disk...

In the "SMART Attributes" table, almost all the values are beyond the Worst, aren't they?
Is my old companion giving up the ghost?

Thanks,
Jean Marc
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda ATA III family
Device Model:     ST320414A
Serial Number:    3EC08W8T
Firmware Version: 3.28
User Capacity:    20 020 396 032 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   5
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Tue Aug 19 20:56:21 2008 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82)	Offline data collection activity
					was completed without error.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		 ( 422) seconds.
Offline data collection
capabilities: 			 (0x1b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					No General Purpose Logging support.
Short self-test routine 
recommended polling time: 	 (   1) minutes.
Extended self-test routine
recommended polling time: 	 (  17) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000e   057   053   025    Old_age   Always       -       32473688
  3 Spin_Up_Time            0x0002   076   070   000    Old_age   Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       182
  5 Reallocated_Sector_Ct   0x0032   100   100   036    Old_age   Always       -       0
  7 Seek_Error_Rate         0x000e   086   060   030    Old_age   Always       -       464098880
  9 Power_On_Hours          0x0032   072   072   000    Old_age   Always       -       24825
 10 Spin_Retry_Count        0x0012   100   099   097    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   020    Old_age   Always       -       2949
194 Temperature_Celsius     0x0022   034   048   000    Old_age   Always       -       34
195 Hardware_ECC_Recovered  0x001a   063   054   000    Old_age   Always       -       161356587
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   001   000    Old_age   Always       -       36496
200 Multi_Zone_Error_Rate   0x0000   100   100   000    Old_age   Offline      -       0
202 TA_Increase_Count       0x0032   100   253   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 149 (device log contains only the most recent five errors)
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 149 occurred at disk power-on lifetime: 22413 hours (933 days + 21 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 da 0d f9 e1  Error: ICRC, ABRT at LBA = 0x01f90dda = 33099226

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 00 da 0d f9 e1 00      01:42:18.599  READ DMA
  ca 00 08 ba 82 e5 e1 00      01:42:18.547  WRITE DMA
  ca 00 08 61 bb b9 e1 00      01:42:18.547  WRITE DMA
  ca 00 10 4a 84 41 e2 00      01:42:18.546  WRITE DMA
  ca 00 08 3a 83 41 e2 00      01:42:18.546  WRITE DMA

Error 148 occurred at disk power-on lifetime: 22378 hours (932 days + 10 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 ba 24 d8 e1  Error: ICRC, ABRT at LBA = 0x01d824ba = 30942394

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 ba 24 d8 e1 00      11:49:42.379  READ DMA
  ca 00 08 22 82 41 e2 00      11:49:37.042  WRITE DMA
  ca 00 08 32 82 35 e2 00      11:49:37.019  WRITE DMA
  ca 00 08 62 84 2d e2 00      11:49:37.018  WRITE DMA
  ca 00 08 1a 82 2d e2 00      11:49:37.003  WRITE DMA

Error 147 occurred at disk power-on lifetime: 22338 hours (930 days + 18 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 58 85 49 e1  Error: ICRC, ABRT at LBA = 0x01498558 = 21595480

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 08 58 85 49 e1 00      14:17:36.767  READ DMA
  ca 00 30 ca 53 d2 e1 00      14:17:36.745  WRITE DMA
  ca 00 10 2a 5f 3f e2 00      14:17:36.718  WRITE DMA
  c8 00 10 18 85 49 e1 00      14:17:36.714  READ DMA
  ca 00 08 c2 53 d2 e1 00      14:17:36.695  WRITE DMA

Error 146 occurred at disk power-on lifetime: 22232 hours (926 days + 8 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 0a 96 fa e1  Error: ICRC, ABRT at LBA = 0x01fa960a = 33199626

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 28 0a 96 fa e1 00      09:08:04.743  READ DMA
  ca 00 58 b2 f6 d1 e1 00      09:08:04.710  WRITE DMA
  c8 00 28 e2 95 fa e1 00      09:08:04.701  READ DMA
  ca 00 08 7a 4a f5 e1 00      09:08:04.669  WRITE DMA
  ca 00 08 82 ca ea e1 00      09:08:04.642  WRITE DMA

Error 145 occurred at disk power-on lifetime: 22228 hours (926 days + 4 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  84 51 00 62 a6 07 e2  Error: ICRC, ABRT at LBA = 0x0207a662 = 34055778

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 18 62 a6 07 e2 00      04:32:01.691  READ DMA
  ca 00 e8 3a 26 d2 e1 00      04:32:01.661  WRITE DMA
  c8 00 08 0a 5c 07 e2 00      04:32:01.660  READ DMA
  c8 00 08 fa 5b 07 e2 00      04:32:01.660  READ DMA
  c8 00 20 b2 5b 07 e2 00      04:32:01.648  READ DMA

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     24825         -

Device does not support Selective Self Tests/Logging
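
As a cross-check of the error log entries above, the LBA and the "days + hours" figures that smartctl prints can be recomputed from the raw numbers. The sketch below assumes 28-bit LBA addressing, where the low four bits of the DH register carry bits 24-27 of the address; the function names lba28 and hours_to_days are made up for this example.

```python
def lba28(sn, cl, ch, dh):
    """Rebuild a 28-bit LBA from the SN, CL, CH and DH register bytes.

    Assumption: the low nibble of DH holds LBA bits 24-27.
    """
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


def hours_to_days(hours):
    """Split a power-on lifetime in hours into (days, remaining hours)."""
    return divmod(hours, 24)


# Registers of error 149 as listed in the log: SN=da CL=0d CH=f9 DH=e1
lba = lba28(0xDA, 0x0D, 0xF9, 0xE1)
print(f"LBA = 0x{lba:08x} = {lba}")

days, hours = hours_to_days(22413)
print(f"{days} days + {hours} hours")
```

For error 149 this gives LBA 0x01f90dda (33099226) and 933 days + 21 hours, matching the values in the log.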
