Re: résultat d'un smartctl -a
Frédéric Massot wrote:
rudu a écrit :
Bonjour la liste,
Quelqu'un pourrait m'aider à analyser la sortie d'un :
# smartctl -a /dev/hda
(fichier joint)
?
J'y entrave que dalle ...
Dans le tableau "SMART Attributes", la colonne "VALUE" indique la
valeur actuelle, la colonne "WORST" le minimum atteint, et "THRESH" le
minimum à ne pas dépasser.
On peut voir pour la ligne UDMA_CRC_Error_Count que la valeur du
minimum atteint est "001" pour une limite à "000", la valeur actuelle
est "200".
Conclusion, tu as eu des gros problèmes avec les transferts en mode
DMA ou UDMA, les cinq logs d'erreurs indiquent la même chose des
problèmes pour la lecture et l'écriture en mode DMA.
Merci à Grégory et Frédéric pour leurs commentaires.
Quelques précisions:
La machine date de 2001, je l'ai passée en Debian Testing en 2004
environ, et elle tournait quotidiennement sans soucis.
Mais des plantages aléatoires se rapprochent dans le temps...
Je ne peux plus faire de mise à jour de mon système sans redémarrer une
ou deux fois pour cause de freeze complet. Et si je laisse la machine
reposer dix minutes, ça passe comme une fleur après...
Pourtant la T° du CPU reste dans les 50-55°C.
J'ai fait tourner un memtest86+ la nuit dernière, pas d'erreur détectée.
L'alim a été changée il y a quelques mois seulement.
C'est pourquoi je soupçonne le DD ...
Dans le tableau "SMART Attributes" presque toutes les valeurs sont
au-delà du Worst, non?
Ma vieille copine rend elle l'âme ?
Merci,
Jean Marc
smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda ATA III family
Device Model: ST320414A
Serial Number: 3EC08W8T
Firmware Version: 3.28
User Capacity: 20 020 396 032 bytes
Device is: In smartctl database [for details use: -P show]
ATA Version is: 5
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Tue Aug 19 20:56:21 2008 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 422) seconds.
Offline data collection
capabilities: (0x1b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
No Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
No General Purpose Logging support.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 17) minutes.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000e 057 053 025 Old_age Always - 32473688
3 Spin_Up_Time 0x0002 076 070 000 Old_age Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 182
5 Reallocated_Sector_Ct 0x0032 100 100 036 Old_age Always - 0
7 Seek_Error_Rate 0x000e 086 060 030 Old_age Always - 464098880
9 Power_On_Hours 0x0032 072 072 000 Old_age Always - 24825
10 Spin_Retry_Count 0x0012 100 099 097 Old_age Always - 0
12 Power_Cycle_Count 0x0032 098 098 020 Old_age Always - 2949
194 Temperature_Celsius 0x0022 034 048 000 Old_age Always - 34
195 Hardware_ECC_Recovered 0x001a 063 054 000 Old_age Always - 161356587
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 001 000 Old_age Always - 36496
200 Multi_Zone_Error_Rate 0x0000 100 100 000 Old_age Offline - 0
202 TA_Increase_Count 0x0032 100 253 000 Old_age Always - 0
SMART Error Log Version: 1
ATA Error Count: 149 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 149 occurred at disk power-on lifetime: 22413 hours (933 days + 21 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 da 0d f9 e1 Error: ICRC, ABRT at LBA = 0x01f90dda = 33099226
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 00 da 0d f9 e1 00 01:42:18.599 READ DMA
ca 00 08 ba 82 e5 e1 00 01:42:18.547 WRITE DMA
ca 00 08 61 bb b9 e1 00 01:42:18.547 WRITE DMA
ca 00 10 4a 84 41 e2 00 01:42:18.546 WRITE DMA
ca 00 08 3a 83 41 e2 00 01:42:18.546 WRITE DMA
Error 148 occurred at disk power-on lifetime: 22378 hours (932 days + 10 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 ba 24 d8 e1 Error: ICRC, ABRT at LBA = 0x01d824ba = 30942394
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 ba 24 d8 e1 00 11:49:42.379 READ DMA
ca 00 08 22 82 41 e2 00 11:49:37.042 WRITE DMA
ca 00 08 32 82 35 e2 00 11:49:37.019 WRITE DMA
ca 00 08 62 84 2d e2 00 11:49:37.018 WRITE DMA
ca 00 08 1a 82 2d e2 00 11:49:37.003 WRITE DMA
Error 147 occurred at disk power-on lifetime: 22338 hours (930 days + 18 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 58 85 49 e1 Error: ICRC, ABRT at LBA = 0x01498558 = 21595480
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 58 85 49 e1 00 14:17:36.767 READ DMA
ca 00 30 ca 53 d2 e1 00 14:17:36.745 WRITE DMA
ca 00 10 2a 5f 3f e2 00 14:17:36.718 WRITE DMA
c8 00 10 18 85 49 e1 00 14:17:36.714 READ DMA
ca 00 08 c2 53 d2 e1 00 14:17:36.695 WRITE DMA
Error 146 occurred at disk power-on lifetime: 22232 hours (926 days + 8 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 0a 96 fa e1 Error: ICRC, ABRT at LBA = 0x01fa960a = 33199626
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 28 0a 96 fa e1 00 09:08:04.743 READ DMA
ca 00 58 b2 f6 d1 e1 00 09:08:04.710 WRITE DMA
c8 00 28 e2 95 fa e1 00 09:08:04.701 READ DMA
ca 00 08 7a 4a f5 e1 00 09:08:04.669 WRITE DMA
ca 00 08 82 ca ea e1 00 09:08:04.642 WRITE DMA
Error 145 occurred at disk power-on lifetime: 22228 hours (926 days + 4 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 62 a6 07 e2 Error: ICRC, ABRT at LBA = 0x0207a662 = 34055778
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 18 62 a6 07 e2 00 04:32:01.691 READ DMA
ca 00 e8 3a 26 d2 e1 00 04:32:01.661 WRITE DMA
c8 00 08 0a 5c 07 e2 00 04:32:01.660 READ DMA
c8 00 08 fa 5b 07 e2 00 04:32:01.660 READ DMA
c8 00 20 b2 5b 07 e2 00 04:32:01.648 READ DMA
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 24825 -
Device does not support Selective Self Tests/Logging
Reply to: