Koozali.org: home of the SME Server

Problema SMART

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Problema SMART
« on: December 19, 2012, 11:58:43 AM »
Qualche giorno fa mi sono accorto che il disco sda del mio SME 7.5.1 in raid 1 era stato rimosso dal raid e perciò l'ho riagganciato e lui si è risincronizzato. Ma continuo  ad avere questo log :

Dec 19 10:00:48 server2010 smartd[4334]: Device: /dev/sda, 3 Offline uncorrectable sectors
Dec 19 10:00:48 server2010 smartd[4334]: Device: /dev/sda, SMART Prefailure Attribute: 7 Seek_Error_Rate changed from 200 to 100
Dec 19 10:00:48 server2010 smartd[4334]: Device: /dev/sdb, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 76 to 77
Dec 19 10:00:48 server2010 smartd[4334]: Device: /dev/sdb, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 50 to 49
Dec 19 10:03:32 server2010 sshd(pam_unix)[6646]: session opened for user root by (uid=0)
Dec 19 10:03:42 server2010 su(pam_unix)[6684]: session opened for user admin by root(uid=0)
Dec 19 10:03:51 server2010 su(pam_unix)[6684]: session closed for user admin
Dec 19 10:19:51 server2010 sshd(pam_unix)[6646]: session closed for user root
Dec 19 10:20:05 server2010 HORDE[5662]: [imp] Login success for admin [192.168.1.54] to {localhost:143 [imap/notls]} [pid 5662 on line 307 of "/home/httpd/html/horde/imp/lib/Session.php"]
Dec 19 10:20:06 server2010 slapd[4586]: conn=1 fd=7 ACCEPT from IP=127.0.0.1:32895 (IP=0.0.0.0:389)
Dec 19 10:20:06 server2010 slapd[4586]: conn=1 op=0 BIND dn="" method=128
Dec 19 10:20:06 server2010 slapd[4586]: conn=1 op=0 RESULT tag=97 err=0 text=
Dec 19 10:20:06 server2010 slapd[4586]: conn=1 op=1 UNBIND
Dec 19 10:20:06 server2010 slapd[4586]: conn=1 fd=7 closed
Dec 19 10:20:16 server2010 HORDE[5659]: [imp] Logout for admin [192.168.1.54] to {localhost:143 [imap/notls]} [pid 5659 on line 67 of "/home/httpd/html/horde/imp/login.php"]
Dec 19 10:22:38 server2010 smbd[7372]: [2012/12/19 10:22:38, 0] smbd/nttrans.c:call_nt_transact_ioctl(2466)
Dec 19 10:22:38 server2010 smbd[7372]:   call_nt_transact_ioctl(0x901af): Currently not implemented.
Dec 19 10:30:48 server2010 smartd[4334]: Device: /dev/sda, 3 Offline uncorrectable sectors
Dec 19 10:30:48 server2010 smartd[4334]: Device: /dev/sdb, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 77 to 78
Dec 19 10:30:48 server2010 smartd[4334]: Device: /dev/sdb, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 49 to 48
Dec 19 11:00:47 server2010 smartd[4334]: Device: /dev/sda, 3 Offline uncorrectable sectors
Dec 19 11:18:32 server2010 smbd[9290]: [2012/12/19 11:18:32, 0] smbd/nttrans.c:call_nt_transact_ioctl(2466)
Dec 19 11:18:32 server2010 smbd[9290]:   call_nt_transact_ioctl(0x901af): Currently not implemented.
Dec 19 11:27:30 server2010 sshd(pam_unix)[9599]: session opened for user root by (uid=0)
Dec 19 11:30:47 server2010 smartd[4334]: Device: /dev/sda, 3 Offline uncorrectable sectors
Dec 19 11:30:47 server2010 smartd[4334]: Device: /dev/sdb, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 48 to 47
Dec 19 11:34:33 server2010 smbd[7372]: [2012/12/19 11:34:33, 0] lib/util_sock.c:read_data(540)
Dec 19 11:34:33 server2010 smbd[7372]:   read_data: read failure for 4 bytes to client 192.168.1.24. Error = Connection timed out
Dec 19 11:38:02 server2010 sshd(pam_unix)[9599]: session closed for user root

E' il caso che sostituisco un disco ? Quale ?

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: Problema SMART
« Reply #1 on: December 19, 2012, 12:11:25 PM »
si
sda

:-)

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Re: Problema SMART
« Reply #2 on: December 19, 2012, 12:12:05 PM »
Aggiungo che analizzando i dischi con il cd di diagnostica dell'Hp , perchè si tratta di un Proliant ml 110 g6 , mi dice che tutto è a posto. Chi ha ragione ?

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: Problema SMART
« Reply #3 on: December 19, 2012, 12:14:36 PM »
i tuoi dati comandano...

Code: [Select]
/dev/sda, 3 Offline uncorrectable sectors

se ti si corrompono i dati, la corruzione verrà passata (non in termini di danno fisico al disco ovviamente) anche sul disco sano..

il costo di un disco certamente non vale i miei dati.. i tuoi?

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Re: Problema SMART
« Reply #4 on: December 19, 2012, 12:19:37 PM »
Hai perfettamente ragione !!!!
Ma sono un po' di coccio ....  :?  Quindi è normale che sda lanciando un  smartctl -a /dev/sda
Mi restituisce :

smartctl version 5.33 [i686-redhat-linux-gnu] Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     GB0250EAFYK
Serial Number:    WCAT1E859111
Firmware Version: HPG2
User Capacity:    250,059,350,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   7
ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 4a
Local Time is:    Wed Dec 19 12:16:45 2012 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 (4680) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  58) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       11714
  3 Spin_Up_Time            0x0027   201   200   021    Pre-fail  Always       -       950
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       46
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002f   100   253   051    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   070   070   000    Old_age   Always       -       21972
 10 Spin_Retry_Count        0x0033   100   253   051    Pre-fail  Always       -       0
 11 Calibration_Retry_Count 0x0033   100   253   051    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       45
184 Unknown_Attribute       0x0033   100   100   097    Pre-fail  Always       -       0
187 Unknown_Attribute       0x0032   100   001   000    Old_age   Always       -       11774
188 Unknown_Attribute       0x0032   100   093   000    Old_age   Always       -       11
190 Unknown_Attribute       0x0022   067   057   045    Old_age   Always       -       33
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       44
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       1
194 Temperature_Celsius     0x0022   110   100   000    Old_age   Always       -       33
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       3
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       3

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     21969         -
# 2  Short offline       Aborted by host               90%     21969         -
# 3  Short offline       Completed without error       00%     21968         -
# 4  Short offline       Completed: read failure       10%     21954         432321839
# 5  Short offline       Completed: read failure       90%      4405         432321839
# 6  Short offline       Completed: read failure       90%      4405         432321839
# 7  Short offline       Aborted by host               90%      4405         -
# 8  Short offline       Completed without error       00%         3         -
# 9  Short offline       Aborted by host               90%         3         -
#10  Short offline       Completed without error       00%         1         -
#11  Short offline       Aborted by host               90%         1         -
#12  Short offline       Aborted by host               90%         1         -
#13  Short offline       Aborted by host               90%         0         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Re: Problema SMART
« Reply #5 on: December 19, 2012, 12:21:28 PM »
Mentre sdb :

smartctl -a /dev/sdb
smartctl version 5.33 [i686-redhat-linux-gnu] Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     GB0250EAFJF
Serial Number:    9SF0ZRSF
Firmware Version: HPG6
User Capacity:    250,059,350,016 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   7
ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 4a
Local Time is:    Wed Dec 19 12:20:10 2012 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 642) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  60) minutes.
Conveyance self-test routine
recommended polling time:        (   3) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   078   063   044    Pre-fail  Always       -       79117300
  3 Spin_Up_Time            0x0003   099   099   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       81
  5 Reallocated_Sector_Ct   0x0033   097   097   036    Pre-fail  Always       -       76
  7 Seek_Error_Rate         0x000f   078   060   030    Pre-fail  Always       -       30629038161
  9 Power_On_Hours          0x0032   068   068   000    Old_age   Always       -       28674
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   037   020    Old_age   Always       -       79
184 Unknown_Attribute       0x0033   100   100   003    Pre-fail  Always       -       0
187 Unknown_Attribute       0x0032   001   001   000    Old_age   Always       -       612
188 Unknown_Attribute       0x0032   100   099   000    Old_age   Always       -       1
189 Unknown_Attribute       0x003a   100   100   000    Old_age   Always       -       0
190 Unknown_Attribute       0x0022   071   060   045    Old_age   Always       -       488308765
194 Temperature_Celsius     0x0022   029   040   000    Old_age   Always       -       29 (Lifetime Min/Max 0/16)
195 Hardware_ECC_Recovered  0x001a   046   031   000    Old_age   Always       -       79117300
196 Reallocated_Event_Count 0x0033   097   097   036    Pre-fail  Always       -       76
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
ATA Error Count: 1097 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 1097 occurred at disk power-on lifetime: 28671 hours (1194 days + 15 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 08 ff ff ff 4f 00  49d+12:41:45.240  [RESERVED FOR SERIAL ATA]
  61 00 08 05 54 18 42 00  49d+12:41:45.212  [RESERVED FOR SERIAL ATA]
  61 00 08 35 13 18 42 00  49d+12:41:45.210  [RESERVED FOR SERIAL ATA]
  61 00 08 7d 10 18 42 00  49d+12:41:45.209  [RESERVED FOR SERIAL ATA]
  61 00 08 9d 06 17 42 00  49d+12:41:45.209  [RESERVED FOR SERIAL ATA]

Error 1096 occurred at disk power-on lifetime: 28637 hours (1193 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 e5 b7 17 0b

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  61 00 48 ad ae 03 40 00  48d+02:25:22.424  [RESERVED FOR SERIAL ATA]
  60 00 08 dd b8 17 4b 00  48d+02:25:22.355  [RESERVED FOR SERIAL ATA]
  60 00 f8 e5 b7 17 4b 00  48d+02:25:22.355  [RESERVED FOR SERIAL ATA]
  61 00 08 fd b1 2d 42 00  48d+02:25:22.354  [RESERVED FOR SERIAL ATA]
  61 00 08 a5 b1 2d 42 00  48d+02:25:22.354  [RESERVED FOR SERIAL ATA]

Error 1095 occurred at disk power-on lifetime: 28612 hours (1192 days + 4 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 4d f1 2f 0e

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 00 3d f2 2f 4e 00  47d+01:42:08.221  [RESERVED FOR SERIAL ATA]
  60 00 f0 4d f1 2f 4e 00  47d+01:42:08.211  [RESERVED FOR SERIAL ATA]
  60 00 00 85 e4 2d 4e 00  47d+01:42:08.003  [RESERVED FOR SERIAL ATA]
  60 00 00 85 e3 2d 4e 00  47d+01:42:07.988  [RESERVED FOR SERIAL ATA]
  60 00 00 85 e2 2d 4e 00  47d+01:42:07.976  [RESERVED FOR SERIAL ATA]

Error 1094 occurred at disk power-on lifetime: 28590 hours (1191 days + 6 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 c8 ff ff ff 4f 00  46d+04:03:43.806  [RESERVED FOR SERIAL ATA]
  60 00 c8 ff ff ff 4f 00  46d+04:03:43.797  [RESERVED FOR SERIAL ATA]
  61 00 08 bd 9e 03 40 00  46d+04:03:43.779  [RESERVED FOR SERIAL ATA]
  61 00 40 7d 9e 03 40 00  46d+04:03:43.765  [RESERVED FOR SERIAL ATA]
  60 00 c8 ff ff ff 4f 00  46d+04:03:43.762  [RESERVED FOR SERIAL ATA]

Error 1093 occurred at disk power-on lifetime: 28589 hours (1191 days + 5 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 00 ff ff ff 0f

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  60 00 08 ff ff ff 4f 00  46d+03:05:43.821  [RESERVED FOR SERIAL ATA]
  61 00 08 ff ff ff 4f 00  46d+03:05:43.819  [RESERVED FOR SERIAL ATA]
  60 00 08 1d 3e 2d 42 00  46d+03:05:43.805  [RESERVED FOR SERIAL ATA]
  61 00 08 f5 0d 04 40 00  46d+03:05:43.790  [RESERVED FOR SERIAL ATA]
  61 00 90 65 0d 04 40 00  46d+03:05:43.778  [RESERVED FOR SERIAL ATA]

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     28673         -
# 2  Short offline       Completed without error       00%     28671         -
# 3  Short offline       Aborted by host               90%     28671         -
# 4  Short offline       Completed without error       00%     28671         -
# 5  Short captive       Completed without error       00%     11056         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: Problema SMART
« Reply #6 on: December 19, 2012, 12:40:11 PM »
io vedo che sda ha degli errori "read failure"..

sono 2 hd da 250 Gb, quindi hanno un po' di tempo, quasi certamente sono fuori garanzia..

2 serial ata di dimensione maggiore (occhio a non prendere quelli green) e li sostituisci entrambi.. un paio di centinaia di euro (ma anche meno) e vivi felice..

non male, direi

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Re: Problema SMART
« Reply #7 on: December 19, 2012, 12:47:15 PM »
Ok grazie mille !!!
Inizierò a sostituire sda per primo.

Offline ndr3w

  • ****
  • 97
  • +0/-0
Re: Problema SMART
« Reply #8 on: December 19, 2012, 09:22:28 PM »
@Stefano

Come mai i green no?

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: Problema SMART
« Reply #9 on: December 19, 2012, 10:02:24 PM »
perchè i green hanno delle features che non sono adatte ad un uso continuato h24 in un server ed in raid..
google (sempre da quella parte ---> ) saprà illuminarti :-)

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Re: Problema SMART
« Reply #10 on: December 21, 2012, 08:56:54 AM »
Rieccomi  8) per dirvi che ho cambiato sda e poi ho risincronizzato il raid . Poi ho ricevuto questa mail :

SMART error (ErrorCount) detected on host: server2010

This email was generated by the smartd daemon running on:

   host name: server2010
  DNS domain: test.local
  NIS domain: (none)

The following warning/error was logged by the smartd daemon:

Device: /dev/sdb, ATA error count increased from 1097 to 1098

For details see host's SYSLOG (default: /var/log/messages).

You can also use the smartctl utility for further investigation.
Another email message will be sent in 1 days if the problem persists

e quindi ho cambiato anche sdb e poi ho risincronizzato il raid.
Ma ancora sui log ho questo :

Dec 21 01:25:03 server2010 smartd[4267]: Device: /dev/sda, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 119 to 100
Dec 21 01:25:03 server2010 smartd[4267]: Device: /dev/sda, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 58 to 63
Dec 21 01:25:03 server2010 smartd[4267]: Device: /dev/sdb, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 117 to 118
Dec 21 01:25:03 server2010 smartd[4267]: Device: /dev/sdb, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 60 to 61
Dec 21 01:30:04 server2010 squid[4793]: storeDirWriteCleanLogs: Starting...
Dec 21 01:30:04 server2010 squid[4793]:   Finished.  Wrote 37 entries.
Dec 21 01:30:04 server2010 squid[4793]:   Took 0.0 seconds (4569.0 entries/sec).
Dec 21 01:30:04 server2010 squid[4793]: logfileRotate: /var/log/squid/store.log
Dec 21 01:30:04 server2010 squid[4793]: logfileRotate: /var/log/squid/access.log
Dec 21 01:55:04 server2010 smartd[4267]: Device: /dev/sda, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 63 to 61
Dec 21 02:25:04 server2010 smartd[4267]: Device: /dev/sda, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 100 to 101
Dec 21 02:25:04 server2010 smartd[4267]: Device: /dev/sda, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 61 to 62
Dec 21 03:25:04 server2010 smartd[4267]: Device: /dev/sda, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 62 to 61
Dec 21 04:25:03 server2010 smartd[4267]: Device: /dev/sda, SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 101 to 102
Dec 21 04:55:04 server2010 smartd[4267]: Device: /dev/sda, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 61 to 60
Dec 21 07:55:04 server2010 smartd[4267]: Device: /dev/sda, SMART Usage Attribute: 195 Hardware_ECC_Recovered changed from 60 to 59

Possibile? Che devo fare ?  :(

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: Problema SMART
« Reply #11 on: December 21, 2012, 11:04:17 AM »
dovresti "impratichirti" di più con smartctl per comprendere quali sono i segnali "seri" e quali quelli di "rumore di fondo"

google è il punto da cui partire, magari inserendo come key i vari "Hardware_ECC_Recovered" ecc

Offline FrancescoC

  • *****
  • 226
  • +0/-0
Re: Problema SMART
« Reply #12 on: December 21, 2012, 12:31:27 PM »
Scusa ma non potrebbe essere che mi sono portato dietro degli errori logici dai dischi precendeti e magari con uno "scandisk" di linux posso risolvere?

Offline Stefano

  • *
  • 10,894
  • +3/-0
Re: Problema SMART
« Reply #13 on: December 21, 2012, 01:02:01 PM »
no, quelli che vedi non sono errori logici ma variazioni di stato di grandezze dell'hd.. sul come interpretarli devi impararlo tramite google