Reinhold,
Hey, what do you mean by "when your adaptec dies.. your RAID isn't worth a penny"?
I mean, if one of the disks breaks down, I'd buy 2 larger SCSI disks, rebuild the array onto one of them, then fail the remaining original disk, attach the second large disk, and then grow the RAID... or?
If the controller breaks down (unlikely, as it has worked for some years now), I can attach the disks to another controller (I happen to have a spare one) and happily go on? Of course I'd be offline for a while, but surely I can recover from the disks, as they are not in hardware RAID?
Just checked: the cable on hdc is a nice long 80-conductor one. I'm attaching the output of smartctl, just to show off. Well, I did some googling, and SMART clearly says it is an old drive that is going to fail anytime. That's OK with me, as it just holds a backup copy of online data.
Btw, this is /var/log/raidmonitor. I don't understand the date format, but it sure talks a lot about spares.
ciao and many thanks again, Michel
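
On the date format in /var/log/raidmonitor: the @4000... prefixes look like TAI64N labels, the kind written by daemontools' multilog (an assumption based on the file names and the 24 hex digits). Under that assumption, the first 16 hex digits are seconds and the last 8 are nanoseconds. The sketch below converts one label to UTC; the function name is made up for illustration, and the fixed offset of 2**62 + 10 is the conversion I believe tai64nlocal applies, so treat it as approximate to within leap seconds.

```python
from datetime import datetime, timedelta, timezone

# Assumption: the writer stored "Unix time + 2**62 + 10" in the first
# 16 hex digits, which is the convention tai64nlocal is believed to undo.
TAI64_OFFSET = 2**62 + 10


def decode_tai64n(label: str) -> datetime:
    """Convert a label such as '@40000000464c1b300091f53c' to a UTC datetime."""
    hex_part = label.lstrip("@")
    if len(hex_part) != 24:
        raise ValueError(f"expected 24 hex digits, got {len(hex_part)}")
    seconds = int(hex_part[:16], 16) - TAI64_OFFSET
    nanoseconds = int(hex_part[16:], 16)
    # datetime only keeps microseconds, so the last three digits are dropped.
    return datetime.fromtimestamp(seconds, tz=timezone.utc) + timedelta(
        microseconds=nanoseconds // 1000
    )


if __name__ == "__main__":
    # First label from the "current" file.
    print(decode_tai64n("@40000000464c1b300091f53c").isoformat())
```

For the first label in the log this gives 17 May 2007, 09:06:46 UTC, which lines up with the 11:06 CEST modification time of the "state" file in the listing.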
[root@www raidmonitor]# ll
total 12
-rw-r--r-- 1 smelog smelog 3216 May 10 18:08 @40000000464c1b2d28315c2c.u
-rw-r--r-- 1 smelog smelog 6993 Aug 3 15:03 current
-rw------- 1 smelog smelog 0 May 8 19:10 lock
-rw-r--r-- 1 smelog smelog 0 May 17 11:06 state
[root@www raidmonitor]# cat current
@40000000464c1b300091f53c mdadm: only specify super-minor once, super-minor=2 ignored.
@40000000464c1b300093934c mdadm: only specify super-minor once, super-minor=1 ignored.
@40000000464c1b303840797c Event: DegradedArray, Device: /dev/md1, Member:
@40000000464c1b312e91516c Event: SparesMissing, Device: /dev/md1, Member:
@40000000464c1b3138ac56c4 Event: DegradedArray, Device: /dev/md2, Member:
@40000000464c1b321aa1e8c4 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046515e1e090b8764 Event: DegradedArray, Device: /dev/md1, Member:
@4000000046515e1e1fac2d5c Event: SparesMissing, Device: /dev/md1, Member:
@4000000046515e1e2e4563ec Event: DegradedArray, Device: /dev/md2, Member:
@4000000046515e1f01baf094 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046518dc506295abc Event: DegradedArray, Device: /dev/md1, Member:
@4000000046518dc51982a99c Event: SparesMissing, Device: /dev/md1, Member:
@4000000046518dc523e110f4 Event: DegradedArray, Device: /dev/md2, Member:
@4000000046518dc5323c021c Event: SparesMissing, Device: /dev/md2, Member:
@400000004651d1010e4f7d84 Event: DegradedArray, Device: /dev/md1, Member:
@400000004651d10202c2d72c Event: SparesMissing, Device: /dev/md1, Member:
@400000004651d1020d66521c Event: DegradedArray, Device: /dev/md2, Member:
@400000004651d1021d660a04 Event: SparesMissing, Device: /dev/md2, Member:
@400000004651da782c939154 Event: DegradedArray, Device: /dev/md1, Member:
@400000004651da790b61ac64 Event: SparesMissing, Device: /dev/md1, Member:
@400000004651da7915a7b18c Event: DegradedArray, Device: /dev/md2, Member:
@400000004651da79261e4724 Event: SparesMissing, Device: /dev/md2, Member:
@40000000465577d5218444d4 Event: DegradedArray, Device: /dev/md1, Member:
@40000000465577d600ee9044 Event: SparesMissing, Device: /dev/md1, Member:
@40000000465577d60ad546fc Event: DegradedArray, Device: /dev/md2, Member:
@40000000465577d61907862c Event: SparesMissing, Device: /dev/md2, Member:
@4000000046558468180cbb0c Event: DegradedArray, Device: /dev/md1, Member:
@40000000465584682ba97f74 Event: SparesMissing, Device: /dev/md1, Member:
@4000000046558468357eb99c Event: DegradedArray, Device: /dev/md2, Member:
@400000004655846905faff8c Event: SparesMissing, Device: /dev/md2, Member:
@4000000046558ba438744f04 Event: DegradedArray, Device: /dev/md1, Member:
@4000000046558ba52bd7ceec Event: SparesMissing, Device: /dev/md1, Member:
@4000000046558ba535ef55bc Event: DegradedArray, Device: /dev/md2, Member:
@4000000046558ba609c6c8e4 Event: SparesMissing, Device: /dev/md2, Member:
@400000004655952028d49d24 Event: DegradedArray, Device: /dev/md1, Member:
@40000000465595211d747954 Event: SparesMissing, Device: /dev/md1, Member:
@400000004655952127f95c3c Event: DegradedArray, Device: /dev/md2, Member:
@4000000046559522030f4d64 Event: SparesMissing, Device: /dev/md2, Member:
@40000000465599a40e28cb44 Event: DegradedArray, Device: /dev/md1, Member:
@40000000465599a438edad54 Event: SparesMissing, Device: /dev/md1, Member:
@40000000465599a50a9918bc Event: DegradedArray, Device: /dev/md2, Member:
@40000000465599a5184f32ac Event: SparesMissing, Device: /dev/md2, Member:
@4000000046559d2b06b3d7dc Event: DegradedArray, Device: /dev/md1, Member:
@4000000046559d2c0393850c Event: SparesMissing, Device: /dev/md1, Member:
@4000000046559d2c1012ccac Event: DegradedArray, Device: /dev/md2, Member:
@4000000046559d2c1e7d3aac Event: SparesMissing, Device: /dev/md2, Member:
@400000004655a18815fcbefc Event: DegradedArray, Device: /dev/md1, Member:
@400000004655a18900906ab4 Event: SparesMissing, Device: /dev/md1, Member:
@400000004655a1890d9db1e4 Event: DegradedArray, Device: /dev/md2, Member:
@400000004655a1891c4fbbec Event: SparesMissing, Device: /dev/md2, Member:
@40000000466d605637a66bfc Event: DegradedArray, Device: /dev/md1, Member:
@40000000466d605711e398f4 Event: SparesMissing, Device: /dev/md1, Member:
@40000000466d60571be096cc Event: DegradedArray, Device: /dev/md2, Member:
@40000000466d60572b037b74 Event: SparesMissing, Device: /dev/md2, Member:
@400000004676870811cf1adc Event: DegradedArray, Device: /dev/md1, Member:
@400000004676870824be8c2c Event: SparesMissing, Device: /dev/md1, Member:
@40000000467687082e9b4c1c Event: DegradedArray, Device: /dev/md2, Member:
@400000004676870900cc3f1c Event: SparesMissing, Device: /dev/md2, Member:
@400000004676a48f18df2254 Event: RebuildStarted, Device: /dev/md2, Member:
@400000004676a50723b1330c Event: Rebuild20, Device: /dev/md2, Member:
@400000004676a5bb3300e85c Event: Rebuild40, Device: /dev/md2, Member:
@400000004676a63408d596cc Event: Rebuild60, Device: /dev/md2, Member:
@400000004676a6e81b3e8e54 Event: Rebuild80, Device: /dev/md2, Member:
@400000004676a79c2c2137e4 Event: RebuildFinished, Device: /dev/md2, Member:
@400000004676a79c39c5e0fc Event: SpareActive, Device: /dev/md2, Member: /dev/sda2
@400000004683a0fb3a7ad14c Event: DegradedArray, Device: /dev/md1, Member:
@400000004683a0fc12f318dc Event: SparesMissing, Device: /dev/md1, Member:
@400000004683a0fc1cd94eac Event: DegradedArray, Device: /dev/md2, Member:
@400000004683a0fc3023ebd4 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046a4a22421794084 Event: DegradedArray, Device: /dev/md1, Member:
@4000000046a4a225042dc16c Event: SparesMissing, Device: /dev/md1, Member:
@4000000046a4a2250f42363c Event: DegradedArray, Device: /dev/md2, Member:
@4000000046a4a2252347f98c Event: SparesMissing, Device: /dev/md2, Member:
@4000000046a4b9112138efac Event: DegradedArray, Device: /dev/md1, Member:
@4000000046a4b91207c4bb3c Event: SparesMissing, Device: /dev/md1, Member:
@4000000046a4b91212c853a4 Event: DegradedArray, Device: /dev/md2, Member:
@4000000046a4b91227bc10d4 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046a5f42728e7ee74 Event: DegradedArray, Device: /dev/md1, Member:
@4000000046a5f4280d2dbdbc Event: SparesMissing, Device: /dev/md1, Member:
@4000000046a5f428176f3ea4 Event: DegradedArray, Device: /dev/md2, Member:
@4000000046a5f42828d8cb74 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046a8549b0b98ba24 Event: DegradedArray, Device: /dev/md1, Member:
@4000000046a8549b31c42684 Event: SparesMissing, Device: /dev/md1, Member:
@4000000046a8549c02e280a4 Event: DegradedArray, Device: /dev/md2, Member:
@4000000046a8549c19a62b74 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046ad98d211a07d44 Event: DegradedArray, Device: /dev/md1, Member:
@4000000046ad98d22c4146ec Event: SparesMissing, Device: /dev/md1, Member:
@4000000046ad98d2366bd8bc Event: DegradedArray, Device: /dev/md2, Member:
@4000000046ad98d30dbb53d4 Event: SparesMissing, Device: /dev/md2, Member:
@4000000046b3279b27a6eeac Event: DegradedArray, Device: /dev/md1, Member:
@4000000046b3279c1dcefd34 Event: SparesMissing, Device: /dev/md1, Member:
@4000000046b3279c28adeae4 Event: DegradedArray, Device: /dev/md2, Member:
@4000000046b3279d03924c8c Event: SparesMissing, Device: /dev/md2, Member:
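
Since the log above is mostly the same two events repeated, a tally per device makes it easier to read. This is only a sketch: the regular expression is inferred from the lines shown here, not from any mdadm documentation, and the function name is hypothetical.

```python
import re
from collections import Counter

# Matches lines like:
#   @4000... Event: DegradedArray, Device: /dev/md1, Member:
# The pattern is inferred from the log above (an assumption).
EVENT_RE = re.compile(
    r"^@(?P<stamp>[0-9a-f]{24}) Event: (?P<event>\w+), "
    r"Device: (?P<device>\S+), Member:\s*(?P<member>\S*)$"
)


def tally_events(lines):
    """Count (device, event) pairs; lines that are not events are skipped."""
    counts = Counter()
    for line in lines:
        match = EVENT_RE.match(line.strip())
        if match:
            counts[(match["device"], match["event"])] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "@40000000464c1b300091f53c mdadm: only specify super-minor once, super-minor=2 ignored.",
        "@40000000464c1b303840797c Event: DegradedArray, Device: /dev/md1, Member:",
        "@400000004676a79c39c5e0fc Event: SpareActive, Device: /dev/md2, Member: /dev/sda2",
    ]
    for (device, event), n in sorted(tally_events(sample).items()):
        print(device, event, n)
```

Feeding it the whole "current" file (for example `tally_events(open("current"))`) should show at a glance whether md1 ever logged a rebuild the way md2 did.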
[root@www raidmonitor]# smartctl -a /dev/hdc
smartctl version 5.33 [i686-redhat-linux-gnu] Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: MAXTOR STM3802110A
Serial Number: 9LR0SQXE
Firmware Version: 3.AAK
User Capacity: 80,026,361,856 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 7
ATA Standard is: Exact ATA specification draft version not indicated
Local Time is: Mon Aug 6 11:07:46 2007 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 430) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 27) minutes.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 117 074 006 Pre-fail Always - 118666658
3 Spin_Up_Time 0x0003 094 094 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 3
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 079 060 030 Pre-fail Always - 96280985
9 Power_On_Hours 0x0032 099 099 000 Old_age Always - 1447
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 8
187 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0
189 Unknown_Attribute 0x003a 100 100 000 Old_age Always - 0
190 Unknown_Attribute 0x0022 060 056 045 Old_age Always - 740163624
194 Temperature_Celsius 0x0022 040 044 000 Old_age Always - 40 (Lifetime Min/Max 0/27)
195 Hardware_ECC_Recovered 0x001a 050 046 000 Old_age Always - 3228972
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 187 000 Old_age Always - 120
200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age Offline - 0
202 TA_Increase_Count 0x0032 100 253 000 Old_age Always - 0
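
To check the attribute table above without eyeballing every row, here is a small sketch that splits each row into its columns and compares the normalized VALUE against THRESH. It assumes the ten-column layout printed above and the usual SMART convention that an attribute counts as failed once VALUE drops to or below a non-zero THRESH; the function names are made up for illustration.

```python
def parse_attribute(line):
    """Split one row of the attribute table into a dict, or return None."""
    fields = line.split(None, 9)  # RAW_VALUE may itself contain spaces
    if len(fields) < 10 or not fields[0].isdigit():
        return None  # header or unrelated line
    return {
        "id": int(fields[0]),
        "name": fields[1],
        "value": int(fields[3]),
        "worst": int(fields[4]),
        "thresh": int(fields[5]),
        "type": fields[6],
        "raw": fields[9],
    }


def failing(lines):
    """Return attributes whose normalized value is at or below a non-zero threshold."""
    attrs = [a for a in map(parse_attribute, lines) if a]
    return [a for a in attrs if a["thresh"] > 0 and a["value"] <= a["thresh"]]


if __name__ == "__main__":
    rows = [
        "ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE",
        "1 Raw_Read_Error_Rate 0x000f 117 074 006 Pre-fail Always - 118666658",
        "5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0",
        "199 UDMA_CRC_Error_Count 0x003e 200 187 000 Old_age Always - 120",
    ]
    print([a["name"] for a in failing(rows)])
```

With the rows pasted above, nothing is at or below its threshold, which agrees with the "PASSED" overall assessment; the raw counters (such as the 120 CRC errors) still need a human look.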
SMART Error Log Version: 1
ATA Error Count: 167 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 167 occurred at disk power-on lifetime: 1443 hours (60 days + 3 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 46 00 e0 e0 Error: ICRC, ABRT at LBA = 0x00e00046 = 14680134
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
10 00 3f 00 00 00 e0 00 08:21:45.669 RECALIBRATE [OBS-4]
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
25 00 80 08 00 00 e0 00 08:21:45.642 READ DMA EXT
Error 166 occurred at disk power-on lifetime: 1443 hours (60 days + 3 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 46 00 e0 e0 Error: ICRC, ABRT at LBA = 0x00e00046 = 14680134
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
10 00 3f 00 00 00 e0 00 08:21:45.669 RECALIBRATE [OBS-4]
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
ea 00 00 00 00 00 e0 00 08:21:45.642 FLUSH CACHE EXIT
Error 165 occurred at disk power-on lifetime: 1443 hours (60 days + 3 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 46 00 e0 e0 Error: ICRC, ABRT at LBA = 0x00e00046 = 14680134
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
ea 00 00 00 00 00 e0 00 08:21:45.669 FLUSH CACHE EXIT
25 00 80 a8 f7 50 e0 00 08:21:45.669 READ DMA EXT
ea 00 00 00 00 00 e0 00 08:21:45.642 FLUSH CACHE EXIT
Error 164 occurred at disk power-on lifetime: 1443 hours (60 days + 3 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 46 00 e0 e0 Error: ICRC, ABRT at LBA = 0x00e00046 = 14680134
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 80 08 00 00 e0 00 08:21:45.669 READ DMA EXT
ea 00 00 00 00 00 e0 00 08:21:45.669 FLUSH CACHE EXIT
25 00 80 a8 f7 50 e0 00 08:21:45.669 READ DMA EXT
ea 00 00 00 00 00 e0 00 08:21:45.669 FLUSH CACHE EXIT
25 00 08 a8 f8 50 e0 00 08:21:45.642 READ DMA EXT
Error 163 occurred at disk power-on lifetime: 1443 hours (60 days + 3 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
84 51 00 46 00 e0 e0 Error: ICRC, ABRT at LBA = 0x00e00046 = 14680134
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
25 00 80 3f e4 50 e0 00 08:20:48.850 READ DMA EXT
25 00 80 3f e4 50 e0 00 08:20:48.400 READ DMA EXT
ea 00 00 00 00 00 e0 00 08:20:48.399 FLUSH CACHE EXIT
25 00 08 00 00 00 e0 00 08:20:48.399 READ DMA EXT
25 00 08 00 00 00 e0 00 08:20:48.330 READ DMA EXT
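
All five logged errors show the same register values, so it may help to see how "84 51 00 46 00 e0 e0" turns into "ICRC, ABRT at LBA = 0x00e00046". The sketch below assumes the standard ATA error-register bit assignments and the 28-bit LBA layout (low nibble of DH, then CH, CL, SN); the names are mine, not smartctl's. ICRC is an interface CRC error on a UDMA transfer, which, together with UDMA_CRC_Error_Count = 120, is commonly associated with the cable or connectors rather than the platters.

```python
# Bit names for the ATA error register (a subset, assumed from the ATA spec).
ERROR_BITS = {
    0x80: "ICRC",  # interface CRC error during an Ultra DMA transfer
    0x40: "UNC",   # uncorrectable data error
    0x10: "IDNF",  # requested sector ID not found
    0x04: "ABRT",  # command aborted
}


def decode_error(er):
    """Return the names of the bits set in the error register value."""
    return [name for bit, name in ERROR_BITS.items() if er & bit]


def lba28(sn, cl, ch, dh):
    """Assemble a 28-bit LBA from the sector/cylinder/head registers."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


if __name__ == "__main__":
    # Register values from the error entries above: ER=84, SN=46, CL=00, CH=e0, DH=e0
    print(decode_error(0x84), hex(lba28(0x46, 0x00, 0xE0, 0xE0)))
```

For the values in the log this yields ICRC and ABRT at LBA 0x00e00046 (14680134), matching what smartctl printed.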
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 1447 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.