Koozali.org: home of the SME Server
Legacy Forums => General Discussion (Legacy) => Topic started by: BobWilliams on August 25, 2004, 09:39:05 PM
-
Can somebody tell me where to find Darrell May's "Raid1 Monitor How To"? I have looked on the forum and Contribs but can't seem to find it.
Thanks
Bob...
-
http://mirror.contribs.org/smeserver/contribs/dmay/mitel/contrib/raidmonitor/
-
Thats it. Thanks. I didn't see it before when I looked.
Bob...
-
OK, I read Darrell May's "Raid Monitor How To" and installed RaidMonitor. Works great. But now I receive 10 consecutive messages from RaidMonitor with the following content every day:
ALARM! RAID configuration problem
Current configuration is:
Personalities : [raid1]
read_ahead 1024 sectors
md2 : active raid1 hda3[0] hdc3[1]
262016 blocks [2/2] [UU]
md1 : active raid1 hda2[0] hdc2[1]
32662656 blocks [2/2] [UU]
md0 : active raid1 hda1[0] hdc1[1]
102208 blocks [2/2] [UU]
unused devices: <none>
Last known good configuration was:
Personalities : [raid1]
read_ahead 1024 sectors
md2 : active raid1 hdc3[1] hda3[0]
262016 blocks [2/2] [UU]
md1 : active raid1 hdc2[1] hda2[0]
32662656 blocks [2/2] [UU]
md0 : active raid1 hdc1[1] hda1[0]
102208 blocks [2/2] [UU]
unused devices: <none>
Only thing I can see different is that the dev addresses are reversed. i.e. good entry hdc3[1] hda3 [1] bad entry hda3
I don't understand whats wrong. Raid-1 seems to be working just fine.
Bob Williams...
-
run raidmonitor -iv to save the current status.
-
Thanks Boris..
Yes, I do that but the messages come back again!
So far everything worked out great. I simulated a primary master and secondary master disk drive failure and recovered from both with out any problem.
I feel confident that I can support this server as far as disk failures are concerned. Even used the Emergency Boot diskette when the primary master failed and the server wouldn't boot. My configuration has a spare disk drive with the exact same drive as both masters and I have it partitioned and formated ready to go in case of a failure. On my next server build if I have to, can I use a larger drive as a spare in case I can't find an exact same drive for a replacement?
Bob...
-
>>> On my next server build if I have to, can I use a larger drive as a spare in case I can't find an exact same drive for a replacement?
Yes you can. Some space on it will be left unused, but otherwise it would work.
-
.... but the messages come back again ...
I solved it the following way. Not elegant, but it worked out for me and raidmonitor now keeps quit. I think it is a config issue with lilo sometimes recognizing false partition data/drive geometry/partition tables reported through the bios.
- take a look cat /proc/mdstat for your convenience
- take a look here for lsraid
and mdadm
and other raid-related data http://www.tldp.org/HOWTO/Software-RAID-HOWTO.html#toc6
I had to rebuild the array, you don't have to as i see from your posting.
- uninstall raidmonitor via rpm -e.
- delete the raidmon cron-entry in /etc/cron.d
- make a backup of /etc/lilo.conf
- issue /sbin/lilo to write a new lilo.conf
- post-upgrade, reboot, reinstall raidmon an raidmonotor -iv it.
- wait
hope it helps