Koozali.org: home of the SME Server

Obsolete Releases => SME Server 7.x => Topic started by: Graham on June 02, 2008, 09:27:53 PM

Title: Raid Issue
Post by: Graham on June 02, 2008, 09:27:53 PM
I'm having a problem with our RAID1 config.

Personalities : [raid1]
md2 : active raid1 sda2[2](F) sdb2[1]
      78043648 blocks [2/1] [_U]

md1 : active raid1 sda1[0] sdb1[1]
      104320 blocks [2/2] [UU]

It looks like it's failed; I've read the documents on the wiki but can't seem to fix it.

Any help would be greatly appreciated.
Title: Re: Raid Issue
Post by: Stefano on June 02, 2008, 09:53:59 PM
Hi..

connect via ssh to your server (or login directly if you can) and do:
Code: [Select]
mdadm -r /dev/md2 /dev/sda2
mdadm -a /dev/md2 /dev/sda2

and watch your /var/log/messages with
Code: [Select]

tail -f /var/log/messages

to see if everything goes ok..

if something goes wrong then your hd must be changed.

HTH

Ciao
Stefano

p.s. I'm sure there are many answers to your question here in the forums ;-)
Title: Re: Raid Issue
Post by: Graham on June 02, 2008, 09:59:06 PM
Thanks I'll give that a try now, did try and search the forum but couldn't find much
Title: Re: Raid Issue
Post by: CharlieBrady on June 02, 2008, 10:18:19 PM
if something goes wrong then your hd must be changed.

Most likely something already *has* gone wrong, and the hd should be changed.
Title: Re: Raid Issue
Post by: Stefano on June 02, 2008, 10:32:25 PM
Hi Charlie..

mmmhh... no..

in my experience (about 5 years with sme and sw raid) I have seen many cases of "broken" raid which goes ok again..
my home server, with 2 scsi hds, for example :-)

in this case (hd in 2 partitions and only one which seems to have problems) I'd do a test.. if both of them are failed.. well.. it's different..

all IMHO
ciao

Stefano
Title: Re: Raid Issue
Post by: CharlieBrady on June 02, 2008, 11:05:02 PM
in my experience (about 5 years with sme and sw raid) I have seen many cases of "broken" raid which goes ok again..

Well, yes, there are false positives. But it's safer to assume the drive is bad (and not too expensive these days either). At the very least, look through /var/log/messages.* for any driver errors, and check the SMART status of the driver, before thinking that it might be OK.
Title: Re: Raid Issue
Post by: Graham on June 02, 2008, 11:55:03 PM
Well it's finished, got this from the log

Jun  2 22:51:39 server-1 kernel: md: md2: sync done.
Jun  2 22:51:39 server-1 kernel: RAID1 conf printout:
Jun  2 22:51:39 server-1 kernel:  --- wd:2 rd:2
Jun  2 22:51:39 server-1 kernel:  disk 0, wo:0, o:1, dev:sda2
Jun  2 22:51:39 server-1 kernel:  disk 1, wo:0, o:1, dev:sdb2
Title: Re: Raid Issue
Post by: Gert on June 04, 2008, 09:50:10 PM
I had the same problem on a raid 5 array where one drive fell out of the array every couple second day or so. Turned out it was the sata cable. After i replaced that its been running fine for months now.