Koozali.org: home of the SME Server

broken system after power fail

Offline jameswilson

  • *
  • 794
  • +0/-0
    • Security Warehouse, trade security equipment
Re: broken system after power fail
« Reply #15 on: October 26, 2013, 09:34:29 PM »
i mean i should be able to boot it in a degraded state

Offline _alex

  • ****
  • 103
  • +0/-0
Re: broken system after power fail
« Reply #16 on: October 26, 2013, 11:07:09 PM »
You should backup your data now, a degraded raid may not survive a reboot.
Once done, check all your disks condition with smartctl (and maybe badblocks if You want to double check)

You may try to rebuild your failed raid (add the ficked disk), but personnaly, I would rebuild the whole thing.
Ah, and don't forget to setup a daily backup and upsd ;)

Offline jameswilson

  • *
  • 794
  • +0/-0
    • Security Warehouse, trade security equipment
Re: broken system after power fail
« Reply #17 on: October 26, 2013, 11:10:22 PM »
well i use hdparm to identify the bad sdb. replaced drive and use the recusecd to resync md1. Rebooted, it came up into sme and added the hot spare (sdg) into the array. its now resyncing. I have other issues but in 2000 mins i should have a stable md2

Offline stephdl

  • *
  • 1,523
  • +0/-0
    • Linux et Geekeries
Re: broken system after power fail
« Reply #18 on: October 27, 2013, 10:26:55 AM »
you should setup a wiki page on your adventure for the next man concerned  with this kind of issues.
See http://wiki.contribs.org/Koozali_Foundation
irc : Freenode #sme_server #sme-fr

!!! Please write your knowledge to the Wiki !!!

Offline jameswilson

  • *
  • 794
  • +0/-0
    • Security Warehouse, trade security equipment
Re: broken system after power fail
« Reply #19 on: October 27, 2013, 06:15:50 PM »
good shout

Its far from  finished yet, ive never had this happen before, but the arrays are wierd atm

Offline jameswilson

  • *
  • 794
  • +0/-0
    • Security Warehouse, trade security equipment
Re: broken system after power fail
« Reply #20 on: October 27, 2013, 07:00:33 PM »
Quote
[root@sme-big ~]# cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4] [raid1]
md1 : active raid1 sdb[1]
      104320 blocks [7/1] [_U_____]

md127 : active raid1 sda1[0] sdc1[2] sdd1[3] sde1[4] sdf1[5] sdg1[6]
      104320 blocks [7/6] [U_UUUUU]
        resync=DELAYED

md2 : active raid6 sdg2[6] sdf2[5] sde2[4] sdd2[3] sdc2[2] sda2[0]
      7813629952 blocks level 6, 256k chunk, algorithm 2 [6/5] [U_UUUU]
      [================>....]  recovery = 82.3% (1609272576/1953407488) finish=326.8min speed=17546K/sec

unused devices: <none>

Need to sort out the sdb mistake i made