I had to do a wipe and reinstall.
I rsync ibays and users directories to another computer.
After reinstalling SME, upgading, setting up domains, users, etc from scratch, every aspect of SME works as well as before. User's IMAPs was not recovered. Their files are on the other computers in case some day I decide to attempt to recover them. I'm not sure where to start.
The server is actually running better than before (go figure). My SMB users can now once again login. VPN is working again. Everyone is happy accpet for the loss of mail. Most users are POPing their mail now instead of IMAP because they don't trust the server. It's going to take a while to regain that trust from them and my boss.
This task was also the attempt to reduce the number os servers we have running from three to one. With the inabilty to get one of the five hard drives not to be left unsed as a spare, we are still running short of spacce and will have to go through this again in mid-sumer (2007). But atleast we're down to two servers. I will be searching bugtracker about the hard drive problem. I always seem to run into problems when I have an odd number of hard drives installed.
So in the mean time, I am looking into the DAR site (I'm unnderstanding that better than the initial read of DAR for SME). Maybe after I under stand it, I can better understand DAR for SME. Also looking into what someone suggested: AFFE(?).
My old traditional UNIX head buddies are recommending I take a look at CPIO (and I thought rsync command lines are scary). I'm scare of TAR for very large backup files (lots of large fraphic files for a newspaper). Some one else said to look into dd and dump.
I think (no one can ever be 100% sure) that we have our nearby/short term and offsite/long term archiving under control. It's that bare iron eight hours to deadline restore we are lacking. Recovering enough mail AND being able to access it would be nice.
It use to be easy last year with a few files on a old Compaq 6000 server (upon which SME no longer works), now it is becoming a real concern on our slower, older standby server now running RAID5.
Again, I say pilot error, not a bug. Need a better parachute.
Thanks to everyone that help. I learn quite a bit and have even more respect for SME as an appliance and a platform for developers to deliver plug and play solutions. My users don't even realize how many fewer problems they've had since mgration from our old OS X Server 10.3.X on a G3 for basic services.
I (me) will be donating (couldn't swing it in the budget last year during the beta/test phase).
-cljunkie
PS-Any one need a Compaq Proliant 6000 for parts?