Hi,
we have seen strange problems on 2 recent servers that we have never had before:
They lock up in a way that there is no console (black screen only on Alt-F1, -F2, etc), no access from workstations on the LAN (all Samba clients loose their connection) and no access from remote (ssh, https) through the external NIC.
The server's hardware components are running (HDs, PSU, etc).
I had to switch off/on to get them started again. One server did this for the first time after about 3 weeks after it's initial install and a second time about 10 days later, the second server 16 days after it's installation.
And No, they are not stock SME 5.5. We add sysmon, awstats, upgraded to Samba 2.2.5 plus more but all of those we have done also on other servers without problems.
The only common denominator that I can see at the moment seems to be the hardware for both:
1 x Athlon XP 2000, 1 x XP 1800 on KT3 ULTRA2 DDR MS-6380E mainboard (VIA KT333 Chipset), 1 x 512, 1 x 256 MB RAM, Accussys IDE RAID controller, 60/40 GB HDs.
Both servers are protected with UPS's of course. On 1 server we had IDE DMA access enabled, on the other it was disabled.
After the restart I checked every (?) single log file for something looking strange or unusual but could not find anything. Eneo system monitor also does not show any higher or unusual resource usage at the moment when the server stops working. I don't believe that there was a very high temperature those days when it happened.
I am a bit lost where to start to analyse this. Help/ideas would be much appreciated.
Thanks,
Michael Doerner