Thanks for the response. I'll check the BIOS and hardware as you suggested. One question, though: the box has ECC RAM. Would that make a difference? I mean, even if error correction is not working properly, wouldn't one expect a log entry to be written when a memory error occurs? I'm asking because it's a production server, and while it is not business critical, running a reliable memtest takes quite some time. If possible, I would therefore like to narrow down the problem as precisely as I can before taking the box down altogether.
Thanks.