Koozali.org: home of the SME Server

Harddrive errors?

Salmoneus
« on: September 12, 2008, 11:12:36 AM »
One of our SME 7.3 servers is writing this to the messages log at least once every 5 minutes:

Sep 12 11:00:01 server kernel: hda: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Sep 12 11:00:01 server kernel: hda: drive_cmd: error=0x04Aborted Command

The server runs on physical hardware with a RAID-5 array of three disks, and none of them has an alert LED lit. So I assume the "drive" is physically OK.

So what could the problem be? How can I proceed in my troubleshooting? Any hints?

/Salmoneus
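
For anyone reading the two log lines above: the names in braces are just the bits that are set in the IDE status byte, and error=0x04 is the "aborted command" bit of the error register. Here is a minimal sketch that decodes a status value, assuming the standard ATA status register bit layout. `decode_ide_status` is a made-up helper name for illustration, not an existing tool.

```shell
#!/bin/sh
# Decode an IDE/ATA status byte (e.g. 0x51) into flag names like the ones
# the kernel prints in braces. decode_ide_status is a hypothetical helper;
# the bit values assume the standard ATA status register layout.
decode_ide_status() {
    status=$(( $1 ))
    out=""
    for pair in 0x80:Busy 0x40:DriveReady 0x20:DeviceFault 0x10:SeekComplete \
                0x08:DataRequest 0x04:CorrectedError 0x02:Index 0x01:Error; do
        mask=${pair%%:*}
        name=${pair#*:}
        # Append the flag name when its bit is set in the status byte.
        if [ $(( $status & $mask )) -ne 0 ]; then
            out="$out $name"
        fi
    done
    echo "${out# }"
}

decode_ide_status 0x51
```

For 0x51 this prints the same three flags as the log line, so the message says the device was ready, finished its seek, and rejected the command. That alone does not tell you which kind of device hda is.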


zatnikatel
« Reply #1 on: September 12, 2008, 03:20:40 PM »
Is everything still running OK? Is the RAID separate from the SME box? Are you running a RAID card? Can you tell us a little more about your setup, please?

pmstewart
« Reply #2 on: September 12, 2008, 10:47:25 PM »
You could try running the SMART commands on the drive and see if this produces any useful info:

Code:
smartctl -a /dev/hdX

(Replace X with the letter of the drive in question.)

Best of Luck......

Paul

byte
« Reply #3 on: September 12, 2008, 11:39:56 PM »
Quote from: Salmoneus
So what could the problem be? How can I proceed in my troubleshooting? Any hints?

I normally find that those messages point to hard drive issues, and sooner or later the drive will (or at least could) fail. It could be a cable issue, but that is not likely.

Salmoneus
« Reply #4 on: September 15, 2008, 10:35:33 AM »
It is a Dell PowerEdge 2650 with three internal SCSI disks configured as RAID-5. All disk indicators are green.


smartctl -a produces this output:

[root@server ~]# smartctl -a /dev/sda
smartctl version 5.33 [i686-redhat-linux-gnu] Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

Device: DELL     Virtual Disk 0   Version: V1.0
scsiModePageOffset: response length too short, resp_len=4 offset=4 bd_len=0
>> Terminate command early due to bad response to IEC mode page
A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.


The '-T permissive' switch doesn't make it better...


CharlieBrady
« Reply #5 on: September 15, 2008, 09:02:32 PM »
Quote from: Salmoneus
Device: DELL     Virtual Disk 0   Version: V1.0

Disable the virtual disk. You should read the Dell DRAC documentation. My guess is that Ctrl-E during bootup will get you to the right config screen.

Salmoneus
« Reply #6 on: September 16, 2008, 07:35:23 AM »
But something must have happened, as these log entries have started to appear only recently.

What exactly is a "virtual disk" in this case? It is the only disk I've got and smartctl doesn't like it...




CharlieBrady
« Reply #7 on: September 16, 2008, 03:37:58 PM »
Quote from: Salmoneus
What exactly is a "virtual disk" in this case?

Ask Dell (or google), or read the documentation, as I have already suggested.

Salmoneus
« Reply #8 on: September 16, 2008, 08:45:13 PM »
So the virtual disk is the RAID array? But I cannot disable the disk that the SME system is installed on.
That doesn't make sense...


CharlieBrady
« Reply #9 on: September 16, 2008, 09:05:03 PM »
Quote from: Salmoneus
One of our SME 7.3 servers is writing this to the messages log at least once every 5 minutes:

Sep 12 11:00:01 server kernel: hda: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Sep 12 11:00:01 server kernel: hda: drive_cmd: error=0x04Aborted Command

hda is probably your CD drive. Check dmesg.
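
A quick way to do that check is to filter the kernel boot messages for the device name. This is only a sketch: `lines_for_device` is a made-up helper name, and it assumes the kernel's probe line for the device is still in the dmesg buffer.

```shell
#!/bin/sh
# Print every line on stdin that mentions a device name (e.g. hda) as a
# whole word. lines_for_device is a hypothetical helper for illustration.
lines_for_device() {
    grep -w -- "$1"
}

# Typical use on the server (not run here):
#   dmesg | lines_for_device hda
# With the old IDE driver, the media type should also be readable from
# /proc/ide/hda/media (assumption: the path exists on this kernel).
```

If the probe line names a CD/DVD-ROM model, the errors are coming from the optical drive and not from the RAID array.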

Salmoneus
« Reply #10 on: September 17, 2008, 09:01:45 PM »
Probing IDE interface ide0...
hda: SAMSUNG CD-ROM SN-124, ATAPI CD/DVD-ROM drive

It sure is. Thanks for that one!
But what is constantly trying to access the cd-rom drive?

CharlieBrady
« Reply #11 on: September 17, 2008, 09:13:37 PM »
Quote from: Salmoneus
Probing IDE interface ide0...
hda: SAMSUNG CD-ROM SN-124, ATAPI CD/DVD-ROM drive

It sure is. Thanks for that one!
But what is constantly trying to access the cd-rom drive?


Probably haldaemon, which polls for media change.
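
One way to test that guess is to count the error lines in the messages log, stop the daemon for a while, and count again. A rough sketch, assuming the log is /var/log/messages and that the service is named haldaemon on this install. `count_drive_cmd_errors` is a made-up helper name.

```shell
#!/bin/sh
# Count kernel drive_cmd status lines for one device in syslog text read
# from stdin. count_drive_cmd_errors is a hypothetical helper.
count_drive_cmd_errors() {
    grep -c "kernel: $1: drive_cmd: status="
}

# Possible use on the server (not run here; service name is an assumption):
#   count_drive_cmd_errors hda < /var/log/messages
#   service haldaemon stop
#   ...wait longer than the usual interval, then count again...
#   count_drive_cmd_errors hda < /var/log/messages
#   service haldaemon start
```

If the count stops growing while the daemon is stopped, the polling is the likely source. If it keeps growing, something else is touching the drive.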