At my office, we have a server whose RAID controller (HP SmartArray) we suspect is failing. A cold boot, however, does not report any errors.
Can anyone recommend a method to stress-test the controller?
Symptoms that make me suspect a failing controller:
- Disk access getting slower, queue getting longer
- Running dmesg on the XenServer console, I see many messages similar to this one:
  end_request: I/O error, dev tda, sector 253655584
  (the sector number is never the same)
- When we move the VM to another physical host, we no longer see the above message
- When the host is idle (without any running VM), dmesg no longer emits the above message
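To put numbers on the symptoms above, the dmesg output could be tallied per device. This is only a minimal Python sketch: the function name summarize_io_errors is made up for illustration, the second sample line uses a hypothetical sector number, and it assumes the kernel messages keep exactly the format quoted above.

```python
import re
from collections import Counter

# Matches kernel lines of the form quoted above, e.g.:
#   end_request: I/O error, dev tda, sector 253655584
IO_ERROR_RE = re.compile(r"end_request: I/O error, dev (\w+), sector (\d+)")


def summarize_io_errors(log_text):
    """Return (error count per device, distinct sector count per device)."""
    counts = Counter()
    sectors = {}
    for line in log_text.splitlines():
        match = IO_ERROR_RE.search(line)
        if match:
            dev = match.group(1)
            counts[dev] += 1
            sectors.setdefault(dev, set()).add(int(match.group(2)))
    distinct = {dev: len(seen) for dev, seen in sectors.items()}
    return counts, distinct


if __name__ == "__main__":
    # First line is the message from this post; the second sector
    # number is hypothetical and only there to show the tally.
    sample = (
        "end_request: I/O error, dev tda, sector 253655584\n"
        "end_request: I/O error, dev tda, sector 1024\n"
    )
    counts, distinct = summarize_io_errors(sample)
    for dev, n in counts.items():
        print(f"{dev}: {n} I/O errors, {distinct[dev]} distinct sectors")
```

Feeding it the saved dmesg text from the suspect host and from the healthy host would give comparable per-device counts. Errors spread over many different sectors would fit what we see, but the count alone does not prove that the controller, rather than a disk or cable, is at fault.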
A search on Google indicated that the above message is most commonly associated with a failing SmartArray controller.
How can I be sure that the SmartArray controller is failing?