We have a custom server with following details:
- Motherboard Supermicro X8DTL-3
- Raid Controller HP Smart array p400(512G BBWC)
- HDD Backplane Supermicro SAS825TQ
- 3 Seagate Barracuda HDD with 1TB(Raid 5)
- Host: Vmware ESXI 6.0
- Vm: CentOS 6.x and 7.x
My server load has been abnormally increased. When I checked I faced with raid errors 1792 and 1779 in the boot process. After re-enabling RAID we checked hard disks and they were shown OK in raid management software.
Then we tested the hard disks with SeaTools for windows(SMART, short and long dst tests). Two hard disks has serious problems and tests were failed.
In a typical HP server like DL380 G7, HDD leds change color from green to orange to indicate problems but in a custom server like ours this feature is not available.
My question is, how we can detect hard disks problem before loosing data?