I've got an Intel linux cluster with IPMI interfaces on the nodes. Lately, the IPMI interfaces have been acting flaky. For example, I can no longer use IPMI commands to get the nodes to PXE boot, and rebooting them via IPMI works sometimes but not always.
I recently discovered that you can test the chassis and BMC with ipmitool, and this was the output:
# ipmitool chassis selftest
Self Test Results : device error
[FRU Internal Use Area corrupted]
# ipmitool bmc selftest
Selftest: device corrupted
Internal Use Area corrupted
It looks like something has gone wrong. Is there any way to restore the IPMI interfaces to their original state? (Note that I don't know what the specific IPMI hardware is here, or how to query them to find out).