On a linux server (8x Quad-Core AMD 8378), I'm getting the following errors:
[Hardware Error]: MC4_STATUS[-|CE|MiscV|-|AddrV|CECC]: 0x9c294c00001d018b
[Hardware Error]: Northbridge Error (node 4): ECC error in L3 cache tag.
[Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: SNP
[Hardware Error]: Machine check events logged
This has happened three times during the last month, but never before (server running for 3 years).
From a quick google-search, it seems this is a serious matter.
However, the vendor support technician said:
I have seen these errors MANY times, and unless you are overclocking your CPU - or have had a fan failure or similar - it is VERY unlikely to be a processor problem. It is more likely that the kernel is misreporting the error.
So - is this a critical error and I should order new parts (replace CPU?) or ignore it?
Many thanks.