2

I'm running a Xeon X3450 on a Supermicro X8SIE-F mainboard. Currently there are 4 reg. ECC DIMMs installed (each 4GB in size; installed as DIMM A-Channel1, DIMM A-Channel2, DIMM B-Channel1, DIMM B-Channel2 - according to mainboard manual page 2-11).

Very recently, this error was logged (and nothing more):

EDAC i7core: New Corrected error(s): dimm0: +0, dimm1: +0, dimm2 +3 

So I assume that 3 correctable errors were detected. But which DIMM is affected?

If I run:

find /sys/devices/system/edac/mc/mc0 -type f -name "*count" -exec cat {} \;

all counters show up as 0.

But

cat /sys/devices/system/edac/mc/mc0/all_channel_counts/udimm2

shows 3 errors.

There are two csrows listed: csrow0, csrow1

csrow0 labels:

./ch0_dimm_label CPU#0Channel#0_DIMM#0
./ch1_dimm_label CPU#0Channel#1_DIMM#0

csrow1 labels:

./ch0_dimm_label CPU#0Channel#0_DIMM#1
./ch1_dimm_label CPU#0Channel#1_DIMM#1

I read

https://www.kernel.org/doc/html/v4.10/admin-guide/ras.html

but can't figure out which dimm corresponds to all_channel_counts/udimm2.

Any help is greatly appreciated.

Paul

Paul Pech
  • 21
  • 1

0 Answers0