Questions tagged [mcelog]
5 questions
2
votes
1 answer
Finding the source of a (memory read) hardware error
When logging into my server, I'm seeing lots of these errors:
Message from syslogd@****** at May 31 20:06:59 ...
kernel:[500570.908383] mce: [Hardware Error]: PROCESSOR 0:206d7 TIME 1622484419 SOCKET 0 APIC 0 microcode 71a
Message from…
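A log line like the one in this excerpt can be split into its fields for easier triage. The following is a minimal sketch, not part of the original question. The pattern `MCE_LINE` and the helper `parse_mce_line` are hypothetical names, and the code assumes that `TIME` is seconds since the Unix epoch and that the identifier fields are printed in hexadecimal.

```python
import re
from datetime import datetime, timezone

# Hypothetical pattern for the "PROCESSOR ... microcode ..." line quoted above.
# Assumption: the CPUID, APIC and microcode fields are hexadecimal strings.
MCE_LINE = re.compile(
    r"PROCESSOR (?P<vendor>\d+):(?P<cpuid>[0-9a-f]+) "
    r"TIME (?P<time>\d+) SOCKET (?P<socket>\d+) "
    r"APIC (?P<apic>[0-9a-f]+) microcode (?P<microcode>[0-9a-f]+)"
)

def parse_mce_line(line):
    """Return the fields of an mce PROCESSOR line as a dict, or None if no match."""
    match = MCE_LINE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    # Assumption: TIME is a Unix timestamp in seconds; convert it to UTC.
    fields["when_utc"] = datetime.fromtimestamp(int(fields["time"]), tz=timezone.utc)
    return fields

sample = (
    "kernel:[500570.908383] mce: [Hardware Error]: PROCESSOR 0:206d7 "
    "TIME 1622484419 SOCKET 0 APIC 0 microcode 71a"
)
record = parse_mce_line(sample)
print(record["cpuid"], record["when_utc"].isoformat())
```

For the sample line, the decoded time is 18:06:59 UTC on May 31, which is consistent with the 20:06:59 syslog timestamp if the server's local time is two hours ahead of UTC.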

dvilela
- 121
- 3
1
vote
1 answer
Server kernel panic after boot, don't know what to make of the logs
We just received a brand-new dual-CPU server, and it keeps crashing with a kernel panic shortly after booting; this even happened during OS setup while it was idle. I was able to get the OS installed and enable mcelog to try and understand what is…

ItsJustMe
- 1,001
- 1
- 8
- 10
1
vote
1 answer
"Intel QPI physical layer detected a QPI in-band reset but aborted initialization"
I have a Linux server that has logged the following mcelog error:
Hardware event. This is not a software error.
MCE 0
CPU 0 BANK 20
MISC 800000
TIME 1476167381 Tue Oct 11 06:29:41 2016
MCG status:
MCi status:
Corrected error
MCi_MISC register…

Linker3000
- 668
- 1
- 5
- 14
0
votes
1 answer
mcelog and HP BL460: understanding a DIMM error
As the title says, on one of my BL460s I have Red Hat installed, and a recurring message in /var/log/messages from the mcelog daemon tells me:
mcelog: Corrected memory errors on page 61a5dd000 exceed threshold 10 in 24h: 10 in 24h
mcelog: Location…

drkmkzs
- 311
- 1
- 2
- 8
0
votes
0 answers
MCE Errors but no edac-util errors?
I have an older HP Z440 tower with 4x8GB ECC DDR4, running Proxmox VE 6.4.
Recently, it started showing MCE errors every few seconds. I installed rasdaemon and can see that they are memory read errors. However, edac-util doesn't show any sign of…

Dustin Lewis
- 1
- 1