Questions tagged [ecc]

Error-correcting code (ECC) memory is used in most computers where data corruption cannot be tolerated under any circumstances.

81 questions
5
votes
3 answers

Evaluating uncorrectable ECC errors and fallback methods

I run a server which has just experienced an error I've not encountered before. It emitted a few beeps, rebooted, and got stuck at the startup screen (the part where the BIOS shows its logo and begins listing information) with the error: Node0:…
Zhro
  • 349
  • 2
  • 12
5
votes
3 answers

Is there any such error logged by CentOS somewhere that can conclusively reveal "it is now time to pay for ECC"

I have a 32GB non-ECC RAM dedicated server with CentOS. Once a day it randomly crashes without any error in /var/log/kern.log, /var/log/messages, mysql, apache. CPU/RAM/IO are neither particularly high nor low. Is there any such error logged by CentOS…
wlf
  • 371
  • 2
  • 13
5
votes
5 answers

md5sum of large files gives different results sometimes

I have an AMD quad core, 8 GB RAM, 1 SSD EXT2 (2 months old), 2 HDD EXT4, approximately 1 year old. I'm using Ubuntu 10.04 x86-64 and when I compute the md5sum of large files (9 GB) sometimes I get different values than the one stored on a reference…
Emanuele
  • 203
  • 2
  • 6
4
votes
4 answers

Is an ECC RAM-enabled GPU necessary for a server, or will a normal GPU work fine in a server?

Is it a requirement for a server to use ECC RAM on a GPU while the normal CPU RAM is ECC? I'm thinking that instead of using a Quadro K6000 or AMD FirePro, we could use a GTX 980 or AMD R9 290, if possible. Also, if it is not necessarily required…
Austin
  • 43
  • 1
  • 4
4
votes
2 answers

What does "single-bit ECC errors were detected on the RAID controller" mean?

I have a Dell T7600 with a Perc H710P RAID controller and 4 attached 3TB drives. Over the past few months the RAID controller has been intermittently reporting errors on boot: "no boot device found", "adapter at baseport is not responding", disks…
jsp
  • 215
  • 1
  • 4
  • 11
3
votes
0 answers

Correct EDAC driver for Supermicro X10SLL-F

Prompted by a Debian upgrade (stretch to buster), I've replaced mcelog with rasdaemon. I'm not sure whether things work as expected though. In dmesg I can find a message like this: [ 1.871662] EDAC MC: Ver: 3.0.0 However, edac-utils is less…
yacoob
  • 131
  • 3
3
votes
3 answers

What to do in response to repeat DRAM ECC error notifications for the same memory location?

I woke up this morning to what's a first for me: one of my systems had logged DRAM ECC error notifications. Three of them, in fact, for, as far as I can tell, the exact same memory location (obviously, the system isn't actually named localhost): Aug…
user
  • 4,335
  • 4
  • 34
  • 71
3
votes
1 answer

Where are the ECC memory error counters stored?

Where are the ECC memory error counters stored: on the DIMM itself, the motherboard, or the host's disk? I'm using memtest86+, but it seems that it doesn't recognize ECC on my system, so if correctable errors occur, memtest86+ just won't see them.…
MWB
  • 187
  • 9
3
votes
0 answers

What are the risks of running a database on a server without ECC RAM?

A lot of branded servers come with ECC RAM, but it is expensive. For a database server or other critical servers, what would be the impact of not using ECC RAM? Data corruption? (I suppose the database software should handle this already,…
Howard
  • 2,135
  • 13
  • 48
  • 72
3
votes
2 answers

SAS/RAID controller non-ECC RAM

I have an Adaptec 51245 controller (I know it is old but I got it for free) that I use in my server. As far as I know it is highly recommended to use ECC RAM as system memory, but what about the RAID controller? It has 512MB DDR2 RAM. Should I turn it off?…
jojo
  • 33
  • 2
2
votes
0 answers

i7 edac: ecc error - which module?

I'm running a Xeon X3450 on a Supermicro X8SIE-F mainboard. Currently there are 4 reg. ECC DIMMs installed (each 4GB in size; installed as DIMM A-Channel1, DIMM A-Channel2, DIMM B-Channel1, DIMM B-Channel2 - according to mainboard manual page…
Paul Pech
  • 21
  • 1
2
votes
1 answer

Do Kaby Lake Pentiums support ECC?

Looking at setting up a SOHO server with a C236 Chipset and ECC RAM and wondering about the CPU to use. Skylake Pentiums (e.g., the G4400) support ECC, but prior to Kaby Lake's release, news outlets reported that ECC support was removed. Looking at…
Michael Stum
  • 4,050
  • 4
  • 36
  • 50
2
votes
2 answers

Can I mix ECC RDIMMs with different rank?

I have an Intel S5520SC motherboard with two Intel Xeon E5620 CPUs installed. It currently has six KVR13R9D4/8I DIMMs - I want to add another six DIMMs (48GB of RAM) to upgrade this workstation to 96GB of RAM. Unfortunately, Kingston has…
chippies
  • 159
  • 1
  • 6
2
votes
3 answers

ECC memory errors causing random server reboots

I'm running Ubuntu Server 14.04 on a Supermicro X10SLM-F / Xeon E3-1271 v3. Memory: SuperTalent 32GB DDR3 1600 ECC. About every 4 days, the logs on Ubuntu will show this: {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source:…
Kevin Kelly
  • 23
  • 1
  • 1
  • 4
2
votes
2 answers

Alternative file system/volume manager for ZFS w/ non-ECC RAM?

It's not recommended to use ZFS for a computer without ECC RAM. So, what's a good alternative then? Or is the risk the same, so it doesn't matter what manager I use, it'll be the same problem if a bit in RAM flips anyway? I'm trying to determine if…