Questions tagged [drive-failure]
118 questions
3
votes
1 answer
Isolate a failed SSD drive
I have recently had a Samsung 840 Pro 256 GB SSD fail on our server. The drive was configured in a RAID 1 array and a reboot was sufficient to bring things back online.
When looking at the Intel Rapid Storage dashboard I now have the following…

QFDev
- 395
- 2
- 4
- 12
3
votes
1 answer
Is there an automated way to report and open cases with HP for drive failure?
I want to take failed drives, send a diagnostic report to HP so I can automatically have the drives shipped to the right location.
I'm considering writing C# code to automatically log in, post a case below, with the drive diagnostics as an…

makerofthings7
- 8,911
- 34
- 121
- 197
3
votes
1 answer
Taking an image of a failing boot drive
Ok, so my Ubuntu 12.10 install seems to have gone slightly sideways. It currently boots from a Compactflash card that I was planning on replacing soon, but as luck would have it it's gone down now. The boot process gets to a certain point and then…

alanbeard
- 141
- 9
3
votes
1 answer
Smart Array P400i Physical Drive failed after being replaced
We have replaced the bad drives in a few HP systems (ProLiant DL360 G5), but they continue to show failed. The drives are part of RAID, and I think this might be why they are appearing as failed. The firmware/model is not the same, although they are…

EGr
- 609
- 4
- 14
- 29
3
votes
1 answer
reliably and automatically determine connection path of physical position of HDD from /dev/sdX device file
This is kind of a FAQ, but all the answers I found so far are not suited for full automation, which is what I need. So here it goes again.
In Linux:
Is there a reliable way to resolve the udev device name of a HDD (e.g. "/dev/sdg") into it's data…

Jost
- 31
- 2
3
votes
1 answer
HDD Health tool for CentOS?
Does anyone know some good tool for checking HDD failure in Cent OS (Red Hat Systems) without downtime.
And tell me how to use it.
Thanks.

Luka
- 375
- 5
- 21
2
votes
0 answers
How to identify which disk is failing in a HP ProLiant RAID controller?
I have a hard time identifying which drive is failing in my HP ProLiant DL360p Gen8. It has the following RAID controller: Smart Array P420i. I see tons of errors in dmesg:
[40425140.998750] sd 0:1:0:1: [sdb] Unaligned partial completion…

user3125731
- 347
- 5
- 9
2
votes
1 answer
MEGARAID RAID 1 SSD WITH HOT SPARE AVAILABLE
I do not know whether this requires immediate action or not. W2012R2 server with two 500GB SSDs plus one identical Hot Global Spare. RAID is Megaraid. Screenshots from the RAID Utility are attached below. This is a business in a small town where we…

RegularExpression
- 175
- 1
- 4
2
votes
2 answers
ZFS replace disks in freebsd, how and when
I have the following zpool configuration:
zpool status
NAME STATE READ WRITE CKSUM
data ONLINE 0 0 0
raidz2-0 ONLINE 0 0 0
da1 ONLINE 0 0 0
da0 ONLINE…

Sebastian
- 145
- 6
2
votes
1 answer
Do more mirrors in raid 1 reduce the likelihood of spit-brain failures?
I'm setting up a server, and my predecessor had a habit of setting up multiple RAID 1s with only 2 disks, which eventually resulted in split-brain situations which were difficult to recover from.
Since I want the same speed of access, would adding a…

tudor -Reinstate Monica-
- 1,805
- 18
- 36
2
votes
3 answers
Which is the worst-case scenario: dying HDD or SSD?
We all know SSD offer many advantages (such as speed) over HDD drives but which one is safer when it comes to data rescue if, for instance, the drive is failing?
From my understandings on SSD drive data is stored on-chip and when chip dies all data…

Luka
- 375
- 5
- 21
2
votes
1 answer
How to scan an entire empty ZFS raid-z array for bad sectors (blocks)?
Is there a method for scanning all of the devices in a raid-z array for bad sectors (blocks) simultaneously or must this be performed on a device-by-device basis with a separate drive testing application?
So far scrub just doesn't seem to scan…

ylluminate
- 1,155
- 2
- 17
- 35
2
votes
1 answer
Volume Rebuild Failure after disk replacement in HP MSA1500/MSA20
I have a historically flaky MSA1500 with 2xMSA20 enclosures attached. Recently a disk failed in one of the enclosures.
The LCD display reports that interim state recovery for all volumes was successful.
On hot-swapping the failed drive, one of the…
user9797
2
votes
1 answer
Linux software RAID becomes unresponsive after removing a disk from server
I am running a CentOS 7 machine (standard kernel: 3.10.0-327.36.3.el7.x86_64) with a software RAID-10 over 16x 1 TB SSDs (to be more precise, there are two RAID arrays on the disks; one of the arrays is providing the host's swap partition). Last…

user3693367
- 155
- 8
2
votes
1 answer
Should I expect both disks in my RAID1 array to fail at about the same time?
I have an Ubuntu server with two HDD drives in a software RAID1 array. These are identical drives, and I'm assuming they're seeing the same level of read/write activity over their lifetimes (correct me if I'm wrong).
Does that mean it's reasonable…

kontextify
- 189
- 1
- 10