Questions tagged [drive-failure]

118 questions
2
votes
1 answer

Replacing a self-encrypting hard drive in an IBM xSeries server RAID

I have a failed drive in a RAID 5 array in an IBM x3650, Machine Type 7945. The RAID is composed entirely of original IBM SED SAS drives, 300 GB each. Unfortunately, I am the purchasing tech, not the one who goes to the server (just started this week)…
2
votes
1 answer

S.M.A.R.T. long self-test: does the test continue after finding bad blocks?

I've recently been watching a SMART-enabled HDD closely (connected to an OS X Server, which is not very helpful with SMART output out of the box). The drive is definitely failing: the heads click, SMART tests fail (despite SMART overall-health…
questions
2
votes
1 answer

Linux RAID 1: right after replacing and syncing one drive, the other disk fails - understanding what is going on with mdstat/mdadm

We have an old RAID 1 Linux server (Ubuntu Lucid 10.04), with four partitions. A few days ago /dev/sdb failed, and today we noticed /dev/sda showed ominous pre-failure SMART signs (a reallocated sector count of ~4000). We replaced /dev/sdb this morning and…
2
votes
3 answers

How to recover from a resize2fs failure

I was resizing my hard drive last night and was not successful. My system and drives are local on an ESXi 5.1 VM. I'm running Debian 6 x64 and have a 2 TB mount that I was resizing. It was about 1.8 TB, and I was resizing it to the full 2 TB. I ran e2fsck…
tdbui22
1
vote
2 answers

Recover hard drive after abrupt power cut

The following is my partition table: mercurial@providence:~$ sudo fdisk -l Disk /dev/sda: 465.78 GiB, 500107862016 bytes, 976773168 sectors Disk model: ST9500420AS Units: sectors of 1 * 512 = 512 bytes Sector size (logical/physical): 512 bytes /…
Mercurial
1
vote
0 answers

MegaRAID SAS Phy is bad on enclosure, no SAS address

I've inherited a MegaRAID SAS 24-bay RAID that is also attached to two additional enclosures (one 24-bay and one 12-bay). I attempted to load 6 new drives into 6 consecutive empty drive bays on the 12-bay enclosure (bays 6 - 11) and all 6 showed…
BurningKrome
1
vote
1 answer

OfflineUncorrectableSector & CurrentPendingSector emails - what should I do?

Running CentOS 7: I've just done a yum update followed by a reboot on one of my servers, and when it came back up I got two emails: First: OfflineUncorrectableSector Device: /dev/sdb [SAT], 28 Offline uncorrectable…
Codemonkey
1
vote
1 answer

HP server with failed disk in hardware RAID 5, how to replace (have same-spec disk)?

My system admin suddenly took emergency leave and I am not very good at system administration, so I am seeking your help. I have an HP server running VMware. The server contains a hardware RAID 5 with three SAS HDDs. One of the HDDs is showing a warning sign and I…
Genuity Systems
1
vote
0 answers

Removing a failed RAID 0 volume from the CentOS 6 boot sequence

I need help. To date, the majority of my Linux 'software RAID' use has been done via the graphical installer. Now one of the drives comprising my RAID 0 volume has failed, and at boot the system bails out to a command line so I can fix the problem.…
ChiefEngr
1
vote
1 answer

Is there any way to mount an EXT4 file system with errors?

I may have been somewhat foolish, but I kept putting off dealing with the warning signs of impending failure on my EXT4-on-LVM Ubuntu machine (the partition remounting itself read-only, SMART errors, etc.), and one day fsck showed "filesystem still has…
JonTheNiceGuy
1
vote
2 answers

Repeated disk failure on Dell T610 Server

I purchased a used PowerEdge T610 and upgraded it to 2x hex-core Xeon X5675 processors and 96 GB RAM. Initially, I used 3 WD Green 2 TB drives in a RAID 5 array (PERC 6/i controller) and installed Ubuntu Server on the virtual disk. This setup served me…
1
vote
0 answers

smartd: suppress/change level for FAILURE PREDICTION THRESHOLD EXCEEDED

I'm using smartd to monitor drive health. It started complaining about FAILURE PREDICTION THRESHOLD EXCEEDED for one drive, but I extensively tested that drive and it's fully functional. That drive is one of many in this machine. How can I disable or…
Matthias
1
vote
0 answers

Can ESXi pause VMs on a datastore in APD state?

I know that using VMCP on VMware ESXi, it's possible to make the ESXi host shut down or restart a virtual machine when a datastore that the VM resides on goes into All Paths Down (APD) state. I'm looking to know if it's possible to pause (as in,…
Josh
1
vote
1 answer

How to identify which disk failed on RAID 10?

RAID 10 set up from 16 SATA (4 TB) disks. How can I identify from the pictures which disk has failed? Here are my screenshots: The output of lspci: 00:00.0 Host bridge: Intel Corporation Xeon E7 v2/Xeon E5 v2/Core i7 DMI2 (rev 04) 00:01.0 PCI bridge:…
Gani Rakhmatov
1
vote
1 answer

RAID 10 - 2 Disk Failure, how to know which drives are mirrored and if data is safe?

The server is currently experiencing a 2-disk failure and I am looking for a way to know whether my data is lost. I tried searching everywhere but didn't find an answer (I'm sorry - new to this). I ran (cat /proc/mdstat): Personalities : [raid10]…