Questions tagged [smart]

Self-Monitoring, Analysis and Reporting Technology

SMART is used to monitor a hard drive's state and reliability. It tries to predict failures and warns the user when a disk is degrading.

207 questions
6
votes
3 answers

smartctl & megaraid: how to find the right device node for an adapter #

I can list physical drives on all megaraid adapters using: megacli -PDList -aALL This will display an adapter # for each adapter, and then list the physical drives attached to them. The individual devices in the PDList output also have a Device Id…
lmz
  • 379
  • 2
  • 4
  • 17
6
votes
2 answers

S.M.A.R.T. (smart) attribute 190 failing on Seagate Backup Plus 4TB drive

I am using a Seagate Backup Plus 4TB drive attached over USB to a Linux box. The drive model is ST4000DX000-1CL160. When checking the S.M.A.R.T. attributes I get: $ sudo smartctl -a -d sat /dev/sdb smartctl 5.41 2011-06-09 r3365…
mgd
  • 267
  • 1
  • 3
  • 9
6
votes
2 answers

Munin's smart plugin keeps reporting an error in the past because of the exit code

My hosting provider has inserted a hard drive into my server which seems to have had some sort of error in the past but a full offline smart check showed that everything is (about) ok at the moment. The server has a RAID1 so I can somewhat live with…
akirk
  • 338
  • 4
  • 9
5
votes
1 answer

Unit for Total_LBAs_Written in SMART data

I have 3 machines with disks (actually, SSDs) in RAID1, purposely with different brands in an attempt to make it improbable to lose them both simultaneously due to wear-out. They are functional. They should have approximately the same number of…
fgrieu
  • 365
  • 4
  • 17
5
votes
1 answer

SMART - Understanding Offline Data Collection

I have two Kingston A400 120GB SSD as cache in a Synology NAS, which don't seem to support automatic offline data collection. # smartctl -d sat -c /dev/sdc | grep -i "Auto Offline data collection" Auto Offline Data Collection: Disabled. No Auto…
Bangaio
  • 160
  • 1
  • 8
5
votes
1 answer

smart long test - what is the performance impact on busy server?

I have a busy server with a RAID 1 setup. The application (runs in PHP) is very read/write intensive to the database (MariaDB). A cronjob runs smartctl short test every day and checks the output of smartctl -H and mdadm -D. I would like to run a…
Nuno
  • 553
  • 2
  • 8
  • 26
5
votes
1 answer

HP Proliant G6 reporting failed SSD drive - alternative strategies for monitoring?

I would like to get your feedback about a controversial situation I found myself in some days ago. I was tasked to use a HP Proliant G6 for development purposes with 2 new (< 2 months, never used before) non-HP SSDs used in RAID1 configuration. They…
Deim0s
  • 172
  • 7
5
votes
1 answer

HDD is acting up, but S.M.A.R.T says everything is fine

Before I begin, a quick disclaimer. I'm basically a developer forced into a sysadmin role by circumstances, so I apologize in advance if I say something stupid or seem like I don't know what I'm doing. So, we are having problems with one of the…
panta82
  • 248
  • 2
  • 10
5
votes
3 answers

Confirm disk is broken when it passes all diagnostics

I have a system with a potentially broken disk, but the disk passes all manner of diagnostics. I have been unable to confirm that the disk is broken. What are my options? I could just replace the disk, but because this situation is very similar to…
Halfgaar
  • 8,084
  • 6
  • 45
  • 86
5
votes
2 answers

Is my hard drive about to die?

I have two hard drives set up as a RAID 1 array on my server (Linux, software RAID using mdadm) and one of them just got me this "present" in syslog: Nov 23 02:05:29 h2 kernel: [7305215.338153] ata1.00: exception Emask 0x0 SAct 0x1 SErr 0x0 action…
Hristo Deshev
  • 224
  • 2
  • 9
5
votes
2 answers

The best nagios plugin for SMART?

I've been looking for the best Nagios plugin for SMART monitoring. There are some, but they only monitor temperature, whereas smartctl exposes much more data. Do you know of better plugins that report all the SMART data?
Rafał Kamiński
  • 187
  • 2
  • 5
  • 15
5
votes
2 answers

Understanding the S.M.A.R.T. values from a Synology 1812+ device

I have a Synology 1812+ NAS with 8 3TB drives configured as RAID 5. It's running DSM 4.1. It was purchased to replace USB drives, consolidate storage and short term OS X backups using Time Machine. The device and drives are only 2 months old. Every…
5
votes
1 answer

1TB Seagate Barracuda ST31000528AS - Unexpected "Offline uncorrectable sectors"

I bought a 1TB Seagate Barracuda 7200.12 - ST31000528AS drive about 18 months ago because it had a 5-year warranty, and used it alongside a Samsung Spinpoint under Ubuntu Server as one half of a software RAID-1 array. Yesterday, smartd started…
aSteve
  • 231
  • 1
  • 4
  • 10
4
votes
0 answers

SMART Extended self-test routine changes duration

I purchased a new Western Digital Elements 3TB a couple of weeks ago. As usual, I did all SMART tests, then badblocks, then SMART tests again. When I first got the drive, SMART showed this: Short self-test routine recommended polling time: ( …
4
votes
0 answers

Troubleshooting `Buffer I/O error` on software-raid/md-device

I've got a RAID-5 mdadm array that reliably causes Buffer I/O error on dev md0, logical block 1598030208, async page read to be written to dmesg when reading that block. Of course, the read also factually fails. This behavior is consistent across…
RQM
  • 141
  • 1
  • 3