Questions tagged [smart]

Self-Monitoring, Analysis and Reporting Technology

Self-Monitoring, Analysis and Reporting Technology

This used to monitor a hard drive's state and reliability. It tries to predict failures and warns the user when a disk is degrading.

207 questions
2
votes
1 answer

Using smartd to monitor eSATA hard drive?

I'm using smartd to monitor the S.M.A.R.T. health of the internal hard drives on my file server and alert me to signs of impending doom. I would also like to monitor the external eSATA hard drives I'll be using with it, but I'm not sure how to…
Kromey
  • 3,641
  • 4
  • 25
  • 30
2
votes
1 answer

RAID controller detecting SSD as foreign after every reboot

My server, an R610, has been detecting my SSD as foreign after every reboot. It has a PERC 6/i RAID controller, but there's no actual RAID stuff (I just have all the drives as their own VDs). This had happened a few times over the last year or two,…
2
votes
2 answers

Relatively new WD Red Pro yielding ATA status: 41 (DRDY ERR), error: 40 (UNC ) on FreeBSD 12.2

I am running a TrueNAS server based on FreeBSD 12.2. I migrated the storage to 10 TB WD Red Pro. They're running for 42 days now. Out of the sudden, during a ZFS scrub, one of the disks yielded 5 errors. All of them more or less…
Peit
  • 131
  • 5
2
votes
1 answer

HDD spins up spuriously for no reason

I have four HDDs in my NAS. Three (Western Digital, all the same model) are put in standby mode (spin down) by hd-idle and they stay in standby until I use them. The reason I use hd-idle and not the internal power saving mechanism via hdparm (-S XX…
Lazarus535
  • 235
  • 2
  • 6
2
votes
2 answers

HDD SMART interpretation

I need your opinion if the drive below is failing. When I run "smartctl -a /dev/sda -d megaraid,1", 2 errors are posted at the end of the output, stating "Error: WP at LBA". I don't see anything suspicious in the SMART parameters. Here is the…
Alexandru
  • 23
  • 4
1
vote
1 answer

UDMA CRC SMART Reporting, Alternative Software/logs for Diag

* Update * As it turns out the human readable portion of SMART reporting numbers is pretty useless for the UDMA CRC errors and you just have to track the RAW value. After flushing through over a dozen hard drives or so I never saw the readable…
Whyudodis
  • 11
  • 3
1
vote
1 answer

S.M.A.R.T. Attributes from OWC Mercury Rack Pro

I'm having difficulty getting SMART Attributes from drives in an OWC Mercury Rack Pro. I can successfully get all the drive info, but I get nothing past the START OF READ SMART DATA SECTION. It is currently connected via eSATA to a…
sevve
  • 21
  • 4
1
vote
1 answer

How to derive bytes written from SMART's host_write_commands?

Using either smartctl 7.0 or nvme 1.7, I get the following data from the SMART log data_units_written : 350,371,149 host_write_commands : 2,974,115,785 Via smartctl, the first line also shows [179 TB], which is…
Gaia
  • 1,855
  • 5
  • 34
  • 60
1
vote
0 answers

Something's wrong w Fedora Core 31 gnome-disks; drops SMART data for unknown reason, THEN doesn't show available disks at all

As discussed here, I've been working to load a great many smaller disks' data to a larger storage repository. The system in question is Fedora Core Server 31 - out just a week ago or less. Of course, I added a few tools to the base Server download…
Richard T
  • 1,206
  • 12
  • 29
1
vote
0 answers

How to collect historical data using smartctl?

I'm trying to do some analysis on the SMART stats of disk drives. Unfortunately, I'm not collecting/storing smart stats data daily for the analysis. I will be writing some procedure to collect it from now. I was thinking, that disk drives has its…
zubug55
  • 111
  • 1
1
vote
1 answer

Is there ready S.M.A.R.T monitoring toolkit for NVME disks (exporter,source,board)?

I already found node_exporter and grafana board for HDD S.M.A.R.T. Can You please get advice or url for ready exporter or other toolkit? I'm trying to wrote my own "messy" text exporter for prometheus, but I think that there must be ready…
Kein
  • 131
  • 3
  • 14
1
vote
1 answer

OfflineUncorrectableSector & CurrentPendingSector emails - what should I do?

Running Centos 7: I've just done a yum update followed by a reboot on one of my servers, and when it came back up I got two emails: First: OfflineUncorrectableSector Device: /dev/sdb [SAT], 28 Offline uncorrectable…
Codemonkey
  • 1,086
  • 4
  • 19
  • 41
1
vote
1 answer

Current Pending Sector S.M.A.R.T

Hello I have a Linux Server mounted with BTRFS in RAID-1 and I would like that someone more expert than me could solve my doubts. I have 3 HDD of 6TB each, in one of them every so often smartd detects errors in some sector. When it happens I use…
1
vote
1 answer

SMART attribute CRC error count raw values high + errors logged, time to replace drive?

I have 2x Hitachi Deskstar P7K500 about a year old in RAID 1, md0 is boot and md1 is used by LVM. Just recently I got a warning in X (FC11) that there are one or more disks failing. I looked at the SMART attributes and I have errors on both…
bxb
1
vote
0 answers

Smartd supress/change level for FAILURE PREDICTION THRESHOLD EXCEEDED

I'm using smartd to monitor drive health It started compaining about FAILURE PREDICTION THRESHOLD EXCEEDED for one drive, but I extensively tested that drive and it's fully functional That drive is one of many in this machine How can I disable or…
Matthias
  • 187
  • 3
  • 10