Questions tagged [smart]

Self-Monitoring, Analysis and Reporting Technology

Self-Monitoring, Analysis and Reporting Technology

This used to monitor a hard drive's state and reliability. It tries to predict failures and warns the user when a disk is degrading.

207 questions
1
vote
0 answers

APC change to smart slot / card interface

I bought a new APC SMT1000C 1000VA UPS, which has the smart card interface on the back. I was planning to reuse my old AP9606 network card, but I discovered that APC has modified the slot to block the old cards. See photos below, new card has 2…
TSG
  • 1,674
  • 7
  • 32
  • 51
1
vote
0 answers

How to add to excludes alerts on smartmontool

I faced with problem and hope for your help. Started getting notifications from smart on Debian 10 server: Device: /dev/nvme1, Critical Warning (0x04): Reliability Found that this alert causing because next attribute: Percentage Used: 107% I also…
remuz150
  • 11
  • 1
1
vote
0 answers

How to interpret smartctl output

I ran a SMART scan of the ssd of my server, and I'm having difficulties to understand the output. Any insights please ? Thanks smartctl 7.0 2018-12-30 r4883 [x86_64-linux-3.10.0-1160.71.1.el7.x86_64] (local build) Copyright (C) 2002-18, Bruce Allen,…
crowd42
  • 11
  • 1
1
vote
1 answer

btrfs - failing disk generated checksum errors, disk replaced, errors remain

I had a pair of 3TB disks in a btrfs raid1 array. One of these disks started failing (smartd shows bad sectors), and so I bought a pair of new 8TB drives to replace both disks in the array. I replaced both with btrfs replace, and ran a btrfs balance…
dkd6
  • 155
  • 1
  • 9
1
vote
1 answer

smartctl "Elements in grown defect list" vs. RAID controller "Media error count"

I am using a hardware raid50 with PERC810 controller in my server and recently encountered a metric I am not sure about. Until now, I have been using a smartctl metric "Elements in grown defect list" as a hint that drive is failing and should be…
chpZ
  • 11
  • 3
1
vote
1 answer

Interpreting SMART Logs of Newly Installed NVMe RAID0 Crashing Everyday

An Ubuntu 20.04 system has been stable for a year until a 2nd and 3rd NVMe drive is installed on the motherboard to form a 2x1TB RAID0 array. Ever since then, there is huge amount of IO load on this RAID0 array 24/7 and the system crashes about once…
Nyxynyx
  • 1,459
  • 11
  • 39
  • 49
1
vote
1 answer

SMART test failures on harddrive

i have probably problem on my system with SMART failures on my harddrive. There is used OS: SLES 11.2 i586. Problem is, that recently there appeared some errors in SMART print of status of my device (smartctl -a /dev/sda) i am attaching. I am…
Martin
  • 11
  • 1
1
vote
2 answers

Resetting SMART on an SSD (hours powered on)

We've been bitten by a known issue on SanDisk SSDs (Dell or HPE branded) where they hard fail after a certain number of hours powered on - 32768 or 40000 depending on the specific model. Is there a reliable way to roll just this SMART attribute…
1
vote
1 answer

(LVM) Moving data to a different physical volume within the same logical volume

I have a logical volume called storage. It contains two ~1TB physical volumes: /dev/sdc and /dev/sdd. smartctl -a /dev/sdd informs me that /dev/sdd is failing. I can still read data off it and I have just backed up all contents off the LV. I have…
Emily Horsman
  • 111
  • 1
  • 3
0
votes
1 answer

G-sense errors in new disks

I have a Synology DiskStation. It used to contain 3xTB drives, I now have 2xTB, 2x8TB drives. The old drives were "Seagate NAS" drives, the new ones are from the same(?) series, named "Ironwolf". The old drives both list G_Sense_Error_Rate at 0. The…
Henrik
  • 101
  • 2
0
votes
1 answer

Trouble Getting the smartd Attributes of a Western Digital 4TB HHD

I am trying to get the smartd attributes of a Western Digital 4TB HHD model WD4001FYYG-01SL3. I would like to get the following attributes: SMART 5 Reallocated Sectors Count SMART 187 Reported Uncorrectable Errors SMART 188 Command Timeout SMART…
Jeff Kubina
  • 427
  • 1
  • 4
  • 14
0
votes
2 answers

Out-of-order lifetimes in SMART self-test log

I was peeking at a server's SMART log, and noticed this (emphasis mine): SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline …
0
votes
2 answers

SMART Input Program

I have a 3ware 9650se card with 4 SATA drives attached to it. I've been trying to figure out how to convert the SMART data I can get from the web interface into something usable, but I haven't found a program that will just take what I give it and…
Nori
  • 211
  • 3
  • 10
0
votes
1 answer

Does "smartctl -H or -all" run anything against disks or just poll data?

I am setting up Smart monitoring currently and I had a question regarding the command smartctl -H /dev/sda === START OF READ SMART DATA SECTION === SMART Health Status: OK Does this actually run anything against the disk, or does it just poll the…
FreeSoftwareServers
  • 515
  • 1
  • 8
  • 26
0
votes
1 answer

Is it possible to reset S.M.A.R.T. Spin_Retry_Count?

I have a 20-bay nas that uses Norco 5 in 2 hard drive cages. It's been at about half capacity most of it's life, but I recently added 5 new drives too it and ever since I've just been having sporadic problems but nothing that S.M.A.R.T would report…
user126715
  • 163
  • 1
  • 1
  • 4