Questions tagged [smart]

Self-Monitoring, Analysis and Reporting Technology

This technology is used to monitor a hard drive's state and reliability. It tries to predict failures and warns the user when a disk is degrading.

207 questions
0
votes
1 answer

Linux RAID10 hit bad disk while re-syncing. Is it hosed?

I have a Linux software RAID10 device on md0. It's made up of four 1TB disks, sd[abcd]. Yesterday SMART emailed me to say a disk was going bad (seek errors going up and reallocated sectors). I rebooted with a new drive and added it to the array.…
Server Fault
  • 3,714
  • 12
  • 54
  • 89
0
votes
2 answers

Possible to reset SMART logs?

I have two 2TB Seagate disks that show a SMART error on bootup. I ran the Seagate disk utility check and it passed the full test. But the SMART error says both drives have exceeded 70C in temperature, so SMART has been tripped. I want to recycle them…
lbanz
  • 1,609
  • 5
  • 20
  • 30
0
votes
1 answer

How can I locally analyze SMART logs from a server?

I have Ubuntu 10.04 server running on a desktop at home. I rebooted, and the machine was unreachable after 10 minutes or so. At this point I should have connected a console (monitor using VGA cable). Instead, I powered off the machine (held down the…
Adam Monsen
  • 725
  • 1
  • 7
  • 24
0
votes
1 answer

Can I remap a bad block on a live RHEL5 system using the following command?

I am getting the following disk error on one of my servers: smartd[4235]: Device: /dev/sda, 1 Currently unreadable (pending) sectors. Can I apply the following command to repair the bad block on a live RHEL5 system? badblocks -n -v /dev/hda1
vnix27
  • 886
  • 2
  • 11
  • 19
0
votes
1 answer

SCSI error: return code = 0x08000002, sense key: Aborted Command

The kernel logs: ata2.00: exception Emask 0x0 SAct 0x7fffffff SErr 0x0 action 0x0 ata2.00: irq_stat 0x40000008 ata2.00: cmd 61/08:f0:6f:5b:97/00:00:00:00:00/40 tag 30 ncq 4096 out res 41/10:01:6f:5b:97/d5:00:00:00:00/40 Emask 0x481 (invalid…
quanta
  • 51,413
  • 19
  • 159
  • 217
0
votes
2 answers

Current Pending Sector S.M.A.R.T. attribute

Today I noticed that one SATA HD that I'm monitoring has "Current_Pending_Sector" with RAW_VALUE=1 when inspecting the SMART attributes with smartmontools. Here is the output from smartctl.exe: SMART Attributes Data Structure revision number:…
Norba
  • 57
  • 4
  • 11
0
votes
1 answer

SMART values HIGH and growing, but indicator green?

After a drive started to freeze up, I then plugged in another drive with Ubuntu to see the SMART data using Disk Utility. Now I read that "Disk is healthy". I'm not familiar with SMART parameters, but looking at them, there are some insanely high numbers…
Janis Veinbergs
  • 1,585
  • 4
  • 23
  • 34
0
votes
1 answer

APC Smart UPS network shutdown issue

Here is a bit about our setup: We have 2x Smart-UPS RT 6000 XL units with network management cards. We are running Powerchute from a network server. Powerchute is connected to the management cards of both UPSs. UPSs are set to do a graceful shutdown…
0
votes
2 answers

Is it possible to force SMART to re-check offline sectors?

I have an HDD that has gone into a pre-failure state with a large count of reallocated sectors. However, I figured out that the problem is possibly with the HDD slot (cable) and not with the disk: when I plugged other HDDs into this slot they started …
timurb
  • 347
  • 4
  • 12
0
votes
5 answers

Monitoring health (SMART data, temperatures etc.) of many remote computers

Is there any software that would allow you to view the system health data of lots of remote Windows computers over the internet? Preferably not overly complex, and free would be a big plus! I would need to monitor the health of individual media…
Acorn
  • 1,057
  • 1
  • 7
  • 10
0
votes
0 answers

Many drives in two RAID6 arrays simultaneously failed, seem to be working after reboot except SMART long test

In my storage server, I operate three RAID6 Linux software arrays. Everything was working fine until it was not. There are two RAID6 arrays and one RAID5 array, all consisting of SATA drives, all connected to a HBA9500-16i controller. Suddenly,…
michalt
  • 11
  • 3
0
votes
1 answer

Interpret smartctl -a output

Please help me understand this: root@bdb16e4bb2e3:/opt/scrutiny# smartctl --all /dev/sdb smartctl 7.2 2020-12-30 r5155 [aarch64-linux-5.15.0-1024-raspi] (local build) Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org ===…
pyofey
  • 103
  • 3
0
votes
1 answer

SATA errors showing up in journalctl while SMART diagnostic is OK - Motherboard issue?

After noticing abnormally long disk operations delays, I looked up journalctl and this is what I found: Feb 22 14:02:11.711182 Onan01 kernel: ata10: hard resetting link Feb 22 14:02:12.186958 Onan01 kernel: ata10: SATA link up 1.5 Gbps (SStatus 113…
fi11222
  • 103
  • 2
0
votes
1 answer

How to monitor Seagate HDD health under Linux?

I need to monitor the health of several Seagate ST16000NM002G SAS HDDs hosted in a disk server running CentOS 7. As far as I understand, Seagate disks do not expose S.M.A.R.T. attributes due to a precise management decision (see this page), and…
Nicola Mori
  • 281
  • 1
  • 7
0
votes
1 answer

If a hard drive caused "bus errors", should its SMART attributes show problems?

The machine shows "ATA bus errors" occasionally, and has to be rebooted: [24028.505239] ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x48d0002 action 0xe frozen [24028.505249] ata1.00: irq_stat 0x08400040, interface fatal error, connection status…
MWB
  • 187
  • 9