At our shop we have nagios checks for the SMART status of hard disks in linux servers, but it hasn't been really useful so far: by the time we get a SMART alarm the system already had problems so we already knew about it :)
We then developed a practice to routinely run SMART background self tests on disks (smartctl -t long
, during off-peak hours)and track that data by hand. We log the disk model and s/n, date of the last test, the number of reallocated sectors (we usually try to change every disk with >0 reallocated sectors), and the Power On Hours accumulated by the disk so we can know at a glance which of our disks are older.
Since the number of systems (and thus disks) is increasing we'd like to automate the task of running tests and collecting results. Before starting to reinvent the wheel I started to look out for existing solutions but I had no luck.
Is there any software to automate SMART self tests and collect the resulting data, under Linux - or maybe to integrate that into some hardware inventory management system?