[smartmontools-support] Disk failure notification options

Claudio Kuenzler napsty at gmail.com
Tue Jan 5 09:03:57 CET 2021


Hi Alex

If I recall, the raw_read_error_rate was less than a few hundred about
> a month ago.
>

raw_read_error_rate MAY indicate a failing drive, but doesn't have to.


> I'm hoping this command will do the following:
> - monitor all drive aspects
> - send an alert whenever the Raw_Read_Error_Rate changes
> - send an alert whenever Temperature changes >= 4 Celsius or , >= 45C
> and log a critical alert when temp is >= 55
>
> I just want to be sure I'm not doing something wrong that will
> overlook an early warning alert for this drive failing.
>

You may also take a look at check_smart.pl, which uses smartctl in the
background, for drive monitoring.
As raw_read_error_rate is not part of the default raw attribute list to be
checked, you can expand this:

./check_smart.pl -d /dev/sdc -i ata -r 'Current_Pending_Sector,
Reallocated_Sector_Ct, Program_Fail_Cnt_Total, Uncorrectable_Error_Cnt,
Offline_Uncorrectable, Runtime_Bad_Block, Reported_Uncorrect,
Reallocated_Event_Count, Command_Timeout,Raw_Read_Error_Rate' -w
'Raw_Read_Error_Rate=1947'

This will alert you when Raw_Read_Error_Rate increased (+1 in your case) or
if all other attributes from the raw list (-r) have a non-zero value.

https://www.claudiokuenzler.com/monitoring-plugins/check_smart.php
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://listi.jpberlin.de/pipermail/smartmontools-support/attachments/20210105/5b8e44c7/attachment.htm>


More information about the Smartmontools-support mailing list