[smartmontools-support] Dealing with locked raid device files
Vincas Dargis
vindrg at gmail.com
Mon Dec 31 13:15:46 CET 2018
Hi,
I have reported this Debian wishlist bug report:
https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=916999
Is it possible, and at all reasonable, to deal with these kind of niche problems?
In short, Areca's `cli64` command line interface, that queries hardware raid controller information
(raidset, volume, hardware status, etc), locks `/dev/sgX` device with exclusive lock, and that
prevents `smartctl` and `smartd` to query device, and in the result, errors happens - missing
monitoring checks (for munin and similar), and email "spam" from smartd with "SMART error
(FailedOpenDevice) detected on host..." messages.
In our setup, `cli64` is invoked every minute to catch raidset errors, and send SMS if so, as fast
as possible via Icinga monitoring system. In the result, there are *two* `cli64` invocations per
minute, to get Areca log AND raidset status (these provide different set of important messages).
Maybe it could be possible to detect exclusive locks and retry/wait (for a reasonable
attepts/duration) to reduce "false-possitives"? Or this would be considered as "dirty" workaround..?
Thanks!
More information about the Smartmontools-support
mailing list