[smartmontools-support] smartd wakes powered off drives
David C. Partridge
david.partridge at perdrix.co.uk
Sat Jan 30 02:04:03 CET 2021
I have a RAID 5 behind an Adaptec ASR-8885 Raid Card. The card is configured to power the drives down if there's no activity for 20 minutes.
Problem is that smartd wakes up every 30 minutes and tries to read the drive temperatures which of course wakes the drives ☹
Jan 29 21:42:11 charon smartd[791]: Device: dev/disk/by-path/pci-0000:00:17.0-ata-1 [SAT], SMART Prefailure Attribute: 194 Temperature_Celsius changed from 70 to 71
Jan 29 21:42:11 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_0] [SCSI], failed to read Temperature
Jan 29 21:42:17 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_1] [SCSI], failed to read SMART values
Jan 29 21:42:17 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_1] [SCSI], failed to read Temperature
Jan 29 21:42:23 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_2] [SCSI], failed to read SMART values
Jan 29 21:42:23 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_2] [SCSI], failed to read Temperature
Jan 29 21:42:29 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_3] [SCSI], failed to read SMART values
Jan 29 21:42:29 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_3] [SCSI], failed to read Temperature
Jan 29 21:42:35 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_4] [SCSI], failed to read SMART values
Jan 29 21:42:35 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_4] [SCSI], failed to read Temperature
Jan 29 21:42:41 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_5] [SCSI], failed to read SMART values
Jan 29 21:42:41 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_5] [SCSI], failed to read Temperature
Jan 29 21:42:47 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_6] [SCSI], failed to read SMART values
Jan 29 21:42:47 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_6] [SCSI], failed to read Temperature
Jan 29 21:42:53 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_7] [SCSI], failed to read SMART values
Jan 29 21:42:53 charon smartd[791]: Sending warning via /usr/share/smartmontools/smartd-runner to david.partridge at perdrix.co.uk ...
Jan 29 21:42:53 charon smartd[791]: Warning via /usr/share/smartmontools/smartd-runner to david.partridge at perdrix.co.uk produced unexpected output (183 bytes) to STDOUT/STDERR:
Jan 29 21:42:53 charon smartd[791]: /etc/smartmontools/run.d/10mail:
Jan 29 21:42:53 charon smartd[791]: Your system does not have /usr/bin/mail. Install the mailx or mailutils package
Jan 29 21:42:53 charon smartd[791]: run-parts: /etc/smartmontools/run.d/10mail exited with return code 1
Jan 29 21:42:53 charon smartd[791]: Warning via /usr/share/smartmontools/smartd-runner to david.partridge at perdrix.co.uk: failed (32-bit/8-bit exit status: 256/1)
Jan 29 21:42:53 charon smartd[791]: Device: /dev/disk/by-path/pci-0000:01:00.0-scsi-0:0:0:0 [aacraid_disk_00_00_7] [SCSI], failed to read Temperature
Jan 29 21:42:54 charon smartd[791]: Sending warning via /usr/share/smartmontools/smartd-runner to david.partridge at perdrix.co.uk ...
Jan 29 21:42:54 charon smartd[791]: Warning via /usr/share/smartmontools/smartd-runner to david.partridge at perdrix.co.uk produced unexpected output (183 bytes) to STDOUT/STDERR:
Jan 29 21:42:54 charon smartd[791]: /etc/smartmontools/run.d/10mail:
Jan 29 21:42:54 charon smartd[791]: Your system does not have /usr/bin/mail. Install the mailx or mailutils package
Jan 29 21:42:54 charon smartd[791]: run-parts: /etc/smartmontools/run.d/10mail exited with return code 1
Jan 29 21:42:54 charon smartd[791]: Warning via /usr/share/smartmontools/smartd-runner to david.partridge at perdrix.co.uk: failed (32-bit/8-bit exit status: 256/1)
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 0, WWN: 50000C0F0129DEFD, vendor: WD, model: WD4001FYYG, S/N: WMC1F1532925, firmware level: D1R5.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 2, WWN: 50000C0F01D0D7C1, vendor: WD, model: WD4001FYYG-01SL3, S/N: WMC1F0918722, firmware level: VR07.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 3, WWN: 50000C0F023D7B31, vendor: WD, model: WD4001FYYG-01SL3, S/N: WMC1F0D99DRX, firmware level: VR08.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 4, WWN: 50000C0F01DEB949, vendor: WD, model: WD4001FYYG, S/N: WMC1F1533218, firmware level: D1R5.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 1, WWN: 50000C0F01C85E41, vendor: WD, model: WD4001FYYG-01SL3, S/N: WMC1F1054265, firmware level: VR02.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 5, WWN: 50000C0F010E7B41, vendor: IBM-ESXS, model: WD4001FYYG-23S, S/N: F0E0X2F0, firmware level: XA39.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 6, WWN: 50000C0F0129C691, vendor: WD, model: WD4001FYYG, S/N: WMC1F1528634, firmware level: D1R5.
Jan 29 21:43:30 charon [1031]: [426] Power management state changed to Full rpm: controller 1 ( Adaptec ASR8885 #4D39138FB5F Physical Slot: 16 ), channel: 0, deviceID: 7, WWN: 50000C0F01FF3D59, vendor: WD, model: WD4001FYYG-01SL3, S/N: WMC1F0E1VFXS, firmware level: VR08.
It also triggers a nastygram on my Ubuntu system about /usr/bin//mail for each device (I've removed all but one set of these from log above ...
Is there any way to change make this check the state of the devices before trying or other way to avoid waking the drives up (they don't support reduced RPM mode).
Thanks
David
More information about the Smartmontools-support
mailing list