[smartmontools-support] Unable to run extended SMART selftest

Albin public at inquisitive.se
Tue Jul 31 21:02:49 CEST 2018

> Possibly a firmware issue. In the past, I've seen various issues with 
> the self-test logs of Intel SSDs. Does '-t short' work ?

# smartctl -t short /dev/sda

smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.9.0-7-amd64] (local build)
Sending command: "Execute SMART Short self-test routine immediately in off-line mode".
Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful.
Testing has begun.
Please wait 2 minutes for test to complete.
Test will complete after Tue Jul 31 20:34:10 2018

Use smartctl -X to abort test.

After a few seconds I checked whether the test had been interrupted.

# smartctl -l selftest /dev/sda

smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.9.0-7-amd64] (local build)
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Aborted by host               00%       110 
# 2  Extended offline    Aborted by host               00%       110 
# 3  Extended offline    Aborted by host               00%       110 
# 4  Extended offline    Aborted by host               00%        94 
# 5  Extended offline    Aborted by host               00%        94 
# 6  Extended offline    Aborted by host               00%        93 
# 7  Extended offline    Aborted by host               00%        93 
# 8  Extended offline    Aborted by host               00%        80 
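
To make a log like the one above easier to compare between runs, here is a minimal Python sketch that parses the self-test rows and tallies the status column. The function name `parse_selftest_log` and the regular expression are my own assumptions about the layout shown here, not anything provided by smartmontools, and the sample only repeats the first four rows from this post.

```python
import re
from collections import Counter

# Matches rows such as:
# "# 1  Short offline       Aborted by host               00%       110"
# Columns are assumed to be separated by two or more spaces.
ROW = re.compile(
    r"^#\s*(?P<num>\d+)\s{2,}(?P<test>.+?)\s{2,}(?P<status>.+?)\s{2,}"
    r"(?P<remaining>\d+)%\s+(?P<hours>\d+)"
)

def parse_selftest_log(text):
    """Return one dict per self-test log row found in smartctl output."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line.strip())
        if m:
            rows.append({
                "num": int(m["num"]),
                "test": m["test"],
                "status": m["status"],
                "remaining_pct": int(m["remaining"]),
                "hours": int(m["hours"]),
            })
    return rows

SAMPLE = """\
# 1  Short offline       Aborted by host               00%       110
# 2  Extended offline    Aborted by host               00%       110
# 3  Extended offline    Aborted by host               00%       110
# 4  Extended offline    Aborted by host               00%        94
"""

rows = parse_selftest_log(SAMPLE)
# Tally how often each status appears in the log.
print(Counter(r["status"] for r in rows))
```

Rows without a lifetime value, and lines that are not log rows at all, are simply skipped.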

> See also this thread from Intel SSD forum about another 535 with same 
> firmware:
> https://communities.intel.com/thread/87837
> Self-test log is filled with "Aborted by host" but self-test execution 
> status reports "...completed without error ...".

Sadly there wasn't much information in that thread. Or I'm missing 
something, which is very possible.

>> ...
>> Model Family:     Intel 53x and Pro 2500 Series SSDs
>> Device Model:     INTEL SSDSC2BW240H6
>> ...
>> Self-test execution status:      (   0)    The previous self-test 
>> routine completed
>>                     without error or no self-test has ever
>>                     been run.
> This does not match with info from self-Test log.

What do you mean?
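
For context on the two status values in this thread (0 in the quoted report above, 16 in the '-x' output further down): as far as I understand, the number in parentheses is the raw self-test execution status byte, with the upper four bits holding a status code and the lower four bits the remaining percentage in units of 10%. A minimal Python sketch of that decoding follows. The function name and the short meaning strings are my own, and only the codes I am sure of are listed.

```python
# Status codes from the upper nibble of the self-test execution status byte.
# Only a subset is listed here; anything else is reported as "other".
STATUS_MEANINGS = {
    0: "completed without error, or no self-test has ever been run",
    1: "aborted by the host",
    2: "interrupted by the host with a hard or soft reset",
    3: "fatal or unknown error, test could not complete",
    15: "self-test in progress",
}

def decode_selftest_status(value):
    """Split the status byte into (code, percent remaining, meaning)."""
    code = (value >> 4) & 0x0F          # upper four bits
    remaining = (value & 0x0F) * 10     # lower four bits, in steps of 10%
    meaning = STATUS_MEANINGS.get(code, "other (see the ATA specification)")
    return code, remaining, meaning

for raw in (0, 16):
    print(raw, decode_selftest_status(raw))
```

Read this way, 0 means "completed without error or never run" and 16 means "aborted by the host", which is the status the self-test log rows show.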

> This is the old error log which does not support 48-bit LBA. Please try 
> 'smartctl -l xerror' (included in 'smartctl -x' but not in '-a') to read 
> the newer error log.
> [...]
> Same for self-test log. Please try 'smartctl -l xselftest' (or 'smartctl 
> -x').

I'm unsure of the full command you want me to run, so I ran `smartctl` 
with the options I think you wanted.

# smartctl -l xerror -d ata /dev/sda

smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.9.0-7-amd64] (local build)
ATA_READ_LOG_EXT (addr=0x00:0x00, page=0, n=1) failed: 48-bit ATA commands not implemented
Read GP Log Directory failed

SMART Extended Comprehensive Error Log (GP Log 0x03) not supported

Try '-l [xerror,]error' to read traditional SMART Error Log

# smartctl -l xerror /dev/sda

smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.9.0-7-amd64] (local build)
SMART Extended Comprehensive Error Log (GP Log 0x03) not supported

Try '-l [xerror,]error' to read traditional SMART Error Log

# smartctl -x -d ata /dev/sda

smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.9.0-7-amd64] (local build)
Model Family:     Intel 53x and Pro 2500 Series SSDs
Device Model:     INTEL SSDSC2BW240H6
Serial Number:    CVTR61030A9Q240CGN
LU WWN Device Id: 5 5cd2e4 14cb13669
Firmware Version: RG21
User Capacity:    240,057,409,536 bytes [240 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 (minor revision not indicated)
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jul 31 20:41:53 2018 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM level is:     254 (maximum performance)
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, frozen [SEC2]
Wt Cache Reorder: Unavailable

SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x05)	Offline data collection activity
					was aborted by an interrupting command from host.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (  16)	The self-test routine was aborted by
					the host.
Total time to complete Offline
data collection: 		( 5860) seconds.
Offline data collection
capabilities: 			 (0x7f) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Abort Offline collection upon new
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  58) minutes.
Conveyance self-test routine
recommended polling time: 	 (   4) minutes.
SCT capabilities: 	       (0x0025)	SCT Status supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
   5 Reallocated_Sector_Ct   -O--CK   100   100   000    -    0
   9 Power_On_Hours_and_Msec -O--CK   100   100   000    - 
  12 Power_Cycle_Count       -O--CK   100   100   000    -    105
170 Available_Reservd_Space PO--CK   100   100   010    -    0
171 Program_Fail_Count      -O--CK   100   100   000    -    0
172 Erase_Fail_Count        -O--CK   100   100   000    -    0
174 Unexpect_Power_Loss_Ct  -O--CK   100   100   000    -    7
183 SATA_Downshift_Count    -O--CK   100   100   000    -    1
184 End-to-End_Error        PO--CK   100   100   090    -    0
187 Uncorrectable_Error_Cnt -O--CK   100   100   000    -    0
190 Airflow_Temperature_Cel -O--CK   032   100   000    -    32 (Min/Max 
192 Power-Off_Retract_Count -O--CK   100   100   000    -    7
199 UDMA_CRC_Error_Count    -O--CK   100   100   000    -    0
225 Host_Writes_32MiB       -O--CK   100   100   000    -    84419
226 Workld_Media_Wear_Indic -O--CK   100   100   000    -    65535
227 Workld_Host_Reads_Perc  -O--CK   100   100   000    -    38
228 Workload_Minutes        -O--CK   100   100   000    -    65535
232 Available_Reservd_Space PO--CK   100   100   010    -    0
233 Media_Wearout_Indicator -O--CK   100   100   000    -    0
241 Host_Writes_32MiB       -O--CK   100   100   000    -    84419
242 Host_Reads_32MiB        -O--CK   100   100   000    -    53787
249 NAND_Writes_1GiB        -O--CK   100   100   000    -    1365
                             ||||||_ K auto-keep
                             |||||__ C event count
                             ||||___ R error rate
                             |||____ S speed/performance
                             ||_____ O updated online
                             |______ P prefailure warning
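
In case it helps with reading the attribute table above, here is a minimal Python sketch that turns the '-x' style rows into a dictionary keyed by attribute ID. IDs are used as keys because some names appear twice in this table, for example Host_Writes_32MiB as 225 and 241. The function name `parse_attributes` is my own, it is only meant to be fed the attribute rows, and the conversion assumes the raw value really counts 32 MiB units, as the attribute name suggests.

```python
def parse_attributes(text):
    """Parse 'ID NAME FLAGS VALUE WORST THRESH FAIL RAW' rows into a dict."""
    attrs = {}
    for line in text.splitlines():
        fields = line.split()
        # Skip the flag legend and anything that is not an attribute row.
        if len(fields) < 7 or not fields[0].isdigit():
            continue
        attrs[int(fields[0])] = {
            "name": fields[1],
            "flags": fields[2],
            "value": int(fields[3]),
            "worst": int(fields[4]),
            "thresh": int(fields[5]),
            # The raw column may be empty if the row was cut off in transit.
            "raw": " ".join(fields[7:]),
        }
    return attrs

SAMPLE = """\
  9 Power_On_Hours_and_Msec -O--CK   100   100   000    -
225 Host_Writes_32MiB       -O--CK   100   100   000    -    84419
242 Host_Reads_32MiB        -O--CK   100   100   000    -    53787
                             ||||||_ K auto-keep
"""

attrs = parse_attributes(SAMPLE)
# 84419 units * 32 MiB / 1024 = GiB written by the host (assumed unit).
written_gib = int(attrs[225]["raw"]) * 32 / 1024
print(f"host writes: {written_gib:.1f} GiB")  # host writes: 2638.1 GiB
```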

ATA_READ_LOG_EXT (addr=0x00:0x00, page=0, n=1) failed: 48-bit ATA commands not implemented
Read GP Log Directory failed

SMART Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00           SL  R/O      1  Log Directory
0x04           SL  R/O      1  Device Statistics log
0x06           SL  R/O      1  SMART self-test log
0x09           SL  R/W      1  Selective self-test log
0x30           SL  R/O     16  IDENTIFY DEVICE data log
0x80-0x9f      SL  R/W     16  Host vendor specific log
0xb7           SL  VS      16  Device vendor specific log
0xe0           SL  R/W      1  SCT Command/Status
0xe1           SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log (GP Log 0x03) not supported

SMART Error Log not supported

SMART Extended Self-test Log (GP Log 0x07) not supported

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Aborted by host               00%       110 
# 2  Extended offline    Aborted by host               00%       110 
# 3  Extended offline    Aborted by host               00%       110 
# 4  Extended offline    Aborted by host               00%        94 
# 5  Extended offline    Aborted by host               00%        94 
# 6  Extended offline    Aborted by host               00%        93 
# 7  Extended offline    Aborted by host               00%        93 
# 8  Extended offline    Aborted by host               00%        80 

SMART Selective self-test log data structure revision number 1
     1        0        0  Not_testing
     2        0        0  Not_testing
     3        0        0  Not_testing
     4        0        0  Not_testing
     5        0        0  Not_testing
Selective self-test flags (0x0):
   After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       0 (0x0000)
SCT Support Level:                   1
Device State:                        Active (0)
Current Temperature:                    32 Celsius
Power Cycle Min/Max Temperature:     18/45 Celsius
Lifetime    Min/Max Temperature:     18/64 Celsius
Under/Over Temperature Limit Count:   0/0

Write SCT Data Table failed: Inappropriate ioctl for device
Read SCT Temperature History failed

SCT Error Recovery Control command not supported

Device Statistics (SMART Log 0x04)
Page  Offset Size        Value Flags Description
ATA_SMART_READ_LOG failed: Multi-sector ATA commands not implemented
Read Device Statistics pages 0x00-0x07 failed

ATA_READ_LOG_EXT (addr=0x11:0x00, page=0, n=1) failed: 48-bit ATA commands not implemented
Read SATA Phy Event Counters failed
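
Since the report above is long, here is a minimal Python sketch that pulls out only the lines saying something failed or is not supported, which makes it easier to compare runs with and without '-d ata'. The function name `unavailable_features` and the matching rules are my own assumptions based on the wording in this output. The sample repeats a few lines from this post.

```python
def unavailable_features(text):
    """Return the lines that report a failed read or an unsupported log."""
    hits = []
    for line in text.splitlines():
        line = line.strip()
        # "Self-test supported." must not match, so test the line ending.
        if line.endswith("not supported") or " failed" in line:
            hits.append(line)
    return hits

SAMPLE = """\
Self-test supported.
Read GP Log Directory failed
SMART Extended Comprehensive Error Log (GP Log 0x03) not supported
SMART Error Log not supported
Write SCT Data Table failed: Inappropriate ioctl for device
SCT Error Recovery Control command not supported
"""

for line in unavailable_features(SAMPLE):
    print(line)
```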

