[smartmontools-support] vmware esxi nvme not working

Alexander Liuckanov sasho at speednet.bg
Sun Jul 7 01:06:28 CEST 2019


Hi,

Under vmware esxi when try to access NVMe device the smartctl return an error

[root at esxi:/tmp] ./smartctl -d nvme  -x /dev/disks/t10.NVMe____SAMSUNG_MZWLL800HEHP2D00003______________460BB081E5382500
smartctl 7.1 2019-07-01 r4934 [x86_64-linux-6.7.0] (CircleCI)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

Read NVMe Identify Controller failed: NVME_IOCTL_ADMIN_CMD: Function not implemented


The build in smart status in vmware work ok with NVMe but it not return the raw value and it is useless in this variant


[root at nod31:/tmp] esxcli storage core device smart get -d t10.NVMe____SAMSUNG_MZWLL800HEHP2D00003______________460BB081E5382500
Parameter                     Value  Threshold  Worst
----------------------------  -----  ---------  -----
Health Status                 OK     N/A        N/A
Media Wearout Indicator       N/A    N/A        N/A
Write Error Count             N/A    N/A        N/A
Read Error Count              N/A    N/A        N/A
Power-on Hours                2330   N/A        N/A
Power Cycle Count             14     N/A        N/A
Reallocated Sector Count      0      90         N/A
Raw Read Error Rate           N/A    N/A        N/A
Drive Temperature             36     78         N/A
Driver Rated Max Temperature  N/A    N/A        N/A
Write Sectors TOT Count       N/A    N/A        N/A
Read Sectors TOT Count        N/A    N/A        N/A
Initial Bad Block Count       N/A    N/A        N/A

Can you suggest solution for this error under vmware esxi?



P.S.
smartctl works OK with SATA and SCSI disks under vmware esxi


[root at esxi:~] /opt/smartmontools/smartctl -d sat --all /dev/disks/naa.5002538c4007bc35
smartctl 6.6 2016-05-10 r4321 [x86_64-linux-6.7.0] (daily-20160510)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Samsung based SSDs
Device Model:     SAMSUNG MZ7KM240HAGR-00005
Serial Number:    S2HRNXAGA00115
LU WWN Device Id: 5 002538 c4007bc35
Firmware Version: GXM1003Q
User Capacity:    240,057,409,536 bytes [240 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ATA8-ACS T13/1699-D revision 4c
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sat Jul  6 23:02:15 2019 UTC
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART Status not supported: Incomplete response, ATA output registers missing
SMART overall-health self-assessment test result: PASSED
Warning: This result is based on an Attribute check.

General SMART Values:
Offline data collection status:  (0x02) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                ( 1740) seconds.
Offline data collection
capabilities:                    (0x53) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  29) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   093   093   000    Old_age   Always       -       31401
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       18
177 Wear_Leveling_Count     0x0013   080   080   005    Pre-fail  Always       -       7835
179 Used_Rsvd_Blk_Cnt_Tot   0x0013   100   100   010    Pre-fail  Always       -       0
180 Unused_Rsvd_Blk_Cnt_Tot 0x0013   100   100   010    Pre-fail  Always       -       1935
181 Program_Fail_Cnt_Total  0x0032   100   100   010    Old_age   Always       -       0
182 Erase_Fail_Count_Total  0x0032   100   100   010    Old_age   Always       -       0
183 Runtime_Bad_Block       0x0013   100   100   010    Pre-fail  Always       -       0
184 End-to-End_Error        0x0033   100   100   097    Pre-fail  Always       -       0
187 Uncorrectable_Error_Cnt 0x0032   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0032   072   056   000    Old_age   Always       -       28
195 ECC_Error_Rate          0x001a   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
199 CRC_Error_Count         0x003e   100   100   000    Old_age   Always       -       0
202 Exception_Mode_Status   0x0033   100   100   010    Pre-fail  Always       -       0
235 POR_Recovery_Count      0x0012   099   099   000    Old_age   Always       -       5
241 Total_LBAs_Written      0x0032   099   099   000    Old_age   Always       -       1774096469741
242 Total_LBAs_Read         0x0032   099   099   000    Old_age   Always       -       333626536362
243 SATA_Downshift_Ct       0x0032   100   100   000    Old_age   Always       -       0
244 Thermal_Throttle_St     0x0032   100   100   000    Old_age   Always       -       0
245 Timed_Workld_Media_Wear 0x0032   100   100   000    Old_age   Always       -       65535
246 Timed_Workld_RdWr_Ratio 0x0032   100   100   000    Old_age   Always       -       65535
247 Timed_Workld_Timer      0x0032   100   100   000    Old_age   Always       -       65535
251 NAND_Writes             0x0032   100   100   000    Old_age   Always       -       4098927083032

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
  255        0    65535  Read_scanning was completed without error
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.



[root at esxi:/tmp] ./smartctl -d scsi -x /dev/disks/naa.5000c500bc210e4f
smartctl 7.1 2019-07-01 r4934 [x86_64-linux-6.7.0] (CircleCI)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Vendor:               SEAGATE
Product:              ST2000NX0273
Revision:             E004
Compliance:           SPC-4
User Capacity:        2,000,398,934,016 bytes [2.00 TB]
Logical block size:   512 bytes
Physical block size:  4096 bytes
LU is fully provisioned
Rotation Rate:        7200 rpm
Form Factor:          2.5 inches
Logical Unit id:      0x5000c500bc210e4f
Serial number:        W461R8PK0000E9099U1V
Device type:          disk
Transport protocol:   SAS (SPL-3)
Local Time is:        Sat Jul  6 23:03:21 2019 UTC
SMART support is:     Available - device has SMART capability.
SMART support is:     Enabled
Temperature Warning:  Enabled
Read Cache is:        Enabled
Writeback Cache is:   Disabled

=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK

Grown defects during certification <not available>
Total blocks reassigned during format <not available>
Total new blocks reassigned <not available>
Power on minutes since format <not available>
Current Drive Temperature:     26 C
Drive Trip Temperature:        60 C

Manufactured in week 41 of year 2018
Specified cycle count over device lifetime:  10000
Accumulated start-stop cycles:  11
Specified load-unload count over device lifetime:  300000
Accumulated load-unload cycles:  110
Elements in grown defect list: 0

Vendor (Seagate Cache) information
  Blocks sent to initiator = 781595088
  Blocks received from initiator = 3212253592
  Blocks read from cache and sent to initiator = 1559674483
  Number of read and write commands whose size <= segment size = 16559271
  Number of read and write commands whose size > segment size = 16275

Vendor (Seagate/Hitachi) factory information
  number of hours powered up = 2330.43
  number of minutes until next internal SMART test = 26

Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:   2604945464        0         0  2604945464          0       9196.270           0
write:         0        0         0         0          0       3844.865           0

Non-medium error count:       14


[GLTSD (Global Logging Target Save Disable) set. Enable Save with '-S on']
No Self-tests have been logged

Background scan results log
  Status: waiting until BMS interval timer expires
    Accumulated power on time, hours:minutes 2330:26 [139826 minutes]
    Number of background scans performed: 18,  scan progress: 0.00%
    Number of background medium scans performed: 18

Protocol Specific port log page for SAS SSP
relative target port id = 1
  generation code = 8
  number of phys = 1
  phy identifier = 0
    attached device type: SAS or SATA device
    attached reason: unknown
    reason: unknown
    negotiated logical link rate: phy enabled; 12 Gbps
    attached initiator port: ssp=1 stp=1 smp=1
    attached target port: ssp=0 stp=0 smp=0
    SAS address = 0x5000c500bc210e4d
    attached SAS address = 0x500605b00e7075b3
    attached phy identifier = 3
    Invalid DWORD count = 0
    Running disparity error count = 0
    Loss of DWORD synchronization = 9
    Phy reset problem = 0
    Phy event descriptors:
     Invalid word count: 0
     Running disparity error count: 0
     Loss of dword synchronization count: 9
     Phy reset problem count: 0
relative target port id = 2
  generation code = 8
  number of phys = 1
  phy identifier = 1
    attached device type: no device attached
    attached reason: unknown
    reason: unknown
    negotiated logical link rate: phy enabled; unknown
    attached initiator port: ssp=0 stp=0 smp=0
    attached target port: ssp=0 stp=0 smp=0
    SAS address = 0x5000c500bc210e4e
    attached SAS address = 0x0
    attached phy identifier = 0
    Invalid DWORD count = 0
    Running disparity error count = 0
    Loss of DWORD synchronization = 0
    Phy reset problem = 0
    Phy event descriptors:
     Invalid word count: 0
     Running disparity error count: 0
     Loss of dword synchronization count: 0
     Phy reset problem count: 0


Best Reagrds

Alexander Liuckanov
Mobile Phone: +359 899988622




More information about the Smartmontools-support mailing list