[smartmontools-support] interface or disk problem?

Bruce Allen bruce.allen at aei.mpg.de
Tue Mar 24 15:03:06 CET 2020


I agree with Claudio's suggestion.  Errors of this type are often
caused by problems with cables or connectors.
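
For anyone who wants to keep an eye on that counter after swapping the
cable, here is a minimal Python sketch that pulls attribute 199 out of
the attribute table printed by `smartctl -A`. The regular expression,
the function name parse_attributes and the two sample rows (copied from
the report quoted below) are illustrative assumptions, not part of
smartmontools.

```python
import re

# One row of the "Vendor Specific SMART Attributes" table from `smartctl -A`.
# Assumption: the usual ten-column ATA layout shown in this thread.
ATTR_ROW = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+0x[0-9a-fA-F]+\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"\S+\s+\S+\s+\S+\s+(?P<raw>.+?)\s*$"
)


def parse_attributes(text):
    """Return {attribute id: {"name": ..., "raw": ...}} for each table row."""
    attrs = {}
    for line in text.splitlines():
        m = ATTR_ROW.match(line)
        if m:
            attrs[int(m.group("id"))] = {
                "name": m.group("name"),
                "raw": m.group("raw"),
            }
    return attrs


# Two rows taken from the report in this thread, with the mail wrapping undone.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
199 UDMA_CRC_Error_Count    0x0032   200   197   000    Old_age   Always       -       127
"""

attrs = parse_attributes(SAMPLE)
crc = attrs[199]
print(crc["name"], "raw =", crc["raw"])
```

The raw value is normally a lifetime counter, so the absolute number
matters less than whether it keeps growing once the cable or port has
been changed.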


On 20.03.20 20:06, Claudio Kuenzler wrote:
> Thx!
> 
> The only thing I currently see is the UDMA CRC error count, which
> would support your idea that it could be a connection issue.
> 
> That attribute is the count of errors in data transfer via the
> interface cable, as determined by ICRC (Interface Cyclic Redundancy
> Check).
> 
> Did you already try another SATA cable or SATA connector/slot?
> 
> On Fri, 20 Mar 2020, 19:37 Alex <mysqlstudent at gmail.com> wrote:
> 
>     On Fri, Mar 20, 2020 at 12:23 PM Claudio Kuenzler <napsty at gmail.com> wrote:
>     >
>     > Run
>     >
>     > smartctl -a /dev/sdd
>     >
>     > And show the output here
> 
>     Ah, thanks, I meant to do that initially. I also just ran a short test
>     and it appears to have passed successfully.
> 
>     # smartctl --all /dev/sdd
>     smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.3.12-200.fc30.x86_64] (local build)
>     Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
> 
>     === START OF INFORMATION SECTION ===
>     Model Family:     Western Digital Red
>     Device Model:     WDC WD30EFRX-68N32N0
>     Serial Number:    WD-WCCXXXXXX
>     LU WWN Device Id: 5 0014ee 20f3b0dee
>     Firmware Version: 82.00A82
>     User Capacity:    3,000,592,982,016 bytes [3.00 TB]
>     Sector Sizes:     512 bytes logical, 4096 bytes physical
>     Rotation Rate:    5400 rpm
>     Form Factor:      3.5 inches
>     Device is:        In smartctl database [for details use: -P show]
>     ATA Version is:   ACS-3 T13/2161-D revision 5
>     SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
>     Local Time is:    Fri Mar 20 14:33:28 2020 EDT
>     SMART support is: Available - device has SMART capability.
>     SMART support is: Enabled
> 
>     === START OF READ SMART DATA SECTION ===
>     SMART overall-health self-assessment test result: PASSED
> 
>     General SMART Values:
>     Offline data collection status:  (0x00) Offline data collection activity
>                                             was never started.
>                                             Auto Offline Data Collection: Disabled.
>     Self-test execution status:      (   0) The previous self-test routine completed
>                                             without error or no self-test has ever
>                                             been run.
>     Total time to complete Offline
>     data collection:                (32820) seconds.
>     Offline data collection
>     capabilities:                    (0x7b) SMART execute Offline immediate.
>                                             Auto Offline data collection on/off support.
>                                             Suspend Offline collection upon new
>                                             command.
>                                             Offline surface scan supported.
>                                             Self-test supported.
>                                             Conveyance Self-test supported.
>                                             Selective Self-test supported.
>     SMART capabilities:            (0x0003) Saves SMART data before entering
>                                             power-saving mode.
>                                             Supports SMART auto save timer.
>     Error logging capability:        (0x01) Error logging supported.
>                                             General Purpose Logging supported.
>     Short self-test routine
>     recommended polling time:        (   2) minutes.
>     Extended self-test routine
>     recommended polling time:        ( 349) minutes.
>     Conveyance self-test routine
>     recommended polling time:        (   5) minutes.
>     SCT capabilities:              (0x303d) SCT Status supported.
>                                             SCT Error Recovery Control supported.
>                                             SCT Feature Control supported.
>                                             SCT Data Table supported.
> 
>     SMART Attributes Data Structure revision number: 16
>     Vendor Specific SMART Attributes with Thresholds:
>     ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>       1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
>       3 Spin_Up_Time            0x0027   205   164   021    Pre-fail  Always       -       4733
>       4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       80
>       5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
>       7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
>       9 Power_On_Hours          0x0032   074   074   000    Old_age   Always       -       19193
>      10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
>      11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
>      12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       80
>     192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       57
>     193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       114
>     194 Temperature_Celsius     0x0022   117   106   000    Old_age   Always       -       33
>     196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
>     197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
>     198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
>     199 UDMA_CRC_Error_Count    0x0032   200   197   000    Old_age   Always       -       127
>     200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
> 
>     SMART Error Log Version: 1
>     No Errors Logged
> 
>     SMART Self-test log structure revision number 1
>     Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
>     # 1  Short offline       Completed without error       00%     19190         -
>     # 2  Extended offline    Completed without error       00%      9343         -
> 
>     SMART Selective self-test log data structure revision number 1
>      SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>         1        0        0  Not_testing
>         2        0        0  Not_testing
>         3        0        0  Not_testing
>         4        0        0  Not_testing
>         5        0        0  Not_testing
>     Selective self-test flags (0x0):
>       After scanning selected spans, do NOT read-scan remainder of disk.
>     If Selective self-test is pending on power-up, resume after 0 minute delay.
> 
>     Thanks,
>     Alex
> 
>     >
>     > On Fri, 20 Mar 2020, 16:56 Alex <mysqlstudent at gmail.com> wrote:
>     >>
>     >> Hi,
>     >>
>     >> I have a fedora30 system that was working fine until a message similar
>     >> to the one below occurred. This is for a WDC WD30EFRX-68N disk. It
>     >> seems to indicate it's an interface problem, but I can't be sure. I'm
>     >> hoping someone can help me identify the problem for sure.
>     >>
>     >> What more information can I provide to help troubleshoot this problem?
>     >>
>     >> [    6.754777] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
>     >> [    6.755530] ata4.00: ATA-10: WDC WD30EFRX-68N32N0, 82.00A82, max UDMA/133
>     >> [    6.755676] ata4.00: 5860533168 sectors, multi 16: LBA48 NCQ (depth 32), AA
>     >> [    6.756530] ata4.00: configured for UDMA/133
>     >> [    6.756936] scsi 3:0:0:0: Direct-Access     ATA      WDC WD30EFRX-68N 0A82 PQ: 0 ANSI: 5
>     >> [    6.757470] sd 3:0:0:0: Attached scsi generic sg3 type 0
>     >> [    6.757511] sd 3:0:0:0: [sdd] 5860533168 512-byte logical blocks: (3.00 TB/2.73 TiB)
>     >> [    6.757858] sd 3:0:0:0: [sdd] 4096-byte physical blocks
>     >> [    6.758014] sd 3:0:0:0: [sdd] Write Protect is off
>     >> [    6.758154] sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
>     >> [    6.758171] sd 3:0:0:0: [sdd] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
>     >> [    6.785443] ata4.00: exception Emask 0x50 SAct 0x20000000 SErr 0x280901 action 0x6 frozen
>     >> [    6.785683] ata4.00: irq_stat 0x08000000, interface fatal error
>     >> [    6.785825] ata4: SError: { RecovData UnrecovData HostInt 10B8B BadCRC }
>     >> [    6.785971] ata4.00: failed command: READ FPDMA QUEUED
>     >> [    6.786117] ata4.00: cmd 60/08:e8:00:00:00/00:00:00:00:00/40 tag 29 ncq dma 4096 in
>     >>                         res 40/00:e8:00:00:00/00:00:00:00:00/40 Emask 0x50 (ATA bus error)
>     >> [    6.786561] ata4.00: status: { DRDY }
>     >> [    6.786700] ata4: hard resetting link
>     >> [    7.098766] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
>     >> [    7.100161] ata4.00: configured for UDMA/133
>     >> [    7.100313] ata4: EH complete
>     >> [    7.118461] ata4: limiting SATA link speed to 3.0 Gbps
>     >> [    7.118608] ata4.00: exception Emask 0x50 SAct 0x1 SErr 0x280900 action 0x6 frozen
>     >> [    7.118844] ata4.00: irq_stat 0x08000000, interface fatal error
>     >> [    7.118987] ata4: SError: { UnrecovData HostInt 10B8B BadCRC }
>     >> [    7.119129] ata4.00: failed command: READ FPDMA QUEUED
>     >> [    7.119274] ata4.00: cmd 60/08:00:00:00:00/00:00:00:00:00/40 tag 0 ncq dma 4096 in
>     >>                         res 40/00:00:00:00:00/00:00:00:00:00/40 Emask 0x50 (ATA bus error)
>     >> [    7.119713] ata4.00: status: { DRDY }
>     >> [    7.119852] ata4: hard resetting link
>     >> [    7.426739] ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
>     >> [    7.428175] ata4.00: configured for UDMA/133
>     >> [    7.428326] ata4: EH complete
>     >> [    7.467092]  sdd: sdd1
>     >> [    7.467737] sd 3:0:0:0: [sdd] Attached SCSI disk
>     >> [    7.468101] sdd: detected capacity change from 0 to 3000592982016
>     >> [    7.468393] sdd: detected capacity change from 0 to 3000592982016
>     >> _______________________________________________
>     >> Smartmontools-support mailing list
>     >> Smartmontools-support at listi.jpberlin.de
>     >> https://listi.jpberlin.de/mailman/listinfo/smartmontools-support
> 
> 
> 
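
As a cross-check on the kernel log quoted above, the SErr values can be
decoded by hand. The sketch below is only an illustration: the bit
table is written down from memory of the names the Linux libata driver
prints, so treat it as an assumption and verify it against the kernel
source before relying on it. The function name decode_serror is made
up for this example.

```python
# SError bit positions and the short names printed in the kernel log.
# Assumption: reproduced from memory of the libata driver; please verify.
SERROR_BITS = [
    (0, "RecovData"), (1, "RecovComm"), (8, "UnrecovData"), (9, "Persist"),
    (10, "Proto"), (11, "HostInt"), (16, "PHYRdyChg"), (17, "PHYInt"),
    (18, "CommWake"), (19, "10B8B"), (20, "Dispar"), (21, "BadCRC"),
    (22, "Handshk"), (23, "LinkSeq"), (24, "TrStaTrns"), (25, "UnrecFIS"),
    (26, "DevExch"),
]


def decode_serror(value):
    """Return the names of all bits set in an SError register value."""
    return [name for bit, name in SERROR_BITS if value & (1 << bit)]


# The two SErr values that appear in the quoted log.
first = decode_serror(0x280901)
second = decode_serror(0x280900)
print(first)
print(second)
```

Both values include 10B8B (10b-to-8b decode error) and BadCRC, which
are link-level errors and fit the cable or connector explanation.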

-- 
--------------------------------------------------------------------------
Prof. Dr. Bruce Allen, Director
Max Planck Institute for Gravitational Physics (Albert Einstein Institute)
Callinstrasse 38
D-30167 Hannover,  Germany
Tel +49-511-762-17145
Fax +49-511-762-17182
Email: bruce.allen at aei.mpg.de

