[smartmontools-support] interface or disk problem?

Alex mysqlstudent at gmail.com
Fri Mar 20 19:38:15 CET 2020


Hi,

In case my previous email was illegible, I've made the smartctl output
available here:
https://pastebin.com/SvDnjaYC

On Fri, Mar 20, 2020 at 2:37 PM Alex <mysqlstudent at gmail.com> wrote:
>
> On Fri, Mar 20, 2020 at 12:23 PM Claudio Kuenzler <napsty at gmail.com> wrote:
> >
> > Run
> >
> > smartctl -a /dev/sdd
> >
> > And show the output here
>
> Ah, thanks, I meant to do that initially. I also just ran a short test
> and it appears to have passed successfully.
>
> # smartctl --all /dev/sdd
> smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.3.12-200.fc30.x86_64] (local build)
> Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
>
> === START OF INFORMATION SECTION ===
> Model Family:     Western Digital Red
> Device Model:     WDC WD30EFRX-68N32N0
> Serial Number:    WD-WCCXXXXXX
> LU WWN Device Id: 5 0014ee 20f3b0dee
> Firmware Version: 82.00A82
> User Capacity:    3,000,592,982,016 bytes [3.00 TB]
> Sector Sizes:     512 bytes logical, 4096 bytes physical
> Rotation Rate:    5400 rpm
> Form Factor:      3.5 inches
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   ACS-3 T13/2161-D revision 5
> SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
> Local Time is:    Fri Mar 20 14:33:28 2020 EDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
>
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
>
> General SMART Values:
> Offline data collection status:  (0x00) Offline data collection activity
>                                         was never started.
>                                         Auto Offline Data Collection: Disabled.
> Self-test execution status:      (   0) The previous self-test routine completed
>                                         without error or no self-test has ever
>                                         been run.
> Total time to complete Offline
> data collection:                (32820) seconds.
> Offline data collection
> capabilities:                    (0x7b) SMART execute Offline immediate.
>                                         Auto Offline data collection on/off support.
>                                         Suspend Offline collection upon new
>                                         command.
>                                         Offline surface scan supported.
>                                         Self-test supported.
>                                         Conveyance Self-test supported.
>                                         Selective Self-test supported.
> SMART capabilities:            (0x0003) Saves SMART data before entering
>                                         power-saving mode.
>                                         Supports SMART auto save timer.
> Error logging capability:        (0x01) Error logging supported.
>                                         General Purpose Logging supported.
> Short self-test routine
> recommended polling time:        (   2) minutes.
> Extended self-test routine
> recommended polling time:        ( 349) minutes.
> Conveyance self-test routine
> recommended polling time:        (   5) minutes.
> SCT capabilities:              (0x303d) SCT Status supported.
>                                         SCT Error Recovery Control supported.
>                                         SCT Feature Control supported.
>                                         SCT Data Table supported.
>
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
>   3 Spin_Up_Time            0x0027   205   164   021    Pre-fail  Always       -       4733
>   4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       80
>   5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
>   7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
>   9 Power_On_Hours          0x0032   074   074   000    Old_age   Always       -       19193
>  10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
>  11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
>  12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       80
> 192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       57
> 193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       114
> 194 Temperature_Celsius     0x0022   117   106   000    Old_age   Always       -       33
> 196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
> 197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
> 198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
> 199 UDMA_CRC_Error_Count    0x0032   200   197   000    Old_age   Always       -       127
> 200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
>
> SMART Error Log Version: 1
> No Errors Logged
>
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Short offline       Completed without error       00%     19190         -
> # 2  Extended offline    Completed without error       00%      9343         -
>
> SMART Selective self-test log data structure revision number 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>     1        0        0  Not_testing
>     2        0        0  Not_testing
>     3        0        0  Not_testing
>     4        0        0  Not_testing
>     5        0        0  Not_testing
> Selective self-test flags (0x0):
>   After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.
>
> Thanks,
> Alex
>
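In the attribute table above, the media-related counters (5 Reallocated_Sector_Ct, 197 Current_Pending_Sector, 198 Offline_Uncorrectable) all have a raw value of 0, while 199 UDMA_CRC_Error_Count has a raw value of 127. A minimal Python sketch for pulling those raw values out of saved `smartctl -A` text might look like the following. The regular expression, the function name `parse_attributes` and the `SAMPLE` string are illustrative assumptions, not part of smartmontools, and the pattern only targets the ten-column ATA attribute layout shown in this thread.

```python
import re

# One attribute row in the ten-column layout shown above:
# ID, name, flag, value, worst, thresh, type, updated, when_failed, raw value.
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+(0x[0-9a-fA-F]+)\s+(\d+)\s+(\d+)\s+(\d+)\s+"
    r"(\S+)\s+(\S+)\s+(\S+)\s+(.+?)\s*$"
)


def parse_attributes(text):
    """Return {id: (name, raw_value_string)} for every attribute row in text."""
    attrs = {}
    for line in text.splitlines():
        m = ROW.match(line)
        if m:  # header and non-table lines do not match and are skipped
            attrs[int(m.group(1))] = (m.group(2), m.group(10))
    return attrs


# Hypothetical sample: three rows copied from the output in this thread.
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x0032   200   197   000    Old_age   Always       -       127
"""

if __name__ == "__main__":
    for attr_id, (name, raw) in sorted(parse_attributes(SAMPLE).items()):
        print(attr_id, name, raw)
```

Comparing the raw value of attribute 199 between two runs taken some time apart shows whether the CRC error count is still increasing.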
> >
> > On Fri, 20 Mar 2020, 16:56 Alex <mysqlstudent at gmail.com> wrote:
> >>
> >> Hi,
> >>
> >> I have a Fedora 30 system that was working fine until messages like
> >> the ones below started appearing. The disk is a WDC WD30EFRX-68N.
> >> The messages seem to point to an interface problem, but I can't be
> >> sure. I'm hoping someone can help me pin down the cause.
> >>
> >> What other information can I provide to help troubleshoot this?
> >>
> >> [    6.754777] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
> >> [    6.755530] ata4.00: ATA-10: WDC WD30EFRX-68N32N0, 82.00A82, max UDMA/133
> >> [    6.755676] ata4.00: 5860533168 sectors, multi 16: LBA48 NCQ (depth 32), AA
> >> [    6.756530] ata4.00: configured for UDMA/133
> >> [    6.756936] scsi 3:0:0:0: Direct-Access     ATA      WDC WD30EFRX-68N 0A82 PQ: 0 ANSI: 5
> >> [    6.757470] sd 3:0:0:0: Attached scsi generic sg3 type 0
> >> [    6.757511] sd 3:0:0:0: [sdd] 5860533168 512-byte logical blocks: (3.00 TB/2.73 TiB)
> >> [    6.757858] sd 3:0:0:0: [sdd] 4096-byte physical blocks
> >> [    6.758014] sd 3:0:0:0: [sdd] Write Protect is off
> >> [    6.758154] sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
> >> [    6.758171] sd 3:0:0:0: [sdd] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
> >> [    6.785443] ata4.00: exception Emask 0x50 SAct 0x20000000 SErr 0x280901 action 0x6 frozen
> >> [    6.785683] ata4.00: irq_stat 0x08000000, interface fatal error
> >> [    6.785825] ata4: SError: { RecovData UnrecovData HostInt 10B8B BadCRC }
> >> [    6.785971] ata4.00: failed command: READ FPDMA QUEUED
> >> [    6.786117] ata4.00: cmd 60/08:e8:00:00:00/00:00:00:00:00/40 tag 29 ncq dma 4096 in
> >>                         res 40/00:e8:00:00:00/00:00:00:00:00/40 Emask 0x50 (ATA bus error)
> >> [    6.786561] ata4.00: status: { DRDY }
> >> [    6.786700] ata4: hard resetting link
> >> [    7.098766] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
> >> [    7.100161] ata4.00: configured for UDMA/133
> >> [    7.100313] ata4: EH complete
> >> [    7.118461] ata4: limiting SATA link speed to 3.0 Gbps
> >> [    7.118608] ata4.00: exception Emask 0x50 SAct 0x1 SErr 0x280900 action 0x6 frozen
> >> [    7.118844] ata4.00: irq_stat 0x08000000, interface fatal error
> >> [    7.118987] ata4: SError: { UnrecovData HostInt 10B8B BadCRC }
> >> [    7.119129] ata4.00: failed command: READ FPDMA QUEUED
> >> [    7.119274] ata4.00: cmd 60/08:00:00:00:00/00:00:00:00:00/40 tag 0 ncq dma 4096 in
> >>                         res 40/00:00:00:00:00/00:00:00:00:00/40 Emask 0x50 (ATA bus error)
> >> [    7.119713] ata4.00: status: { DRDY }
> >> [    7.119852] ata4: hard resetting link
> >> [    7.426739] ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
> >> [    7.428175] ata4.00: configured for UDMA/133
> >> [    7.428326] ata4: EH complete
> >> [    7.467092]  sdd: sdd1
> >> [    7.467737] sd 3:0:0:0: [sdd] Attached SCSI disk
> >> [    7.468101] sdd: detected capacity change from 0 to 3000592982016
> >> [    7.468393] sdd: detected capacity change from 0 to 3000592982016
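The SErr values in the log above (0x280901 and 0x280900) are bitmasks, and the kernel prints the decoded flag names on the following "SError:" line. As a rough illustration of how the two relate, here is a small Python sketch that decodes an SErr value into those names. The bit-to-name table is an assumption reconstructed from the Linux libata error handler rather than anything stated in this thread, and `decode_serr` is a hypothetical helper name.

```python
# Bit position -> short name as printed on the kernel's "SError: { ... }" line.
# Assumption: this table follows the Linux libata source; verify it against
# your kernel version before relying on it.
SERR_BITS = {
    0: "RecovData",
    1: "RecovComm",
    8: "UnrecovData",
    9: "Persist",
    10: "Proto",
    11: "HostInt",
    16: "PHYRdyChg",
    17: "PHYInt",
    18: "CommWake",
    19: "10B8B",
    20: "Dispar",
    21: "BadCRC",
    22: "Handshk",
    23: "LinkSeq",
    24: "TrStaTrns",
    25: "UnrecFIS",
    26: "DevExch",
}


def decode_serr(value):
    """Return the names of all known flags set in an SErr value, lowest bit first."""
    return [name for bit, name in sorted(SERR_BITS.items()) if value & (1 << bit)]


if __name__ == "__main__":
    # The two SErr values reported in the log above.
    for serr in (0x280901, 0x280900):
        print(hex(serr), "{ " + " ".join(decode_serr(serr)) + " }")
```

Both values have the 10B8B and BadCRC bits set, which are link-level decode and CRC errors. That fits the "interface fatal error" and "ATA bus error" wording in the same log lines.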

