[smartmontools-support] interface or disk problem?
Bruce Allen
bruce.allen at aei.mpg.de
Tue Mar 24 15:03:06 CET 2020
I agree with Claudio' suggestion. These types of errors are often
connected to problems with cables or connectors.
On 20.03.20 20:06, Claudio Kuenzler wrote:
> Thx!
>
> The only thing I currently see are the udma crc errors count, which
> would prove your idea that it could be a connection issue.
>
> The count of errors in data transfer via the interface cable as
> determined by ICRC (Interface Cyclic Redundancy Check).
>
> Did you already try another sata cable or sata connector/slot?
>
> On Fri, 20 Mar 2020, 19:37 Alex <mysqlstudent at gmail.com
> <mailto:mysqlstudent at gmail.com>> wrote:
>
> On Fri, Mar 20, 2020 at 12:23 PM Claudio Kuenzler <napsty at gmail.com
> <mailto:napsty at gmail.com>> wrote:
> >
> > Run
> >
> > smartctl -a /dev/sdd
> >
> > And show the output here
>
> Ah, thanks, I meant to do that initially. I also just ran a short test
> and it appears to have passed successfully.
>
> # smartctl --all /dev/sdd
> smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.3.12-200.fc30.x86_64]
> (local build)
> Copyright (C) 2002-19, Bruce Allen, Christian Franke,
> www.smartmontools.org <http://www.smartmontools.org>
>
> === START OF INFORMATION SECTION ===
> Model Family: Western Digital Red
> Device Model: WDC WD30EFRX-68N32N0
> Serial Number: WD-WCCXXXXXX
> LU WWN Device Id: 5 0014ee 20f3b0dee
> Firmware Version: 82.00A82
> User Capacity: 3,000,592,982,016 bytes [3.00 TB]
> Sector Sizes: 512 bytes logical, 4096 bytes physical
> Rotation Rate: 5400 rpm
> Form Factor: 3.5 inches
> Device is: In smartctl database [for details use: -P show]
> ATA Version is: ACS-3 T13/2161-D revision 5
> SATA Version is: SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
> Local Time is: Fri Mar 20 14:33:28 2020 EDT
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
>
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
>
> General SMART Values:
> Offline data collection status: (0x00) Offline data collection activity
> was never started.
> Auto Offline Data
> Collection: Disabled.
> Self-test execution status: ( 0) The previous self-test
> routine completed
> without error or no
> self-test has ever
> been run.
> Total time to complete Offline
> data collection: (32820) seconds.
> Offline data collection
> capabilities: (0x7b) SMART execute Offline immediate.
> Auto Offline data collection
> on/off support.
> Suspend Offline collection
> upon new
> command.
> Offline surface scan supported.
> Self-test supported.
> Conveyance Self-test supported.
> Selective Self-test supported.
> SMART capabilities: (0x0003) Saves SMART data before entering
> power-saving mode.
> Supports SMART auto save timer.
> Error logging capability: (0x01) Error logging supported.
> General Purpose Logging
> supported.
> Short self-test routine
> recommended polling time: ( 2) minutes.
> Extended self-test routine
> recommended polling time: ( 349) minutes.
> Conveyance self-test routine
> recommended polling time: ( 5) minutes.
> SCT capabilities: (0x303d) SCT Status supported.
> SCT Error Recovery Control
> supported.
> SCT Feature Control supported.
> SCT Data Table supported.
>
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE
> UPDATED WHEN_FAILED RAW_VALUE
> 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail
> Always - 0
> 3 Spin_Up_Time 0x0027 205 164 021 Pre-fail
> Always - 4733
> 4 Start_Stop_Count 0x0032 100 100 000 Old_age
> Always - 80
> 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail
> Always - 0
> 7 Seek_Error_Rate 0x002e 200 200 000 Old_age
> Always - 0
> 9 Power_On_Hours 0x0032 074 074 000 Old_age
> Always - 19193
> 10 Spin_Retry_Count 0x0032 100 253 000 Old_age
> Always - 0
> 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age
> Always - 0
> 12 Power_Cycle_Count 0x0032 100 100 000 Old_age
> Always - 80
> 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age
> Always - 57
> 193 Load_Cycle_Count 0x0032 200 200 000 Old_age
> Always - 114
> 194 Temperature_Celsius 0x0022 117 106 000 Old_age
> Always - 33
> 196 Reallocated_Event_Count 0x0032 200 200 000 Old_age
> Always - 0
> 197 Current_Pending_Sector 0x0032 200 200 000 Old_age
> Always - 0
> 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age
> Offline - 0
> 199 UDMA_CRC_Error_Count 0x0032 200 197 000 Old_age
> Always - 127
> 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age
> Offline - 0
>
> SMART Error Log Version: 1
> No Errors Logged
>
> SMART Self-test log structure revision number 1
> Num Test_Description Status Remaining
> LifeTime(hours) LBA_of_first_error
> # 1 Short offline Completed without error 00%
> 19190 -
> # 2 Extended offline Completed without error 00%
> 9343 -
>
> SMART Selective self-test log data structure revision number 1
> SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
> 1 0 0 Not_testing
> 2 0 0 Not_testing
> 3 0 0 Not_testing
> 4 0 0 Not_testing
> 5 0 0 Not_testing
> Selective self-test flags (0x0):
> After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute
> delay.
>
> Thanks,
> Alex
>
> >
> > On Fri, 20 Mar 2020, 16:56 Alex <mysqlstudent at gmail.com
> <mailto:mysqlstudent at gmail.com>> wrote:
> >>
> >> Hi,
> >>
> >> I have a fedora30 system that was working fine until a message
> similar
> >> to the one below occurred. This is for a WDC WD30EFRX-68N disk. It
> >> seems to indicate it's an interface problem, but I can't be sure. I'm
> >> hoping someone can help me identify the problem for sure.
> >>
> >> What more information can I provide to help troubleshoot this
> problem?
> >>
> >> [ 6.754777] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
> >> [ 6.755530] ata4.00: ATA-10: WDC WD30EFRX-68N32N0, 82.00A82,
> max UDMA/133
> >> [ 6.755676] ata4.00: 5860533168 sectors, multi 16: LBA48 NCQ
> (depth 32), AA
> >> [ 6.756530] ata4.00: configured for UDMA/133
> >> [ 6.756936] scsi 3:0:0:0: Direct-Access ATA WDC
> >> WD30EFRX-68N 0A82 PQ: 0 ANSI: 5
> >> [ 6.757470] sd 3:0:0:0: Attached scsi generic sg3 type 0
> >> [ 6.757511] sd 3:0:0:0: [sdd] 5860533168 512-byte logical blocks:
> >> (3.00 TB/2.73 TiB)
> >> [ 6.757858] sd 3:0:0:0: [sdd] 4096-byte physical blocks
> >> [ 6.758014] sd 3:0:0:0: [sdd] Write Protect is off
> >> [ 6.758154] sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
> >> [ 6.758171] sd 3:0:0:0: [sdd] Write cache: enabled, read cache:
> >> enabled, doesn't support DPO or FUA
> >> [ 6.785443] ata4.00: exception Emask 0x50 SAct 0x20000000 SErr
> >> 0x280901 action 0x6 frozen
> >> [ 6.785683] ata4.00: irq_stat 0x08000000, interface fatal error
> >> [ 6.785825] ata4: SError: { RecovData UnrecovData HostInt
> 10B8B BadCRC }
> >> [ 6.785971] ata4.00: failed command: READ FPDMA QUEUED
> >> [ 6.786117] ata4.00: cmd 60/08:e8:00:00:00/00:00:00:00:00/40
> tag 29
> >> ncq dma 4096 in
> >> res 40/00:e8:00:00:00/00:00:00:00:00/40 Emask
> >> 0x50 (ATA bus error)
> >> [ 6.786561] ata4.00: status: { DRDY }
> >> [ 6.786700] ata4: hard resetting link
> >> [ 7.098766] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
> >> [ 7.100161] ata4.00: configured for UDMA/133
> >> [ 7.100313] ata4: EH complete
> >> [ 7.118461] ata4: limiting SATA link speed to 3.0 Gbps
> >> [ 7.118608] ata4.00: exception Emask 0x50 SAct 0x1 SErr 0x280900
> >> action 0x6 frozen
> >> [ 7.118844] ata4.00: irq_stat 0x08000000, interface fatal error
> >> [ 7.118987] ata4: SError: { UnrecovData HostInt 10B8B BadCRC }
> >> [ 7.119129] ata4.00: failed command: READ FPDMA QUEUED
> >> [ 7.119274] ata4.00: cmd 60/08:00:00:00:00/00:00:00:00:00/40 tag 0
> >> ncq dma 4096 in
> >> res 40/00:00:00:00:00/00:00:00:00:00/40 Emask
> >> 0x50 (ATA bus error)
> >> [ 7.119713] ata4.00: status: { DRDY }
> >> [ 7.119852] ata4: hard resetting link
> >> [ 7.426739] ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
> >> [ 7.428175] ata4.00: configured for UDMA/133
> >> [ 7.428326] ata4: EH complete
> >> [ 7.467092] sdd: sdd1
> >> [ 7.467737] sd 3:0:0:0: [sdd] Attached SCSI disk
> >> [ 7.468101] sdd: detected capacity change from 0 to 3000592982016
> >> [ 7.468393] sdd: detected capacity change from 0 to 3000592982016
> >> _______________________________________________
> >> Smartmontools-support mailing list
> >> Smartmontools-support at listi.jpberlin.de
> <mailto:Smartmontools-support at listi.jpberlin.de>
> >> https://listi.jpberlin.de/mailman/listinfo/smartmontools-support
>
>
> _______________________________________________
> Smartmontools-support mailing list
> Smartmontools-support at listi.jpberlin.de
> https://listi.jpberlin.de/mailman/listinfo/smartmontools-support
>
--
--------------------------------------------------------------------------
Prof. Dr. Bruce Allen, Director
Max Planck Institute for Gravitational Physics (Albert Einstein Institute)
Callinstrasse 38
D-30167 Hannover, Germany
Tel +49-511-762-17145
Fax +49-511-762-17182
Email: bruce.allen at aei.mpg.de
More information about the Smartmontools-support
mailing list