How to correct FAILURE - READ_DMA



  • I upgraded from 1.2.3 to 2.0RC3, and after a few days I got:

    Sep  5 04:02:21 tbfirewall kernel: ad0: FAILURE - READ_DMA status=51 <ready,dsc,error> error=40 <uncorrectable> LBA=3919
    Sep  5 04:02:21 tbfirewall kernel: g_vfs_done():ad0s1a[READ(offset=1966080, length=16384)]error = 5
    Sep  5 04:02:22 tbfirewall kernel: ad0: FAILURE - READ_DMA status=51 <ready,dsc,error> error=40 <uncorrectable> LBA=3919
    Sep  5 04:02:22 tbfirewall kernel: g_vfs_done():ad0s1a[READ(offset=1966080, length=16384)]error = 5
    Sep  5 04:02:23 tbfirewall kernel: ad0: FAILURE - READ_DMA status=51 <ready,dsc,error> error=40 <uncorrectable> LBA=3919
    Sep  5 04:02:23 tbfirewall kernel: g_vfs_done():ad0s1a[READ(offset=1966080, length=16384)]error = 5

    SMART Attributes Data Structure revision number: 11
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x0029   100   253   020    Pre-fail  Offline  -           0
      3 Spin_Up_Time            0x0027   080   079   020    Pre-fail  Always   -           2585
      4 Start_Stop_Count        0x0032   096   096   008    Old_age   Always   -           3044
      5 Reallocated_Sector_Ct   0x0033   100   100   020    Pre-fail  Always   -           2
      7 Seek_Error_Rate         0x000b   093   024   023    Pre-fail  Always   -           1
      9 Power_On_Hours          0x0012   087   087   001    Old_age   Always   -           8787
     10 Spin_Retry_Count        0x0026   100   100   000    Old_age   Always   -           0
     11 Calibration_Retry_Count 0x0013   100   100   020    Pre-fail  Always   -           0
     12 Power_Cycle_Count       0x0032   096   096   008    Old_age   Always   -           3035
     13 Read_Soft_Error_Rate    0x000b   100   100   023    Pre-fail  Always   -           0
    195 Hardware_ECC_Recovered  0x001a   100   001   000    Old_age   Always   -           176692277
    196 Reallocated_Event_Count 0x0010   100   100   020    Old_age   Offline  -           0
    197 Current_Pending_Sector  0x0032   100   100   020    Old_age   Always   -           2
    198 Offline_Uncorrectable   0x0010   100   253   000    Old_age   Offline  -           0
    199 UDMA_CRC_Error_Count    0x001a   077   077   000    Old_age   Always   -           123

    Warning: device does not support Error Logging
    SMART Error Log Version: 0
    No Errors Logged

    Warning: device does not support Self Test Logging
    SMART Self-test log structure revision number 1
    Num  Test_Description    Status                     Remaining  LifeTime(hours)  LBA_of_first_error
    1    Extended offline    Completed without error    00%        4923             -
    2    Short offline       Completed without error    00%        4923             -
    3    Extended offline    Completed: read failure    90%        4922             4872

    I found this thread: http://forums.freebsd.org/showthread.php?t=4508

    I can't work out how fsdb should be installed. If I run:
    /usr/sbin/pkg_add fsdb
    I get:
    pkg_add: can't stat package file 'fsdb'

    I searched with find but got no match.

    Can you please kindly help me?



  • Hope this helps others with this stressful IDE disk problem:

    • Check the BIOS: disable LBA mode and set the IDE controller to AHCI.

    • Disable DMA: add hw.ata.atapi_dma=0 to /boot/loader.conf

    bye


  • Netgate Administrator

    To add packages you would need to use something like:

    pkg_add -r fsdb
    

    Steve

    Edit: However it isn't an available package on the FreeBSD ftp server.


  • Rebel Alliance Developer Netgate

    Back up your data; the drive is likely dying.

    You can also check Diagnostics > SMART Status.

    If it were random LBAs/offsets, I might be inclined to say it could be a driver/cable/controller issue, but to see the same LBA/offset over and over screams "dying disk".
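
The "same LBA over and over" check can be sketched in a few lines of Python. This is only an illustration, not a pfSense or FreeBSD tool: the function names `lba_counts` and `looks_like_media_fault` are made up for this example, the sample lines are taken from the kernel log earlier in the thread (spacing normalized), and the rule "one distinct LBA, seen more than once" is an assumed heuristic rather than a definitive diagnosis.

```python
import re
from collections import Counter

# Sample lines taken from the kernel log posted earlier in this thread.
LOG = """\
Sep  5 04:02:21 tbfirewall kernel: ad0: FAILURE - READ_DMA status=51 <ready,dsc,error> error=40 <uncorrectable> LBA=3919
Sep  5 04:02:21 tbfirewall kernel: g_vfs_done():ad0s1a[READ(offset=1966080, length=16384)]error = 5
Sep  5 04:02:22 tbfirewall kernel: ad0: FAILURE - READ_DMA status=51 <ready,dsc,error> error=40 <uncorrectable> LBA=3919
Sep  5 04:02:22 tbfirewall kernel: g_vfs_done():ad0s1a[READ(offset=1966080, length=16384)]error = 5
Sep  5 04:02:23 tbfirewall kernel: ad0: FAILURE - READ_DMA status=51 <ready,dsc,error> error=40 <uncorrectable> LBA=3919
Sep  5 04:02:23 tbfirewall kernel: g_vfs_done():ad0s1a[READ(offset=1966080, length=16384)]error = 5
"""

# Matches the LBA number on READ_DMA failure lines only.
LBA_RE = re.compile(r"FAILURE - READ_DMA.*?LBA=(\d+)")


def lba_counts(log_text):
    """Count how often each LBA appears in READ_DMA failure lines."""
    return Counter(int(m.group(1)) for m in LBA_RE.finditer(log_text))


def looks_like_media_fault(counts):
    """Assumed heuristic: every failure hits one LBA, and it repeats."""
    return len(counts) == 1 and sum(counts.values()) > 1


if __name__ == "__main__":
    counts = lba_counts(LOG)
    print(dict(counts))
    print(looks_like_media_fault(counts))
```

If the failures were spread over many different LBAs, `looks_like_media_fault` would return False, which would point more toward the cable, controller, or driver.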

