Hardware Canucks

Hardware Canucks (http://www.hardwarecanucks.com/forum/)
-   Troubleshooting (http://www.hardwarecanucks.com/forum/troubleshooting/)
-   -   [S.M.A.R.T.] Raw Read Error Rate Failed? (http://www.hardwarecanucks.com/forum/troubleshooting/55147-s-m-r-t-raw-read-error-rate-failed.html)

Zaqueo June 30, 2012 09:56 AM

[S.M.A.R.T.] Raw Read Error Rate Failed?
 
Hello guys, thanks for watching this post. So I just wake up and the OS told me that the HD was having some issues. So I did a Chkdsk on Windows.. and there was no problem. Then I booted Ubuntu and saw that SMART found a problem. And then I saw this. (Look at ID:01 - Raw Read Error Rate)

Code:

HD Tune: SAMSUNG HD502HJ Health

ID                              Current  Worst    ThresholdData    Status 
(01) Raw Read Error Rate        1        1        51      33031    Failed 
(02) Throughput Performance      252      252      0        0        Ok     
(03) Spin Up Time                83      80      25      5238    Ok     
(04) Start/Stop Count            100      100      0        492      Ok     
(05) Reallocated Sector Count    252      252      10      0        Ok     
(07) Seek Error Rate            252      252      51      0        Ok     
(08) Seek Time Performance      252      252      15      0        Ok     
(09) Power On Hours Count        100      100      0        1556    Ok     
(0A) Spin Retry Count            252      252      51      0        Ok     
(0B) Calibration Retry Count    252      252      0        0        Ok     
(0C) Power Cycle Count          100      100      0        504      Ok     
(BF) G-sense Error Rate          100      100      0        1        Ok     
(C0) Power Off Retract Count    252      252      0        0        Ok     
(C2) Temperature                64      56      0        917538  Ok     
(C3) Hardware ECC Recovered      100      100      0        0        Ok     
(C4) Reallocated Event Count    252      252      0        0        Ok     
(C5) Current Pending Sector      99      99      0        72      Ok     
(C6) Offline Uncorrectable      252      252      0        0        Ok     
(C7) Ultra DMA CRC Error Count  100      100      0        1        Ok     
(C8) Write Error Rate            100      100      0        28      Ok     
(DF) Load/Unload Retry Count    252      252      0        0        Ok     
(E1) Load/Unload Cycle Count    100      100      0        507      Ok     

Power On Time        : 1556
Health Status        : Failed

I googled some other SMART HD examples from other HDs and for Raw Read Error Rate, the "worst" value is between 100 or 200, something like that..
Why mine is 1? Maybe it's not that bad? Actually I'm on Windows right now, of course SMART is still telling me that there is a problem but i haven't lost any file.. I'm worried to not be able to boot in the future. Maybe its a problem from SMART? What do you recommend guys?

Thanks for the patience.
c ya

moocow June 30, 2012 02:38 PM

Back up all your stuff, wipe data and RMA the drive. Once SMART fails there's no point in thinking what the problem is as SMART is a self monitoring feature. I RMA drives that didn't trip SMART but still got crap load of bad sectors.

Zaqueo July 1, 2012 07:17 PM

Thanks for the response moocow.

I backed up all my data. Luckily the PC is still in warranty and I'll use it to RMA the HDD for a new one (hope so!)

Not important, but, any advice about what should I say to the people about the proplem? I hope they understand at least what is a "SMART error..." and why its critical!

Thanks again,
Great community!

frontier204 July 1, 2012 09:23 PM

You can also say the hard drive failed a self-test: any half-competent hard disk manufacturer would issue RMA for that. Given that 1 is the lowest possible value a drive can be assigned, I'm surprised you can still boot off that hard drive.

Since you have a Ubuntu Live CD, you can run a self test as well typing this into the terminal:
Code:

sudo smartctl --test=long /dev/sdX
Where X is the letter Linux gave your drive (probably /dev/sda).
Assuming the hard disk isn't dead by the end of the test, you can review the test result:
Code:

sudo smartctl --all /dev/sdX
Also see:
https://help.ubuntu.com/community/Sm...esting_a_Drive

Zaqueo July 1, 2012 09:46 PM

Thanks Frontier204, I'll test my HDD tonight and, if needed, I'll print the result as proof of the failure. (i have a bunch of proofs anyways..)

Quote:

Originally Posted by frontier204 (Post 638847)
Assuming the hard disk isn't dead by the end of the test, you can review the test result:

LOL! Haha, this'll be its final test =P
Thaks again, i'll keep you updated.

EDIT: Here is the log from the smartctl command, if interested. See ya!

Code:

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x00)    Offline data collection activity
                    was never started.
                    Auto Offline Data Collection: Disabled.
Self-test execution status:      ( 137)    The previous self-test completed having
                    a test element that failed and the
                    device is suspected of having handling
                    damage.
Total time to complete Offline
data collection:        ( 4740) seconds.
Offline data collection
capabilities:              (0x5b) SMART execute Offline immediate.
                    Auto Offline data collection on/off support.
                    Suspend Offline collection upon new
                    command.
                    Offline surface scan supported.
                    Self-test supported.
                    No Conveyance Self-test supported.
                    Selective Self-test supported.
SMART capabilities:            (0x0003)    Saves SMART data before entering
                    power-saving mode.
                    Supports SMART auto save timer.
Error logging capability:        (0x01)    Error logging supported.
                    General Purpose Logging supported.
Short self-test routine
recommended polling time:      (  2) minutes.
Extended self-test routine
recommended polling time:      (  79) minutes.
SCT capabilities:            (0x003f)    SCT Status supported.
                    SCT Error Recovery Control supported.
                    SCT Feature Control supported.
                    SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate    0x002f  001  001  051    Pre-fail  Always  FAILING_NOW 33250
  2 Throughput_Performance  0x0026  252  252  000    Old_age  Always      -      0
  3 Spin_Up_Time            0x0023  083  080  025    Pre-fail  Always      -      5408
  4 Start_Stop_Count        0x0032  100  100  000    Old_age  Always      -      494
  5 Reallocated_Sector_Ct  0x0033  252  252  010    Pre-fail  Always      -      0
  7 Seek_Error_Rate        0x002e  252  252  051    Old_age  Always      -      0
  8 Seek_Time_Performance  0x0024  252  252  015    Old_age  Offline      -      0
  9 Power_On_Hours          0x0032  100  100  000    Old_age  Always      -      1587
 10 Spin_Retry_Count        0x0032  252  252  051    Old_age  Always      -      0
 11 Calibration_Retry_Count 0x0032  252  252  000    Old_age  Always      -      0
 12 Power_Cycle_Count      0x0032  100  100  000    Old_age  Always      -      506
191 G-Sense_Error_Rate      0x0022  100  100  000    Old_age  Always      -      1
192 Power-Off_Retract_Count 0x0022  252  252  000    Old_age  Always      -      0
194 Temperature_Celsius    0x0002  063  056  000    Old_age  Always      -      37 (Min/Max 14/44)
195 Hardware_ECC_Recovered  0x003a  100  100  000    Old_age  Always      -      0
196 Reallocated_Event_Count 0x0032  252  252  000    Old_age  Always      -      0
197 Current_Pending_Sector  0x0032  099  099  000    Old_age  Always      -      73
198 Offline_Uncorrectable  0x0030  252  252  000    Old_age  Offline      -      0
199 UDMA_CRC_Error_Count    0x0036  100  100  000    Old_age  Always      -      1
200 Multi_Zone_Error_Rate  0x002a  100  100  000    Old_age  Always      -      28
223 Load_Retry_Count        0x0032  252  252  000    Old_age  Always      -      0
225 Load_Cycle_Count        0x0032  100  100  000    Old_age  Always      -      509

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: handling damage??  90%      1585        0
# 2  Extended offline    Completed: handling damage??  90%      1585        0
# 3  Short offline      Completed: handling damage??  90%      1558        0
# 4  Short offline      Completed: handling damage??  90%      1558        0

Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run
SMART Selective self-test log data structure revision number 0
Note: revision number not 1 implies that no selective self-test has ever been run
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Completed_handling_damage?? [90% left] (0-65535)
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


Zaqueo July 12, 2012 01:54 PM

Ok guys. Just a quick update and you can close the thread.
Finally I RMA the HDD and right now I have a SeaGate 500GB, same characteristics than the Samsung (i guess)
There was nothing I could do about the broken HDD.. so yea, the solution is to RMA the HDD.

Thanks all for your help and interest.
Salutes!


All times are GMT -7. The time now is 08:56 PM.