[hard drive] New HD failing
Links: home · search · speed test · login · more ·
Links: Reply New Topic
Forums » Technical » Computer Hardware Help » [hard drive] New HD failing
page: 1 · 2
Oleg @ 26th Oct 05:21PM:
[hard drive] New HD failing
I have just got a new external and seems like it could be on it's way out if you look at the S.M.A.R.T
Looking at HDT721010SLA360
reply
srr2 @ 26th Oct 07:29PM:
Re: [hard drive] New HD failing
What problem? Everything looks normal.
reply
Oleg @ 26th Oct 07:39PM:
Re: [hard drive] New HD failing
I am looking at Threshold 60 and other attributes numbers they should be 0's and HD running very hot 43C.
reply
pnjunction @ 26th Oct 07:54PM:
Re: [hard drive] New HD failing
Both drives say threshold 60. Also, the thresholds are fixed values if crossed by the actual information from the drive, might indicate a problem. Notice how the thresholds are exactly the same for both drives?
The things that have 0 threshold seem to be things that don't have thresholds per se, there's no number of power cycles that would indicate a problem it's just information.
The stats like read error rate, seek error rate and spinup retry count that would indicate real problems are all zero.
I don't know where you get 43c from the data it's showing, but that's not even that high of a temperature.
In summary, maybe you should just not bother looking at SMART data, or just stick to looking at the OK's down the side.
reply
Oleg @ 26th Oct 08:00PM:
Re: [hard drive] New HD failing
Thank so much for the answer pnjunction so this is normal and 43c coming from Everest reading.
reply
anon @ 27th Oct 12:39AM:
Re: [hard drive] New HD failing
110°F is "very hot" ????
reply
Anonymous_ @ 27th Oct 03:06AM:
Re: [hard drive] New HD failing
said by Oleg :
I am looking at Threshold 60 and other attributes numbers they should be 0's and HD running very hot 43C.
lol @ 43C my are running at 50 to 60C for the past 5 years
reply
pnjunction @ 27th Oct 03:16AM:
Re: [hard drive] New HD failing
said by Anonymous_ :said by Oleg :
I am looking at Threshold 60 and other attributes numbers they should be 0's and HD running very hot 43C.
lol @ 43C my are running at 50 to 60C for the past 5 years
I was going to say Speedfan is good, you can click 'in-depth analysis' and it gives you a bit more info on the values like 'watch', 'normal' or 'very good' compared to other drives in their database.
It told me to 'watch' my drive temperatures because one of my enclosures (7200.10 250GB) is 48C and the average for the drive is 37C. It told me to watch my new enclosed one (Barracuda LP 5900.12 1.5TB) as well because it is 41C. My internal drives are in RAID so no SMART data.
I don't think my temps are that bad though, but man 60C seems a bit much isn't it? You have those things running in an oven mitt? ;)
reply
Oleg @ 27th Oct 11:33AM:
Re: [hard drive] New HD failing
So 40C-50C are normal temps for external aluminum enclosure or any enclosure :)
reply
pnjunction @ 27th Oct 11:40AM:
Re: [hard drive] New HD failing
said by Oleg :
So 40C-50C are normal temps for external aluminum enclosure or any enclosure :)
I would think so, unless you get a fancy one with cooling.
Depends on the drive. My 7200.10 was 48C in both the enclosures I've had it in. The 5900.12 LP is 41C.
I would recommend an LP/Green drive for an enclosure, especially if you're going to use USB which will limit the speed anyways. In my case it is eSATA but the LP drive is still plenty fast, I benched it at 117/77/97 MB/s max/min/avg and 12.4ms access time.
Seagate has done a good job with their LPs, mine is much faster than the old 7200.10 (80/40/64 and 13.2ms) thanks to higher platter density and improved design.
reply
Oleg @ 27th Oct 02:33PM:
Re: [hard drive] New HD failing
Ops forgot to say what my brand of enclosure is it's Cavalry CAXA3701T0 1TB »www.newegg.com/Product/Product.a···XA3701T0
reply
Oleg @ 27th Oct 02:38PM:
Re: [hard drive] New HD failing
Very strange i have just turned it on again and it's running at 22C :huh:
reply
Oleg @ 27th Oct 02:41PM:
Re: [hard drive] New HD failing
No it has jumped to 25C :hmm:
reply
Tursiops_G @ 27th Oct 05:30PM:
Re: [hard drive] New HD failing
Forget the Temp readings... Let it warm up for awhile, then hold the enclosure between your hands... Comfortably Warm is OK, but If it's Uncomfortably *HOT* to the Touch, then perhaps consider getting an external enclosure that has forced air (fan) cooling.
I have a 1TB SATA WD Caviar "Green" drive in a $25 No-Name, unvented external USB enclosure. It runs warm, but still cooler than the back of my LCD Monitor... ;)
-Tursiops_G.
--
If You're Unsure, "RTFM"... If You're SURE, "RTFM" Anyway. ;)
reply
Oleg @ 27th Oct 05:40PM:
Re: [hard drive] New HD failing
1T for $25 where did you find it so cheap?
Well i let it warm up and it stays at 43C.
reply
Tursiops_G @ 27th Oct 05:51PM:
Re: [hard drive] New HD failing
$25 was the cost of the bare USB Enclosure itself.
The WD Green 1TB drive cost $79 at a local Computer Fair. :)
-Tursiops_G.
--
If You're Unsure, "RTFM"... If You're SURE, "RTFM" Anyway. ;)
reply
Oleg @ 27th Oct 05:55PM:
Re: [hard drive] New HD failing
said by Tursiops_G :
$25 was the cost of the bare USB Enclosure itself.
The WD Green 1TB drive cost $79 at a local Computer Fair. :)
-Tursiops_G.
Great deal :-)
reply
koitsu @ 27th Oct 10:15PM:
Re: [hard drive] New HD failing
Oleg, can you please post SMART stats for your disks using a utility that's well-maintained? The utility you're using doesn't appear to understand vendor one-offs for attribute encoding. Secondly, it looks as if you're reading the SMART data wrong (this is quite common these days).
Please post data for your disks taken from smartmontools. Install, fire up a cmd.exe window, then run "smartctl -a /dev/hdX" where X is a letter such as "a" for the 1st disk you have, "b" for the 2nd, etc...
Getting the output will be somewhat annoying given how "wonderful" cmd.exe is, so you may want to redirect the output to separate files, e.g.:
smartctl -a /dev/hda > drive1.txt
smartctl -a /dev/hdb > drive2.txt
Etc...
Once you provide those I can look over the SMART stats and tell you if anything is of concern, and teach you how to read the statistics shown.
--
Making life hard for others since 1977.
I speak for myself and not my employer/affiliates of my employer.
reply
Oleg @ 28th Oct 11:46AM:
Re: [hard drive] New HD failing
Thanks koitsu i will defiantly post status when i get back home from work :)
reply
Oleg @ 30th Oct 06:21PM:
Re: [hard drive] New HD failing
Thanks to koitsu i have checked my C:\ drive and it looks fine as for USB external drive it does not work with it.
reply
koitsu @ 31st Oct 04:02AM:
Re: [hard drive] New HD failing
smartmontools for Windows can't talk to USB devices, because it (smartmontools) lacks the code to passthrough raw SMART requests via USB. I believe there are some Windows programs out there which can get SMART data from a USB-connected drive, but they likely don't provide all of the information smartmontools does.
With regards to the drive connected via SATA or PATA -- please re-run the command I asked you to using the "-a" flag (lowercase a, not capital a). I need to see the SMART event log.
--
Making life hard for others since 1977.
I speak for myself and not my employer/affiliates of my employer.
reply
Chip @ 31st Oct 07:53AM:
Re: [hard drive] New HD failing
said by Oleg :
1T for $25 where did you find it so cheap?
Well i let it warm up and it stays at 43C.
Max operating temp for that drive is 60c. Your ok
--
The three great strategies for obscuring an issue are to introduce irrelevancies, to arouse prejudice, and to excite ridicule--Bergen Evans
reply
koitsu @ 1st Nov 08:32PM:
Re: [hard drive] New HD failing
Oleg sent me a private message with the full SMART output for his C: drive:
»www.filedropper.com/drivec
Rather than respond privately, I felt disclosing how to read the stats publicly would benefit others. So, here's the analysis:
This tells me the disk is SATA, and that the drive model/revision *is not* in smartmontools' internal database. The fact that it's not in the database means some Attributes may not be decoded correctly; I'll talk more about this at the end of my post.
Let's check the overall "global status" for SMART:
Okay, good.
Let's look for signs of bad sectors on the disk:
Attribute 5 (Reallocated_Sector_Ct) tells you how many times the drive has detected a bad sector and transparently (and successfully) remapped it to a spare. Your OS wouldn't know of this remapping, but the OS may have seen an I/O error or timeout during the remapping phase (it depends -- too many cases to discuss here). Either way, RAW_VALUE is 0, so that indicates this hasn't happened.
Attribute 196 (Reallocated_Event_Count) tells you how many times the drive detected a bad sector, regardless of remapping being successful or failing. Again, RAW_VALUE is 0, so things look good.
Attribute 197 (Current_Pending_Sector) tells you how many sectors on the drive are pending remapping, or at least that's how I understand this attribute to function. :-) Again, RAW_VALUE is 0 -- good.
Attribute 198 (Offline_Uncorrectable) tells you how many times the drive has failed to remap sectors it thought were bad. The important thing to note about this Attribute is that its UPDATED column is labelled Offline. This indicates that the value will only be updated if you run an offline test. Don't let the word "offline" scare you -- you can actually run a test while the disk is in use (I/O operations take priority over SMART tests); you can run such a test by doing "smartctl -t short {disk}" or if you want a more thorough test, use "smartctl -t long {disk}". "short" is sufficient to update this attribute most of the time.
Let's check thermals:
Attribute 194 (Temperature_Celsius) indicates your drive is presently running at 31C. The drive, during its lifetime, has seen a minimum of 10C and maximum of 49C. It's important to note that the Lifetime Min/Max values shown are sometimes not cleared when a drive leaves the factory, and other times aren't decoded correctly (when the drive isn't in smartmontools' internal database; again, more on that later). Either way, 31C is excellent (cold).
Let's check drive spin-up/spin-down and power-cycle counts, and overall drive lifetime, since you state this is a brand new disk:
Attribute 9 (Power_On_Hours) indicates the drive has been powered up during its lifetime for a total of 3442 hours (143 working days). This is very surprising for a drive that you state is brand new.
Attribute 4 (Start_Stop_Count) tells you how many times the drive has been told to spin down or spin up, including when the system has been powered on or shut off. RAW_VALUE says 1626 times, which is again surprising for a brand new drive.
Attribute 12 (Power_Cycle_Count) tells you how many times the drive has been powered cycled (e.g. how many times the drive has been powered on, such as when you turn your PC on but not necessarily when you hit Reset). RAW_VALUE says the drive has been been powered on 1610 times -- same thing as before (surprising).
I do not know how to read Attributes 192 and 193. :-)
First question: is this machine being powered off regularly, or going into suspend/sleep mode (not the monitor -- I mean the entire machine), or do you have your Power Policy Setting in Windows set up to power down the disks after N minutes of being idle? If so -- okay, those values are normal then.
Second question: when did you purchase this disk, and how long has it been in use? You stated "New HD" in your post, that's why I ask. If this drive was purchased within the past couple weeks as new, then whoever sold you the drive sold you something that had been used in the past -- meaning you got sold a used disk.
Now for the bad news...
Attribute 199 (UDMA_CRC_Error_Count) tells you how many times the drive has detected a CRC error for any sort of data I/O on the drive. RAW_VALUE says 86 CRC errors have been witnessed during the lifetime of this drive. Because we don't know if this drive is TRULY brand new, it may have happened in the past (e.g. when it wasn't in your machine).
These errors are correctable -- that is to say, if the drive witnesses a CRC error, it tells the SATA controller "hey that last command/data you sent me was corrupt, try it again". CRC (checksum) errors can happen for all kinds of reasons, but the most common reason is bad cables or cables which are too long. They could also be the sign of a SATA controller going bad, or crazy-insane interference inside of your case (I've never heard of this happening, but I suppose it's possible).
Next you'll understand why I asked you to use "-a" to view the SMART error log:
This is an indication of some form of internal SMART error witnessed on the drive. Sadly, smartmontools does not know how to decode the SMART error log commands for this particular drive. Usually there are READ_DMA or WRITE_DMA indications in the log, but in this case we have "VENDOR SPECIFIC".
Don't let the fact that there are 5 lines shown here scare you; there are numerous commands that get sent to/from the drive during an I/O transfer. The SMART error log helps tell you what happened shortly prior to the error (e.g. "I was told to do this, I did this, then I was told to do this, and the error happened").
To decode these, I would have to get a full technical specifications document from Hitachi and decode the hexadecimal values shown on the left side. Drive vendors do not usually release this for consumer (PATA/SATA) drives, but do provide it for SCSI drives (and the error reporting mechanism in SCSI is entirely different anyway).
What's of importance is that there is a single error seen, and it happened at the 2057 lifetime hours mark. Remember that Attribute 9 (Power_On_Hours) was at 3442 when you ran smartctl; so, the error happened in the past.
Finally, discussing the data stored in SMART and how to read data:
Relying on RAW_VALUE is not always possible or wise. There is no "standard" method to reading this data, but smartmontools tries to do its best (by having a database of drive models/revisions and knowing how to decode the data for each drive), which is more than I can say for other SMART software.
You see, the data stored in RAW_VALUE can be in any format -- there is no industry standard. There's a recommended format for them, but vendors often store the data in a proprietary format. For example, some Seagate disks will show a very high RAW_VALUE for Attribute 1 (Raw_Read_Error_Rate), which people see and "freak out" over when actually it's 100% normal -- just that Bruce Allen (author of smartmontools) hasn't managed to figure out the encoding format for the data yet.
In this particular case, it's best to look at the fields labelled VALUE, WORST, and THRESH. These are what are called "adjusted values", indicating a formula is applied to the RAW_VALUE data (since the drive firmware itself knows how to decode the data) and is then turned into something that's considered an "health status" value.
VALUE is what the current adjusted value is, WORST is the worst value ever seen, and THRESH is the threshold where if reached the overall-health self-assessment test result will go from PASSED to something like FAILED. I can show you what a failure look like if you want.
The first thing you'll notice about THRESH is that they're set absurdly low. For example, your drive has 86 CRC errors, yet VALUE is 200, WORST is 200, and THRESH is 000.
What does this mean? It means drive manufacturers set their SMART health thresholds to absolutely absurd values; a failing drive will often show PASSED for its overall-health self-assessment. Isn't it great? Hitachi, Seagate, Western Digital, Fujitsu, blah blah blah... they all are like this. The thresholds are set way too low for them to be effective.
So what's the status of your C: drive?
It honestly looks fine, although I don't know how "new" it is. I'd say this drive has been in use for quite some time. You might want to replace your SATA cables "just in case", but otherwise your drive shows no sign of problems.
I have a feeling a ton of people are going to thumbs-up this post. *laugh* I spend a lot of time with disks since I'm a UNIX SA, and "messing with disks" is a kind of forte of mine.
--
Making life hard for others since 1977.
I speak for myself and not my employer/affiliates of my employer.
reply
Vamp @ 1st Nov 08:38PM:
Re: [hard drive] New HD failing
The only data in the SMART data you need to be concerned about is the raw data, the thresholds and limits are just BS values IMO.
As long as reallocated sector count and read/write error counts are at 0 you have nothing to worry about, when you start seeing any raw data for realloc then you can pretty much recover whatever data you can then toss the drive because it's failing.
--
20/20 FIOS || MSN Msgr: scott001^gmail_com
reply
Oleg @ 1st Nov 08:51PM:
Re: [hard drive] New HD failing
Thanks so much koitsu for your answers.This test was for the old HD,because i was unable to do test on the new drive connected using USB cable.
reply
koitsu @ 2nd Nov 12:47AM:
Re: [hard drive] New HD failing
said by Vamp :
The only data in the SMART data you need to be concerned about is the raw data, the thresholds and limits are just BS values IMO.
This is simply untrue. When the RAW_VALUE portion looks absurd or unreasonable (due to what I described -- see the bottom portion of my long explanation), the only thing you have to go off of are VALUE/WORST/THRESH. That's simply reality.
--
Making life hard for others since 1977.
I speak for myself and not my employer/affiliates of my employer.
reply
Thank you for using lo-fi dslreports.com - report bugs
© 99-2009 silver matrix LLC