Device disabled again


Recommended Posts

Somehow, I continue to have issues using Unraid. After around one week, one disk typically disables. I am at a loss here what is driving this and previous attempts have failed to show any improvement (e.g., changed controller card, changed PSU). Any idea what could be driving this? Diagnostic attached. Thanks in advance!

 

There are masses of Read and Write Errors throughout the log against Disk 2: (sdn) WDC_WD60EZRX-00MVLB1_WD-WX11D741AYXK

 

Example:

 

 

Read Errors

Feb 23 11:16:16 Tower kernel: md: disk2 read error, sector=10602791888
Feb 23 11:16:16 Tower kernel: md: disk2 read error, sector=10602791896
Feb 23 11:16:16 Tower kernel: md: disk2 read error, sector=10602791904

Write Errors:

Feb 23 11:17:06 Tower kernel: md: disk2 write error, sector=10602791888
Feb 23 11:17:06 Tower kernel: md: disk2 write error, sector=10602791896
Feb 23 11:17:06 Tower kernel: md: disk2 write error, sector=10602791904

 

The SMART report for the Disk looks fine and dandy. I'd try this:

 

Shutdown. Check the physical connections to the drive. Power on. Post a new Diagnostic File.

Link to comment

Thanks a lot. I have shut down and then unplugged and plugged in the cables again. Then restarted. Please see attached new diagnostic files. Any new ideas? Thanks a ton for looking into it for hints what keep causing this issue.

 

Nothing relating to disk2 yet. Too soon I think. Try and write / read from disk 2. Nothing important mind you BUT of reasonable size. Then post new diagnostic zip.

Link to comment

Disk 2 is disabled, so I cannot read / write. Or possible to mount it and then do?

 

Ah, of course it disabled because it failed a write. So you have a Red X (V6.x) right? I am not inclined to advise you to re-enable the disk until we are sure that it is / was a bad connection. Not being a prat here just trying to take the cautious approach.

 

I am combing through your diagnostic files right now and I don't see any other issue with disk2 so I am inclined to think bad connection / cable etc. Let's see if anyone else thinks differently.

Link to comment

Got it, thanks. What is Red X?

 

I have never had a failed drive before. I know that in v5 the drive status indicator for a disabled drive was always a red ball (red balled drive). As of v6.x I thought it had been changed to a red X.

 

Some more reading BTW re failed disks: http://lime-technology.com/wiki/index.php/Troubleshooting

 

Be careful before following some of that. I still say lets see if some of the more experienced users chime in and what they say.

 

 

Link to comment

I notice that the disk causing problems appears to be a WD 6TB drive?  I have some WD 6TB Red drives that would periodically drop offline with no apparent cause resulting in the red 'x' appearing.    I found that if I set them to never spin down then the problem went away and the system is rock solid.  The strange thins was that it did not appear to affect all of the WD 6TB Red drives that I had - just a couple of them.

Link to comment

I can well imagine that this may actually be the issue. How did you solve it? Letting the disk running 24/7?

Yes.  Not the ideal solution, but keeping the system completely stable was worth more to me than the minor power savings from spinning the affected drives down.

 

I suspect that anyway the cost of having to regularly do rebuilds because a disk had been red-balled (which thus spun up all drives) was more than the cost of keeping a few drives spinning all the time.

Link to comment

Got it, thanks. I was less worried about the cost, but more of the lifetime of the HD assuming it spins 24/7.

I believe that spinning the disks up/down is particularly expensive in terms of lifetime cost so keeping them spinning is not an issue.

You keep all your WD disks spinning or just one? Did you try to replace it or is the "slot" the issue rather than the disk?

I keep all of them spinning (I have a total of 7 WD 6TB disks) although I think the problem may be isolated to 3 of them.  It is a trade-off as to whether to keep them all spinning or doing the work to be sure which ones have an issue and having to do a rebuild every time something goes wrong.  As far as I could tell the issue followed the disk rather than the slot although I must admit I did only limited testing on this.  Interestingly enough I also have some WD disks of other sizes in the array and they do not seem to show the issue.
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.