Disabled Drives on TWO Unraid Systems


Recommended Posts

Wow,

 

I've got crazy issues with Unraid all of a sudden.  I have two machines and they both have failed drives according to the Main screen.  :'(

 

I hope someone knows what to do to assist.  I'll update here if I find more information as I try to interpret my logs (but I don't really know how).

 

Thanks,

 

Russell

zurlo-diagnostics-20170118-2041.zip

palmwood-diagnostics-20170112-2249.zip

Link to comment

On Palmwood it looks like my HGST drive ending in RHS has the following error:

 

smartctl 6.5 2016-05-07 r4318 [x86_64-linux-4.4.30-unRAID] (local build)

Copyright © 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

 

=== START OF INFORMATION SECTION ===

Vendor:              /0:0:0:0

Product:             

Compliance:          SPC-5

User Capacity:        600,332,565,813,390,450 bytes [600 PB]

Logical block size:  774843950 bytes

scsiModePageOffset: response length too short, resp_len=47 offset=50 bd_len=46

scsiModePageOffset: response length too short, resp_len=47 offset=50 bd_len=46

>> Terminate command early due to bad response to IEC mode page

A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.

 

 

On Zurlo it looks like my Seagate drive ending in YB5 has the following error:

 

smartctl 6.5 2016-05-07 r4318 [x86_64-linux-4.4.30-unRAID] (local build)

Copyright © 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

 

=== START OF INFORMATION SECTION ===

Vendor:              /6:0:0:0

Product:             

Compliance:          SPC-5

User Capacity:        600,332,565,813,390,450 bytes [600 PB]

Logical block size:  774843950 bytes

scsiModePageOffset: response length too short, resp_len=47 offset=50 bd_len=46

scsiModePageOffset: response length too short, resp_len=47 offset=50 bd_len=46

>> Terminate command early due to bad response to IEC mode page

A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.

 

 

 

Sure seems strange to have the same problem on different make of drives on two different systems at the same time... Yikes!

 

I hope a guru can help me.  :)

 

Russell

 

Link to comment

Swapping out cables, I've got no changes on Zurlo...

 

But on Palmwood, I was able to get a little difference:  Now Disk 3 (ZUT) is green, but Used/Free space shows "Unmountable" and Disk 5 (RHS) as "Not Installed."  Palmwood is making some shares available.

 

Hoping for some advice.  Do I dare try to do some emergency backup?  (These systems were supposed to be backup to each other - I have offsite backup too, but it isn't complete.)

 

Shutting down both systems as I wait for advice.  :)

 

Russell

Link to comment

For Palmwood:

 

Disk5 needs a new SATA cable, almost certainly the reason it dropped offline and got disable:

 

199 UDMA_CRC_Error_Count    0x000a   053   053   000    Old_age   Always       -       7914732

 

Disk4 also has abnormal number of CRC errors:

 

199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -       116

 

Other disks also show some errors from 4 or 5 to 50, these are not normal and cable need to be replaced if the attribute increases by 2 or more.

 

Disk3 is missing, powerdown, check/replace both cables and power back on.

 

After checking/replacing at least the cables for disk 3 and replacing the SATA cable for disk5 power back on and post new diags.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.