Frequent disk failure - UCS C210 M2

heather_sessom
Level 1

Hello. We currently have about 12 Cisco UCS C210 M2 servers in production, and recently, about once a week, we have seen a disk either report a "Predictive Failure" or fail completely. Are there any known issues with this model and failing disks? I've seen field notice FN - 63499 (http://www.cisco.com/c/en/us/support/docs/field-notices/634/fn63499.html) and will start investigating that, but in the meantime I'm wondering if anyone else is having this issue with this model of UCS. The disks are all in either a RAID 1 or RAID 5 configuration, although it has always been a disk in a RAID 5 that has failed. Any help will be greatly appreciated.

6 Replies

Keny Perez
Level 8

Hi,

How do you recover from those failures? Do they clear after a server reboot, or have you always just replaced the disks?

-Kenny

Cisco has been sending us new disks each time. It's getting old running down to the data center, though, and the customer is concerned about the stability of the UCS platform.

Dragan Ilic
Level 4

I have a few of them in production and have had no problems so far. Which firmware are you running right now?

Maybe the disk manufacturer changed something :)

BR,

Dragan

HTH,
Dragan

We are running either firmware version 1.4(2) or 1.4(3). The issue seems to recur equally on both firmware versions.

This might be related to something called Punctured RAID: in summary, one disk failure causes cascading disk issues. Google it; it might be the root cause. What caught my attention, though, is that the issue you mention involves Predictive Failures. Those generally mean that the disk has run out of spare blocks, which should be expected if the disks were all deployed at about the same time.

It would be useful to open a TAC case for investigation and, if necessary, you may want to ask for an EFA (Engineering Failure Analysis) if you have seen that the issue is very consistent (80-90% of disks have the same issue).
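
Before opening the case, it may help to tally the failures so the pattern is easy to show. The following is only a rough sketch: the record layout, the server names, and the sample values are made up for illustration, and the 0.8 threshold is just the lower end of the 80-90% figure above.

```python
from collections import Counter

# Hypothetical failure log, one tuple per incident:
# (server name, RAID level, failure type). All values are invented examples.
failures = [
    ("ucs-01", 5, "predictive"),
    ("ucs-02", 5, "failed"),
    ("ucs-03", 5, "predictive"),
    ("ucs-04", 5, "predictive"),
]


def share_by(records, index):
    """Return the fraction of incidents per distinct value of one field."""
    counts = Counter(record[index] for record in records)
    total = len(records)
    return {value: count / total for value, count in counts.items()}


def is_consistent(records, index, threshold=0.8):
    """True if a single value of the field accounts for >= threshold of incidents."""
    shares = share_by(records, index)
    return bool(shares) and max(shares.values()) >= threshold


if __name__ == "__main__":
    print("By RAID level:", share_by(failures, 1))
    print("By failure type:", share_by(failures, 2))
    print("Same RAID level in most cases:", is_consistent(failures, 1))
```

With real data in place of the sample list, the same two helpers can be pointed at other fields (disk vendor, firmware version) if those are recorded.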

-Kenny

louisvandyk
Level 1

Hi

Did you ever get a resolution to this issue? We have 5 servers, all running BIOS ver 1.4.3f. Every few months I have a drive fail. If we reseat it, it often rebuilds and continues working, although sometimes the drive fails again in a week or two and then we get Cisco to replace it. The original disks were Seagate, but I see the latest replacement disks are now Toshiba.

But it makes me nervous - I have already had two fail at the same time, and rebuilding servers is not a fun pastime!

Thanks.
