
DSTV VMs are down: please urgently assist, customer production is down

Latest reply: Jan 22, 2022 12:12:00

The customer asked for help accessing the FusionCompute VMs that were down. The affected VMs in FusionCompute are shown below: [image: 193249uowzdctcmmmzbq0h.png, FusionCompute]

After we checked with our expert, he found the following:

Problem Description

16 VMs are faulty and cannot be resumed on FusionCompute V1R5.

        

Problem Analysis

1.       We checked the E9000 and found that Blade 12 was abnormal, and that its network plane was abnormal.

2.       The OS log showed that the Mezz510 NIC had reported many UEs (Uncorrectable Errors). The whole card, including both ports, could not work, which caused the host to become abnormal.

3.       FusionCompute executed HA, but the HA task failed because the faulty VMs could not mount the datastore.

4.       We rebooted Blade 12, the VMs' HA task succeeded, and the service was recovered.

 

Root Cause

         Q1. Why did the HA task fail when Blade 12 was abnormal?

         A: The datastore built on FusionStorage was abnormal when HA was performed.

                  There are 3 ZK nodes in FusionStorage. FusionStorage can work normally when any one of them is faulty.

                  We logged in to FusionStorage and found that one ZK node (Blade 9) had been faulty since 2016.

                  Blade 12 is another ZK node, and it became faulty on 2019/1/17.

                  With 2 of the 3 ZK nodes faulty, the datastores on FusionStorage became faulty, which led to the HA failure.

                  Solution:

                  Recover Blade 9. Blade 9 was faulty in FusionStorage because of an incorrect VLAN configuration on the network port. We changed the port from VLAN 0 to VLAN 1, and Blade 9 returned to normal.
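
The reasoning in Q1 (3 ZK nodes, one failure tolerated, two failures fatal) matches a majority-quorum rule. The sketch below is only an illustration, assuming the ZK nodes follow a ZooKeeper-style strict-majority quorum. The function names are hypothetical and are not part of any FusionStorage tool or API.

```python
# Illustrative majority-quorum check (assumption: the ZK cluster needs a
# strict majority of healthy nodes, as in ZooKeeper). Not a FusionStorage API.

def has_quorum(total_nodes: int, faulty_nodes: int) -> bool:
    """Return True if strictly more than half of the nodes are still healthy."""
    healthy = total_nodes - faulty_nodes
    return healthy > total_nodes // 2


def max_tolerated_failures(total_nodes: int) -> int:
    """Largest number of faulty nodes that still leaves a strict majority."""
    return (total_nodes - 1) // 2


if __name__ == "__main__":
    total = 3  # number of ZK nodes reported in this case
    states = [
        ("all ZK nodes healthy", 0),
        ("Blade 9 faulty", 1),
        ("Blade 9 and Blade 12 faulty", 2),
    ]
    for label, faulty in states:
        status = "quorum kept" if has_quorum(total, faulty) else "quorum lost"
        print(f"{label}: {status}")
    print("failures tolerated with 3 nodes:", max_tolerated_failures(total))
```

Under this assumption, the cluster had been running with no spare margin since Blade 9 failed, so the next ZK node failure (Blade 12) was enough to take the datastores down.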

         Q2. Why was Blade 12 abnormal?

         A: Blade 12's Mezz510 NIC was faulty, with some UEs.

        

         Solution:

         We rebooted Blade 12, which temporarily recovered it.

         We suggest that you arrange some downtime in the future to upgrade the Mezz510 NIC firmware and driver on all of the blades to the newest version.



After this, everything was OK.

 



Unicef
MVE Created Jan 22, 2022 12:12:00

COOL