
Heartbeat issue on a two-node cluster

Latest reply: Dec 24, 2018 08:35:18


Abnormal Heartbeat State of HACS

Symptom

The HACS heartbeat status is abnormal.

 

Procedure

1. Check whether the heartbeat network is normal.

If the heartbeat network is abnormal, go to 2.

If the heartbeat network is normal or has been recovered, go to 5.

2. Recover the heartbeat network.

3. Log in to the active node and run the /usr/sbin/corosync-cfgtool -r command to recover the heartbeat service.

Run the /usr/sbin/corosync-cfgtool -s command to check whether the heartbeat status is normal.

If the heartbeat status is normal, go to 7. Otherwise, go to 4.

 

NOTE:

Normally, the status field of each heartbeat ring must read "active with no faults", and the id value must not contain 127.0.0.1 (except in a single-node system). The following output indicates that two heartbeat rings are configured and both are normal:

 

    Printing ring status.
    Local node ID 1
    RING ID 0
            id      = 10.10.11.11
            status  = ring 0 active with no faults
    RING ID 1
            id      = 192.168.11.11
            status  = ring 1 active with no faults

4. Log in to the standby node and run the /usr/sbin/corosync-cfgtool -r command to recover the heartbeat service.

Run the /usr/sbin/corosync-cfgtool -s command to check whether the heartbeat status is normal.

If the heartbeat status is normal, go to 7. Otherwise, go to 5.

5. Run the service corosync restart command on the active node to restart the service.

Run the /usr/sbin/corosync-cfgtool -s command to check whether the heartbeat status is normal.

If the heartbeat status is normal, go to 7. Otherwise, go to 6.

6. Run the service corosync restart command on the standby node to restart the service.

Run the /usr/sbin/corosync-cfgtool -s command to check whether the heartbeat status is normal.

If the heartbeat status is normal, go to 7. Otherwise, contact Huawei technical support engineers.

7. On both the active and standby nodes, run the crm_mon -fri 2 command to check the running status of the two-node cluster.

If both nodes are Online and all modules are in Started state, the two-node cluster is running properly, and the fault has been rectified. Otherwise, contact Huawei technical support engineers.
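
The check in the NOTE and the escalation order of steps 3 to 6 can be sketched in Python as follows. This is only an illustrative sketch, not part of the HACS tooling: the names rings_healthy, recover_heartbeat, and RECOVERY_STEPS are made up for this example, and run is a hypothetical callable that you would supply yourself to execute a command on the named node (for example over SSH) and return its output. The parser assumes the output layout shown in the NOTE.

```python
import re
from typing import Callable

HEALTHY = "active with no faults"
STATUS_CMD = "/usr/sbin/corosync-cfgtool -s"

# (node, command) pairs in the order of steps 3 to 6 of the procedure.
RECOVERY_STEPS = [
    ("active", "/usr/sbin/corosync-cfgtool -r"),
    ("standby", "/usr/sbin/corosync-cfgtool -r"),
    ("active", "service corosync restart"),
    ("standby", "service corosync restart"),
]


def rings_healthy(cfgtool_output: str, single_node: bool = False) -> bool:
    """Apply the rule from the NOTE to `corosync-cfgtool -s` output.

    Every ring must report "active with no faults", and no id may
    contain 127.0.0.1 unless this is a single-node system.
    """
    ids = re.findall(r"^[ \t]*id[ \t]*=[ \t]*(\S+)",
                     cfgtool_output, flags=re.MULTILINE)
    statuses = re.findall(r"^[ \t]*status[ \t]*=[ \t]*(.+)$",
                          cfgtool_output, flags=re.MULTILINE)
    # No rings found, or ids and statuses do not pair up: treat as abnormal.
    if not ids or len(ids) != len(statuses):
        return False
    if not single_node and any("127.0.0.1" in ring_id for ring_id in ids):
        return False
    return all(HEALTHY in status for status in statuses)


def recover_heartbeat(run: Callable[[str, str], str]) -> int:
    """Try steps 3 to 6 in order, checking the status after each one.

    `run(node, command)` is a hypothetical helper supplied by the caller.
    Returns the number of the step after which the heartbeat became
    normal, or 0 if none of the steps helped.
    """
    for step, (node, command) in enumerate(RECOVERY_STEPS, start=3):
        run(node, command)
        if rings_healthy(run(node, STATUS_CMD)):
            return step
    return 0
```

A return value of 0 corresponds to the "contact Huawei technical support engineers" branch of step 6. Any other value means you can continue with the crm_mon check in step 7.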


What is the meaning of HACS?

Can I check other statuses by running /usr/sbin/corosync-cfgtool -s?

What does "cluster" mean?

How long does it take to check the heartbeat status? Is it checked every second?
