Packet loss occurs during BGP route switchback Highlighted

Latest reply: Feb 18, 2020 08:49:04 114 4 5 1

Hello everyone,

I'd like to share with you a case about packet loss during BGP route switching.

Problem

http://dts.huawei.com/net/dts/fckeditor/download.ashx?Path=CoPqkvQ2gGTjdvYw87ROkcce71Eiue+LvI4jty5zsVhWhrHxJOCVbktqdfRfnwyf

As shown in the preceding networking diagram, the three devices are connected through Layer 3 Eth-Trunk interfaces. S12700_1 is using a NEP board. The OSPF process advertises the addresses of all interfaces (including loopback interfaces). 

IBGP peer relationships are established between S12700_2 and S12700_1, and eBGP peer relationships are established between S12700_2 and the tester Test2/2. BGP fast refresh function is enabled by default. 

Use Test2/2 to send traffic to S12700_2, and the IBGP routes are sent from S12700_2 to S12700_1. Use Test2/1 to send Layer 3 traffic to Test2/2. 

The traffic on the active path is sent to S12700_2 through Eth-Trunk11 of S12700_1, and the traffic on the standby path is sent to S6720EI through Eth-Trunk22 and then forwarded to S12700_2.

Simulate a situation in which the primary path goes Down (by shutting down Eth-Trunk 11 of S12700_2). Use the tester to send traffic. The traffic is switched to the secondary path. 

http://dts.huawei.com/net/dts/fckeditor/download.ashx?Path=CoPqkvQ2gGTjdvYw87ROkcce71Eiue+LvI4jty5zsVj6M4SrHKEZ5l9+LxccNVwq

At this time, open eth-trunk11 of S12700_2, switch traffic to the primary path. 

http://dts.huawei.com/net/dts/fckeditor/download.ashx?Path=CoPqkvQ2gGTjdvYw87ROkcce71Eiue+LvI4jty5zsVg2+iP40g0LWvG0RRx7E/Ik

Packet loss lasts for about 100 ms. 

http://dts.huawei.com/net/dts/fckeditor/download.ashx?Path=CoPqkvQ2gGTjdvYw87ROkcce71Eiue+LvI4jty5zsViSkhzMSu2j0g2S9Zmko3wk

In addition, it is found that packet loss also occurs when the BGP fast refresh function is disabled. However, the number of lost packets is less than that when BGP fast refresh is enabled. 

After the fault is rectified, the Eth-Trunk interface goes Up and immediately sends gratuitous ARP packets. However,  in consideration of security and attack prevention, the main interface does not learn gratuitous ARP packets. As a result, the local end fails to learn the ARP entry of the peer end. 

However, because the OSPF network type of the interfaces is P2, all packets are sent in multicast mode. Therefore, the OSPF neighbor relationship can be established without learning the ARP entry of the peer end, and the route from the peer end to OSPF can be learned.  So the BGP route is immediately iterated to the new outbound interface. However, the ARP entry of the outbound interface is not learned in time. As a result, packet loss occurs. The traffic recovers only after the ARP entry is learned.  And after BGP fast refresh is enabled, more packets are discarded because route switching is faster. 


Solution : 

Solution 1: Change the Layer 3 main interface on the live network to a VLANIF interface. Because the VLANIF interface supports gratuitous ARP learning, the VLANIF interface can immediately learn the gratuitous ARP from the peer end after the interface goes Up. In this way, packet loss does not occur. 

Solution 2: Change the OSPF network type to broadcast. After the change, DD packets are unicast during the OSPF process, which triggers ARP-miss and helps learn ARP entries in advance. In this way, no packet is lost during the switchback.


I hope it is of help to you.

  • x
  • convention:

chenhui
Admin Created Jan 23, 2020 03:49:26 Helpful(0) Helpful(0)

Thanks, this case explains nice and clear.
  • x
  • convention:

ejcastriver
Created Jan 23, 2020 17:01:20 Helpful(0) Helpful(0)

Thanks
  • x
  • convention:

lucian2003
MVE Created Jan 25, 2020 17:09:48 Helpful(0) Helpful(0)

Interesting, thanks to share
  • x
  • convention:

Hello%20friends%2C%20I%20am%20a%20Telecommunications%20and%20electronics%20engineer%20and%20I%20just%20graduated%20as%20a%20master%20in%20telecommunications%20systems.%20I%20work%20in%20the%20telecommunications%20company%20of%20Cuba%2C%20ETECSA.%20I%20am%2035%20years%20old%20and%20I%20attend%20the%20transport%20network%20in%20my%20province%2C%20which%20is%20mainly%20Huawei.
HK19
Created Feb 18, 2020 08:49:04 Helpful(0) Helpful(0)

Thanks!!
  • x
  • convention:

Network%20Analyst%2Cwith%20security%20and%20system%20backgoround.

Comment

Reply
You need to log in to reply to the post Login | Register

Notice Notice: To protect the legitimate rights and interests of you, the community, and third parties, do not release content that may bring legal risks to all parties, including but are not limited to the following:
  • Politically sensitive content
  • Content concerning pornography, gambling, and drug abuse
  • Content that may disclose or infringe upon others ' commercial secrets, intellectual properties, including trade marks, copyrights, and patents, and personal privacy
Do not share your account and password with others. All operations performed using your account will be regarded as your own actions and all consequences arising therefrom will be borne by you. For details, see " Privacy."
If the attachment button is not available, update the Adobe Flash Player to the latest version!

My Followers

Login and enjoy all the member benefits

Login and enjoy all the member benefits

Login