Got it

Physical Port goes down after NE40E upgrade to V600R009C20SPC600

Latest reply: Apr 3, 2022 06:52:37 286 19 11 0 0

Hi Community !


It is a pleasure to share with you my experience on tackling an issue related to a software upgrade operation leading to a physical port changing status from up to down between a Huawei NE40E router and a Cisco switch. Enjoy the read below in order to get more insights about the subject.



ISSUE DESCRIPTION


The Network Operations Center (NOC) reported an incident involving many datacenter services that went down. On the customer side, they were unable to access some applications, short codes, could not make voice calls and this greatly impacted the experience leading to loss of revenue. This happened during an upgrade operation.



HANDLING PROCESS


Considering this was very critical, the following steps were taken :


STEP 1 : The NOC immediately opened an incident ticket with severity tagged as critical and assigned it to the Back office datacom team for investigation and resolution.


STEP 2 : The Backoffice team confirmed there was an upgrade operation and quickly informed the team of Engineers that were handling the operation for fast investigation and resolution.


STEP 3 : The team of Engineers incharge of the operation and they immediately started looking at logs on the NE40E using the "display logbuffer" command. It was noticed the interface Gx/y/z connecting the cisco switch was down. It had changed state after the operation.


display logbuffer 01


The down reason was indicated to be PCS_unLock, AutoNegotiation_Fail indicating issue is with peer device.


Checking the interface with the "display interface" command showed the interface went down 


display interface


STEP 4 : Considering the down reason was related to the peer device, the next step was to log in to the peer device and check the logs. The peer device here was a cisco device. The logs specific to this event had the output below.

show logging

A loopback error was detected on the cisco switch and put the interface on the error disable state. This made the interface to be move from up to down status.


STEP 5 : After checking the logs from the Huawei NE40E and the Cisco switch, our next move was to look at the configurations of the Huawei router to detect what was causing the loop.

Router interface config

STEP 6 : Considering it was a loop that made many other services indicated to be down, we removed the VSI configuration on the interface using the "undo l2 binding vsi" command

This brought back the interface up but just the service linked to the vsi was down.


STEP 7 : At this point where the level of criticity was low, we now had to build the same environment on a test bed to investigate why the issue occured. Same results were obtained with V600R009C20SPC600. 



ROOT CAUSE


The mac-forwarding table of the Network Processor for the NE40E was checked and was incorrect.

When the Cisco switch sends a keep-alive message (used to detect loop) to the NE40E, the NE40E sends the keep-alive message back to the Cisco switch, so the switch will detect a loop in the network and changes its interface to error disable state (down state)

Keepalive message


Packet capture on the Cisco switch shows source and destination mac are the same (loop).


This is a VRP issue with the V6R9 version where the NE40E receives packets with the same source and destination MAC in the Ethernet header making the MAC forwarding table incorrect on the Network Processor chip which triggers packet forwarding back to the Cisco switch causing the loop.


TEMPORAL SOLUTION 


  1. Move the VSI configuration from the main interface to a sub-interface on the NE40E

  2. Stop the sending of keep-alive message on the Cisco switch by using the "no keepalive" command



SOLUTION 


The bug was corrected in the patch release : V600R009SPH018

The post is synchronized to: Author group

Lucfabrice
MVE Author Created Mar 31, 2022 09:55:51

Your feedback on this post will be highly welcome. Thanks.
@olive.zhao @Irina @dragos_v @BAZ @wissal @umaryaqub @Rumana @ander.sanchez @Lan59 @nuchi @Vlada85 @hemin88 @Unicef @little_fish @gzzz @nochhie @chantha @smileymind @Navin_kay @user_4358465 @Majdi.Chebil @AndresMoreno @shakeela @phuta @Ayeshaali @user_4001805 @VinceD @lucian2003 @dengdengdeng @Sara_Obaid @AL_93 @MahMush @Y_T_Z @Kevin_Thomas @Saqib123 @bobi @richie9999 @user_4359501 @MesayW. @Khalid_Gul @LilStylz237 @MMshaikh @Laiheang @Chanbora @Sokrin @simchamnan @user_4237671 @Abdussamed @andersoncf1 @Herediano @Vesper_EvenStar @taha_29four @user_4326135 @Assis_bsb @Serges_armel @sachandio @hamza11 @mouh1991 @Tiplu @Null_0 @Tongun @Haseeb_Haris @Diego.Silva @Caroline_Herrera @kunthea @Somemeow @Anno7 @chenhui @jason_hu @Popeye_Wang @alopez @Chenxintao @E.DR_91 @stephen.xu @DDSN @Malik3000 @Zemo_Mccracken @adrian_alucard @Precious @Kwesi @imransumayari @abdul_basit7233 @Andre_G @Murat87 @LucianoNhantumbo @Vien @titusmahwe @DragonVN @Zebra @thisu @Funstuf @DKetrari @4TEch @rkahya_4 @scidox @faysalji @user_3134129 @SamB @mustafa211 @rimon @RajK @Funstuff @Abrar_Akbar @Kh_Elias65 @James_Nel @Zonger @Hurr @15393597009 @safecity @LeeMARK @jerry_zhuzi @bruno.guedes @Kashif @DrDoom @mrppa @sliawatimena @daniellima @thibay @maithi @hanhcao @wonderj @mytruc @huyvan @manpham @Imnh @hugu @nagu @sam_san @NTan33 @Faridrami @I_Am_Batman @amr_rashedy @Ignatius @Saqibaz @user_4252339 @Satya_Syam @Vijji @user_4413531 @Wieczorekcool @user_4400653 @Sirajs @Dia0205 @abdelali @Irshadhussain @cmarban @javaid100 @Natan_Oliveira @backwaves @alexander.grosello @Confucius @Soliman_Mohammed @sohaib.ansar @csk99 @OneDan @bek7 @Farah_O @AymanOT @Asimsaad @Salah @gabo.lr @Mr.Jack @Steffy @h89151 @Alibaba8000 @SidzHuawei
View more
  • x
  • convention:

taha_29four
taha_29four Created Mar 31, 2022 10:42:01 (1) (0)
You're getting better at posting and writing, contgrats  
Lucfabrice
Lucfabrice Reply taha_29four  Created Mar 31, 2022 13:53:57 (0) (0)
Thanks for the remark @taha_29four  
lucian2003
lucian2003 Created Mar 31, 2022 16:37:23 (1) (0)
 
rkahya_4
rkahya_4 Created Apr 1, 2022 08:10:34 (1) (0)
 
lucian2003
lucian2003 Created Apr 4, 2022 00:39:36 (1) (0)
 
Its always good to share real technical issue with solutions. Thanks
View more
  • x
  • convention:

Lucfabrice
Lucfabrice Created Mar 31, 2022 13:54:19 (0) (0)
Thanks @faysalji  
BAZ
BAZ Created Mar 31, 2022 20:18:15 (1) (0)
That's great  
Lucfabrice
Lucfabrice Reply BAZ  Created Apr 3, 2022 05:07:42 (0) (0)
Thanks @BAZ  
thanks for sharing this useful post
View more
  • x
  • convention:

lucian2003
lucian2003 Created Apr 1, 2022 17:29:21 (0) (0)
 
Lucfabrice
Lucfabrice Created Apr 3, 2022 05:08:22 (0) (0)
Thanks @Zonger  
Thanks for share Physical Port goes down after NE40E upgrade to V600R009C20SPC600-4832971-1
View more
  • x
  • convention:

Lucfabrice
Lucfabrice Created Apr 3, 2022 05:08:51 (0) (0)
Thanks @4TEch  
great one
View more
  • x
  • convention:

Lucfabrice
Lucfabrice Created Apr 4, 2022 04:17:16 (0) (0)
Thanks @Anno7  
Good share
View more
  • x
  • convention:

Lucfabrice
Lucfabrice Created Apr 4, 2022 04:17:45 (0) (0)
Thanks @thisu  

Comment

You need to log in to comment to the post Login | Register
Comment

Notice: To protect the legitimate rights and interests of you, the community, and third parties, do not release content that may bring legal risks to all parties, including but are not limited to the following:
  • Politically sensitive content
  • Content concerning pornography, gambling, and drug abuse
  • Content that may disclose or infringe upon others ' commercial secrets, intellectual properties, including trade marks, copyrights, and patents, and personal privacy
Do not share your account and password with others. All operations performed using your account will be regarded as your own actions and all consequences arising therefrom will be borne by you. For details, see " User Agreement."

My Followers

Login and enjoy all the member benefits

Login

Block
Are you sure to block this user?
Users on your blacklist cannot comment on your post,cannot mention you, cannot send you private messages.
Reminder
Please bind your phone number to obtain invitation bonus.