[Problem Description]
Application Scenario
S5700 switches run V200R008, V200R009, or V200R010 and set up a stack.
Trigger Condition
l S5700 switches run V200R008, V200R009, or V200R010.
l Two or more S5700 switches set up a stack.
Symptom
There is a low probability that the stack of S5700 switches splits or restarts.
Identification Method
1. Run the display device command to check whether the device model is S5720SI, S5720S-SI.
<HUAWEI>display device S5720-52X-PWR-SI-AC's Device status: Slot Sub Type Online Power Register Status Role ------------------------------------------------------------------------------- 0 - S5720-52X-PWR-SI Present PowerOn Registered Normal Standby PWR1 POWER Present PowerOn Registered Normal NA PWR2 POWER Present PowerOn Registered Normal NA 1 - S5720-52X-PWR-SI Present PowerOn Registered Normal Master PWR1 POWER Present PowerOn Registered Normal NA PWR2 POWER Present PowerOn Registered Normal NA |
2. Run the display version command to check whether the software version is V200R008, V200R009, or V200R010.
<HUAWEI>display version Huawei Versatile Routing Platform Software VRP (R) software, Version 5.170 (S5720 V200R010C00SPC600) Copyright (C) 2000-2016 HUAWEI TECH CO., LTD |
3. Run the display reboot-info command to check whether a reboot event with the cause of OTHER exists.
<HUAWEI>display reboot-info Slot ID Times Reboot Type Reboot Time(DST) =========================================================================== 0 1 OTHER 2018/04/17 11:51:08 0 2 POWER 2018/04/07 08:12:55 |
4. Run the display stack trace nvram command in the diagnostic view to check the time when the stack split and whether abnormal packet sending or receiving records exist.
[HUAWEI-diagnose]display stack trace nvram 2018-04-17 03:51.910 DST:Stack port 2 does not receive any hello packet for 15 second(s). 2018-04-17 03:51.910 DST:Stack port 1 does not receive any hello packet for 15 second(s). 2018-04-17 03:51.860 DST:No SPDU packet received from the master (slot 0). 2018-04-17 03:51.900 DST:Stack port 2 does not receive any hello packet for 10 second(s). 2018-04-17 03:51.890 DST:Stack port 1 does not receive any hello packet for 10 second(s). |
5. Run the display diag-logfile log.dblg command in the diagnostic view to check whether the log of "Reset for critical task FSP has not been scheduled within 70 seconds" exists.
[HUAWEI-diagnose] display diag-logfile log.dblg … 18-Apr-17 11:55:17.499.2+02:00 DST HUAWEI 01SSP_ADP/4/TASKSCHEDULE(D)[1438]:Slot=0;Reset for critical task FSP has not been scheduled within 70 seconds(slot : 0). the callstack: ->(0xb6e8e518) ->(0xb6e88b00) ->(0xb6e05d94) |
[Root Cause]
An error in the chip source code causes the packet receiving task to enter an abnormal state. As a result, the stack task (FSP) is blocked and the stack splits.
[Impact and Risk]
The stack splits, and automatically restarts and recovers after a period of time. During this period, services will be interrupted if inter-card link protection is not configured.
[Measures and Solutions]
Recovery Measures
Services can be restored automatically after the stack resets.
Workarounds
None
Solution
1. Install the patches according to the following table.
Device Model | Software Version | Patch Version |
S5700S-X-LI, S5720-SI, S5720S-SI | V200R008C00SPC500 | Load the patch of V200R008SPH018 or later. Or upgrade the software version to V200R010C00SPC600 and load the patch of V200R010SPH011 or later. |
S1720, S5700S-X-LI, S5710-X-LI, S5720-SI, S5720S-SI | V200R009C00SPC500 | Upgrade the software version to V200R010C00SPC600 and load the patch of V200R010SPH011 or later. |
S1720, S5700S-X-LI, S5710-X-LI, S5720-SI, S5720S-SI, S5720-LI, S5720S-LI | V200R010C00SPC600 | Load the patch of V200R010SPH011 or later. |
2. For V200R007, contact R&D personnel to confirm the hardware replacement solution.
