Step 1 Analysis the cause of reporting BD_STATUS alarms
1. According to the analysis from NE data, the NE replaced the TN18EFI card in slot 21 on July 25th; the NE reported the BD_STATUS alarm again on Jul 30th.
2. Checking the EFI card logs and found that at the time of 09:55 (+07:00) on July 30th, the TN18EFI card had an abnormal power-down reset. EFI was powered off, the NE lost association. All cards report BD_STATUS alarms at the same time.
3. The internal communication between each card processing in the OSN9800 UPS sub-rack is centralized on the TN18EFI board
As the TN18EFI was abnormally powered off, the communication of chip between the TN18EFI and the Lanswitch was abnormal. As a result, the NE lost association and other cards reported BD_STATUS alarms.
Step 2 Analysis of the cause of card fault
1. Since the sub-rack had been continuously replaced with spareTN18EFI cards, but the NE was still loss association and reported the BD_STATUS alarm. The faults cards in the previous period analyzed from the fault log data, card (022QQMD0H6003562)due to FPGA access abnormal and card (022QQMD0H6003264) faulty due to BCM53242 chip access abnormal that causing the NE to be association
2. Check the card that had been returned to the R&D.
Check the appearance of the device on the surface of the board one by one.
It is found that the 022QQMD0H6003562 barcode has obvious traces of salt spray corrosion on the card. The corrosion area is as follows:
Check the device bit number of the corroded area, which are U1016 clock driver, U4 clock driver, magnetic bead, capacitor, U1006 power chip, especially the U1016 chip part of the pin has obvious black burn marks.
After the device was eroded, the clock of the FPGA and LSW chip was abnormal.
According to the customer's photo inspection on the card that was replaced on July 24th, it was found that the replaced board also had the sign that the board 022QQMD0H6003264 also showed signs of corrosion by salt spray.
3. From the above analysis, it is confirmed that the cards that were replaced in January and July in the NE have obvious corrosion marks. The cards are corroded that cause signal sent to CPU, FPGA, BCM53242 and other key chip clocks and power supply abnormalities. An abnormal reset will cause the NE to be removed frequently and the BD_STATUS alarm will be reported on the board.
The abnormal loss association and the BD_STATUS alarm were reported on July 30. Suspected that the TN18EFI card is abnormal due to the same salt spray corrosion.
----End
Notice: To protect the legitimate rights and interests of you, the community, and third parties, do not release content that may bring legal risks to all parties, including but are not limited to the following:
Politically sensitive content
Content concerning pornography, gambling, and drug abuse
Content that may disclose or infringe upon others ' commercial secrets, intellectual properties, including trade marks, copyrights, and patents, and personal privacy
Do not share your account and password with others. All operations performed using your account will be regarded as your own actions and all consequences arising therefrom will be borne by you. For details, see " User Agreement."