Hello, everyone!
This case describes how to resolve multiple component alarms on a CH121 V5 blade server.
Problem Description
Alarms are generated for multiple components.
Problem Analysis
1. According to the SEL, an uncorrectable error alarm is generated for CPU 1.

2. After CPU 1 is replaced, alarms indicating that the mainboard is not installed are generated repeatedly. Further alarms then indicate that all DIMMs in the CPU 2 area fail to initialize and that communication with the RAID controller card is abnormal.

3. According to the logical structure diagram, the DIMMs in the CPU 2 area and the RAID controller card are connected to CPU 1 and CPU 2 through the mainboard. Because the mainboard is the common path for all of these components, it is very likely the faulty part.

4. According to the FDM log, no alarm is generated after CPU 1 is replaced. The BANK (PCU) is the QPI module. This indicates that the fault is caused by the BANK (PCU) or the mainboard.
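
The reasoning in step 3 (look for the part that every alarming component depends on) can be sketched in a few lines of Python. This is only an illustration: the dependency map below is a hypothetical simplification of the logical structure diagram, and the component names and the `common_suspects` helper are made up for this example, not taken from any Huawei tool or log format.

```python
# Hypothetical dependency map: each alarming item -> the parts its path relies on.
# The entries are an assumed simplification of the logical structure diagram.
DEPENDS_ON = {
    "mainboard presence": {"mainboard"},
    "CPU 2 DIMMs": {"CPU 1", "CPU 2", "mainboard"},
    "RAID controller card": {"CPU 1", "CPU 2", "mainboard"},
}


def common_suspects(alarming, depends_on, already_replaced=()):
    """Return the parts shared by every alarming item, minus parts already replaced."""
    shared = None
    for name in alarming:
        parts = set(depends_on[name])
        # Keep only the parts that every alarming item has in common.
        shared = parts if shared is None else shared & parts
    return (shared or set()) - set(already_replaced)


# Alarms seen after CPU 1 was replaced (step 2).
alarms_after_cpu_swap = ["mainboard presence", "CPU 2 DIMMs", "RAID controller card"]
suspects = common_suspects(alarms_after_cpu_swap, DEPENDS_ON, already_replaced={"CPU 1"})
print(sorted(suspects))  # ['mainboard']
```

With these assumed dependencies, the only part common to all three alarms is the mainboard, which matches the conclusion of this case.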

Root Cause
Multiple component alarms are generated due to a mainboard fault.
Solution Description
Conclusion:
Multiple component alarms are generated due to a mainboard fault.
Solution:
Replace the mainboard.
This is my solution. How about yours? Go ahead and share it with us!


