Hi guys!
Here’s a case that disks are powered off and cannot be monitored as an internal command times out.
Fault Description
Disks are powered off and cannot be monitored as an internal command times out.
Symptom
An alarm is displayed indicating that disks cannot be monitored.
0xF000A0015MajorNoneThe system failed to monitor the disk (disk enclosure DAE0XX, slot ID XX).
The error code is 1077936787.
Cause
When the PC-type command times out twice, the path is changed and the command is executed again,
and Multi_path (MP) deletes the preceding I/O path.
As a result, there is no available path can be queried in the Device Reset procedure,
causing the disks to be powered off by the BDM.
Fault Diagnosis
1.Check whether disks are powered off by the BDM based on logs.
[Disk send power ctrl cmd to driver succeed, frame id:2, slot id:69, bdm flag 3, dmi flag 4.][BDM_HDM][hdmPowerCtrlStart,1539][sched_work]
The power-off disk is in frame 2, slot 69.
2.View logs to check whether the path is not found after the Device Reset procedure.
[Path for REQ(0xffff88017d4e2478) is NULL.][BDM_SIO][sioDeviceReset,168][sched_work]
This fault can be diagnosed if the preceding conditions are met.
Solutions
Reinsert the disks.
Check After Recovery
The disks can be monitored.