How do I replace an NVMe SSD? What is the replacement procedure?
We have no experience doing this on FusionCube.
The product is FusionCube 6000, and the SSD card is an ES3000 V2.
Best answer
Hello!
Please find below how to replace an NVMe SSD and the replacement procedure:

Determine the status of the faulty device.
1. On the disk topology page of the storage pool, perform the required operation based on the status of the faulty device (hard disk or SSD device used as main storage). To query the troubleshooting method:
a. Click the faulty device and click Query Troubleshooting Method.
b. In the displayed dialog box, click OK.
c. Click the faulty device again. The recommended troubleshooting method is displayed.
Figure 2-4 shows the key operation steps.
Figure 2-4. Querying the troubleshooting method recommended by the system

Replace the faulty device.
Perform the required operation based on whether the faulty device needs to be powered off.
NOTE: V3 and V5 SSDs can coexist in the same storage pool but cannot coexist on the same server.
Table 2-1 shows the methods of powering off devices of different storage media.
Table 2-1. Methods of powering off devices
| Device Medium Type | Disk Type Displayed on FusionStorage Block Self-Maintenance Platform | Power Off Method |
•If the SSD card or SSD is to be replaced, take note of its electronic serial number (ESN) displayed on the FusionStorage Block Self-Maintenance Platform before replacement.
•If disks added to a FusionStorage storage pool are required to form redundant array of independent disks (RAID) 0, perform the operations provided in the server documentation.
•If disks in RAID 0 are hot-swapped, manually activate RAID 0 to add the disks to the system. Otherwise, the disks cannot be identified by the system. For details, see the server documentation.
Determine the status of the hardware device. On the disk topology page of the storage pool, perform the required operation based on the status of the faulty device (hard disk or SSD device used as main storage).
Restore storage resources.
Enable hardware DIF. If the replaced device is an NVMe device and hardware DIF was enabled before the fault occurred, you need to enable hardware DIF after the device is replaced but before it is added to the storage pool.
Please refer to the link:
Hello, please check this guide document:
Cache Faults (NVMe SSD Card)

Scenarios
Replace hardware and restore services when a cache fault (NVMe SSD card) occurs.

Impact on the System
During the SCNA replacement, the SCNA O&M services are interrupted.

Prerequisites
Conditions
•Spare parts of the original model and specifications are available for replacement.
•You have located the server and labeled its panel to avoid misoperations.
Data
Table 1 Required data
| Category | Data | Default Value | Example Value |
|---|---|---|---|
| BMC | Management IP address | - | https://192.168.1.22 |
| | User name | root | - |
| | Password | Huawei12#$ | - |
| FusionStorage Block | Management IP address | - | https://192.168.8.162:28443/fsportal |
| | User name | admin | - |
| | Password | Huawei@CLOUD8! | - |
| VMware vCenter | Management IP address | - | https://192.168.8.68 |
| | User name | vSphere 5.5: root; vSphere 6.0: administrator@vsphere.local | |
| | Password | vSphere 5.5: vmware; vSphere 6.0: Huawei@123 | |
•The user name and password for logging in to vCenter vary with the .ova file you use.
•Obtain the user name and password from the VMware official website.

Procedure

Enter the FusionStorage Block maintenance mode.
1. Enable the host to enter the FusionStorage Block maintenance mode. For details, see Configuring the Host Maintenance Mode (FusionStorage Block).

Migrate VMs (VMware vSphere).
2. Access the real-time interface of the host by connecting to the physical device or remote virtual console.
3. Check the host status and perform related operations.
•If the host OS is running properly and the communication between the local PC and the host network is normal, go to Migrating VMs (VMware vSphere).
•If the host OS is not running properly or the communication between the local PC and the host network is abnormal, contact Huawei technical support.

Shut down the management VM.
4. Select the management VM, and click Power off the virtual machine on the Getting Started tab. A confirmation dialog box is displayed.
5. Click yes. The management VM is shut down.

Enter the VMware vSphere maintenance mode.
6. Enable the host to enter the VMware ESXi maintenance mode. For details, see Configuring the Host Maintenance Mode (VMware vSphere).

Replace hardware.
7. Replace faulty hardware. For details, see Parts Replacement.

Check the hardware installation status.
8. Use PuTTY and the following parameters to log in to the host CLI.
•IP address: management IP address of the host
•User name: fc2
9. Run the following command and enter the root user password to switch to user root.
    su - root
10. Run the following command to disable logout on timeout:
    TMOUT=0
11. Run the following command to check the SN of the NVMe SSD device:
    hioadm info
12. Run the following command to check the health status of the NVMe SSD device:
    hioadm info -d nvme0
NOTE: The query of the nvme0 health status is used as an example.
13. Perform operations according to the status.
| Status (Device Status) | Description | Operation |
|---|---|---|
| OK | Normal. The hardware is replaced successfully. | - |
| NOT OK | Abnormal. The hardware is faulty. | Go to 14. |
| BLANK | Abnormal. No hardware is detected. | Go to 15. |
14. (Optional) Perform operations as prompted.
15. (Optional) Replace the hardware again.
16. (Optional) Check whether the health status of the NVMe SSD device is normal again.
•If yes, the replacement is successful.
•If no, contact Huawei technical support.

Exit the VMware vSphere maintenance mode.
17. Enable the host to exit the VMware ESXi maintenance mode. For details, see Configuring the Host Maintenance Mode (VMware vSphere).

Configure the NVMe SSD pass-through function again.
18. Double-click vClient, and enter the vCenter IP address, user name and password.
19. Select the ESXi host of the faulty node, choose Configuration > Advanced Settings, click the editing button, deselect Non-Volatile memory controller in the list, click ok, and restart the host for the configuration to take effect.
20. After the system is started, select the ESXi host of the faulty node in vCenter again, choose Configuration > Advanced Settings, click the editing button, select Non-Volatile memory controller in the list, click ok, and restart the host for the configuration to take effect.
21. After the system is started, right-click the CVM of the faulty node.
22. Choose Edit Settings. Remove the NVMe SSD card from the PCI device list.
•If the VM starts with the host, end the configuration process.
•If the VM does not start with the host, perform 23.
23. Click Add, select Non-Volatile memory controller in the PCI device list, and add it to the VM.

Check the VM status (VMware vSphere).
24. Log in to the VMware vCenter. For details, see Logging In to the VMware vCenter.
25. Choose Inventory > Host and Clusters. The Host and Clusters page is displayed.
26. Check whether the VM status is Powered On. If the VM status is Powered Off, perform the following operations to start the VM:
a. Right-click the row where the VM to be operated is located, and choose Power.
b. Click Power On.

Exit the FusionStorage Block maintenance mode.
27. Enable the host to exit the FusionStorage Block maintenance mode. For details, see Configuring the Host Maintenance Mode (FusionStorage Block).

Restore storage resources.
28. Log in to the FusionStorage Manager WebUI. For details, see Logging In to the FusionStorage WebUI.
29. Choose Resource Pool > Storage Pools > Disk Topology. The Disk Topology page is displayed.
30. Check the status of the replaced hardware.
31. Perform operations according to the status.
| Status | Description | Operation |
|---|---|---|
| Green solid box | Normal. | - |
| Grey dotted box | Abnormal. The cache NVMe SSD card is removed from the storage pool. | Go to 32. |
| Red solid box | Abnormal. The cache NVMe SSD card needs to be restored. | Go to 33. |
| Red dotted box | Abnormal. Storage resources of the cache NVMe SSD card need to be restored. | Go to 34. |
32. (Optional) Rectify the grey dotted box fault.
a. Select the replaced cache NVMe SSD card.
b. Click Add to Storage Pool.
33. (Optional) Rectify the red solid box fault.
a. Select the replaced cache NVMe SSD card.
b. Click Query Troubleshooting Method.
c. Select the replaced cache NVMe SSD card.
d. Perform restoration operations as prompted.
34. (Optional) Rectify the red dotted box fault.
a. Use PuTTY and the following parameters to log in to the FusionStorage Manager CLI.
•IP address: management IP address of the active node of FusionStorage Block
•The default user name is dsware
•The default password is Huawei@CLOUD8!
b. Run the following command to disable logout on timeout:
    TMOUT=0
c. Run the following command to switch to the directory storing the command line tool dswareTool.sh:
    cd /opt/dsware/client/bin
NOTE: You must enter the authentication user name and password after running the dswareTool command. The default user for authentication is cmdadmin, and the default password is cmdHuawei@123.
d. Run the following command to restore storage resources:
    sh dswareTool.sh --op forceReplaceSSD -id ID -oldEsn oldEsn -newEsn newEsn -nodeMgrIp nodeMgrIp -type type
NOTE:
•ID: Storage pool ID. For details, see Resource Pool > Storage Pools > ID on the FusionStorage Manager WebUI.
•oldEsn: ESN of the faulty NVMe SSD device.
•newEsn: ESN of the new NVMe SSD device.
•nodeMgrIp: Management IP address of the host.
•type: Storage media usage. For cache, the value is cache.

Check the cache NVMe SSD card status.
35. Log in to the FusionStorage Manager WebUI. For details, see Logging In to the FusionStorage WebUI.
36. Choose Resource Pool > Storage Pools > Disk Topology. The Disk Topology page is displayed.
37. Check whether the cache NVMe SSD card is restored.
•If yes, no further action is required.
•If no, contact Huawei technical support.

This is the link: https://support.huawei.com/enterprise/en/doc/EDOC1000156428?idPath=7919749|7941815|23972641|250416235|21488161
You can search for the keywords: replace SSD Card
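
If it helps, here is a small Python sketch of the decision in step 31 and the command in step 34d. It only looks up which step to go to for a given Disk Topology status and assembles the forceReplaceSSD command line as text; it does not run anything on the node. The function names, the dictionary, and the sample values (pool ID 0, the two ESNs, the IP address) are made up for illustration. Only the status-to-step table and the command syntax come from the guide above.

```python
import shlex

# Step 31 table: Disk Topology box style -> step to perform next.
# None means the card is normal and nothing more is needed.
TOPOLOGY_NEXT_STEP = {
    "green solid": None,
    "grey dotted": 32,   # add the card back to the storage pool
    "red solid": 33,     # follow Query Troubleshooting Method
    "red dotted": 34,    # restore storage resources with dswareTool.sh
}


def next_step(box_status):
    """Return the step number to go to for a Disk Topology status (hypothetical helper)."""
    key = box_status.strip().lower()
    if key not in TOPOLOGY_NEXT_STEP:
        raise ValueError(f"unknown Disk Topology status: {box_status!r}")
    return TOPOLOGY_NEXT_STEP[key]


def build_force_replace_cmd(pool_id, old_esn, new_esn, node_mgr_ip, media_type="cache"):
    """Assemble the step 34d command line as a string (hypothetical helper).

    Run the result from /opt/dsware/client/bin on the FusionStorage Manager node.
    """
    values = [str(pool_id), old_esn, new_esn, node_mgr_ip, media_type]
    if any(not v for v in values):
        raise ValueError("all parameters must be non-empty")
    args = [
        "sh", "dswareTool.sh", "--op", "forceReplaceSSD",
        "-id", str(pool_id),
        "-oldEsn", old_esn,
        "-newEsn", new_esn,
        "-nodeMgrIp", node_mgr_ip,
        "-type", media_type,
    ]
    # shlex.join quotes any argument that contains shell-special characters.
    return shlex.join(args)


if __name__ == "__main__":
    # Sample values are placeholders, not real ESNs or addresses.
    if next_step("Red dotted") == 34:
        print(build_force_replace_cmd(0, "OLDESN0001", "NEWESN0002", "192.168.8.10"))
```

Printing the command first and comparing the ESNs against the ones you noted before the replacement is a cheap way to avoid swapping oldEsn and newEsn.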
Contact Us: e_online@huawei.com Copyright © 2022 Huawei Technologies Co., Ltd. All rights reserved.