
GPU VM login "Lost connection, trying to reconnect."

Latest reply: Feb 24, 2022 13:21:22

Hello, everyone!

This post shares how to resolve the "Lost connection, trying to reconnect." message that is displayed during GPU VM login.

Fault Type

In FusionAccess 8.0.1, when a user logs in to a GPU VM, a message is displayed indicating that the connection has been lost and that the client is trying to reconnect. The cause is a faulty NVIDIA driver file on the user VM. Uninstalling the driver alone does not resolve the fault. The fault is rectified only after the driver files are manually deleted and the driver is reinstalled.

Keywords: login, disconnection, NVIDIA.

Applicable version: FusionAccess 6.5.1

Symptom

When a user logs in to a GPU VM, a message is displayed indicating that the connection has been lost and that the client is trying to reconnect to the VM.

Analysis

1. Check the TraceLogMainService_20201221-151733.log file of the VM. The log shows that the VM was intermittently disconnected and reconnected at 14:31:25.867 on December 21, 2020.

2. View TraceLog_20201221-143046.log to check whether the GPU function process is normal.

The NVIDIA GRID V100D-2Q driver is listed, indicating that the GPU card and driver are normal.

The resolution of the client is successfully changed.

The NVIDIA driver snapshot runs, and HdpxCore.dll and NvEncoder.dll are loaded successfully.

After NvEncoder.dll is loaded, however, the log shows no subsequent process. Normally, NvEncoder.dll then invokes the Microsoft d3d9.dll file, as shown in the following figure.

3. Log in to the VM using VNC and open APIMonitor as an administrator. In the API Filter area on the left, select LoadLibraryA, LoadLibraryExA, LoadLibraryExW, and LoadLibraryW. Then, under Running Processes, double-click the HDPDisplay.exe process to start monitoring its API calls.

4. Use the client to log in to the GPU VM again. If the login fails, use VNC to log in to the VM again. APIMonitor shows that NvEncoder.dll invokes NvFBC64.dll, d3d9.dll, and nvapi64.dll; however, the path is not C:\Windows\System32\DriverStore\FileRepository\XXX. (The following figure shows a normal path, which is a relative path nvapi64.dll.)

5. Run C:\Windows\system32\pnputil.exe /enum-drivers > c:\test.log in the CLI, open c:\test.log, and search for nvidia. Multiple INF files associated with NVIDIA are found.


6. After confirming which NVIDIA driver version is the correct one, delete each of the other NVIDIA driver packages with a command such as C:\Windows\system32\pnputil.exe /delete-driver oem8.inf. If you are not sure about the version, uninstall the NVIDIA driver from the control panel, run the command in step 5 again to find all oemXX.inf files that belong to NVIDIA, and then run the pnputil.exe /delete-driver command to delete each of them.

7. Install the NVIDIA driver again and verify the login.
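
Steps 5 and 6 can be sketched in Python for anyone who prefers not to search the log by hand. This is only a rough sketch, not part of the original procedure. It assumes that c:\test.log uses the English field labels "Published Name" and "Provider Name" that recent pnputil versions print, and that driver packages are separated by blank lines. The function names and the sample text are hypothetical and made up for illustration. The script only builds the /delete-driver command lines as strings; it does not run them.

```python
import re

# Invented sample in the assumed layout of `pnputil /enum-drivers` output.
# The INF names, providers, and versions below are illustrative only.
SAMPLE = """\
Microsoft PnP Utility

Published Name:     oem8.inf
Original Name:      nv_dispi.inf
Provider Name:      NVIDIA
Class Name:         Display adapters
Driver Version:     03/08/2020 26.21.14.4292

Published Name:     oem12.inf
Original Name:      nvgridsw.inf
Provider Name:      NVIDIA
Class Name:         Display adapters
Driver Version:     11/20/2020 27.21.14.5257

Published Name:     oem3.inf
Original Name:      e1d65x64.inf
Provider Name:      Intel
Class Name:         Network adapters
Driver Version:     07/14/2016 12.15.22.6
"""


def parse_pnputil_enum(text):
    """Split the listing into one dict per driver package."""
    packages = []
    current = {}
    for line in text.splitlines():
        if ":" not in line:
            # Blank or banner line: close the record being built, if any.
            if current:
                packages.append(current)
                current = {}
            continue
        key, _, value = line.partition(":")
        current[key.strip()] = value.strip()
    if current:
        packages.append(current)
    return packages


def nvidia_packages(packages):
    """Keep only oemXX.inf packages whose provider mentions NVIDIA."""
    return [
        p for p in packages
        if "nvidia" in p.get("Provider Name", "").lower()
        and re.fullmatch(r"oem\d+\.inf", p.get("Published Name", ""), re.IGNORECASE)
    ]


def delete_commands(packages, keep=()):
    """Build (but do not run) one /delete-driver command per package not in keep."""
    keep = {name.lower() for name in keep}
    return [
        rf"C:\Windows\system32\pnputil.exe /delete-driver {p['Published Name']}"
        for p in packages
        if p["Published Name"].lower() not in keep
    ]


for command in delete_commands(nvidia_packages(parse_pnputil_enum(SAMPLE))):
    print(command)
```

To use it on a real VM, read the text of c:\test.log instead of SAMPLE (the file encoding depends on the shell that created it) and review every printed command before running it, because deleting the wrong oemXX.inf removes an unrelated driver.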

Solution

Perform the operations described in steps 5 to 7 of the Analysis section.

For details, see https://www.zhihu.com/question/51654630.

Summary and Suggestions

If a GPU VM is faulty, check whether the NVIDIA driver version is correct, whether the driver is properly installed, and whether the DLL files are loaded successfully.

This is my solution, how about yours? Go ahead and share it with us!


The post is synchronized to: Huawei Cloud Computing Case


VinceD
Moderator Created Apr 7, 2021 06:26:37

thanks for sharing.

Unicef
MVE Created Apr 7, 2021 07:55:45

Very good

Lianet
Created Apr 9, 2021 02:01:05

Interesting

thibay
Created May 3, 2021 09:04:14

Good case. Thanks for sharing

bobi
Created May 6, 2021 14:26:59

Thanks for sharing

VinceD
Moderator Created Jul 13, 2021 03:44:08

thanks for sharing.

Unicef
MVE Created Jul 13, 2021 13:40:33

GOOD

Saqibaz
Created Feb 24, 2022 13:21:22

Thanks for sharing
