
RuntimeError: ACL stream synchronize failed

Latest reply: Dec 22, 2021 14:23:46

Hi,


I tried to train my model on an Ascend AI Processor, but I got this error:


[ERROR] RUNTIME(6845)mem async copy error, retCode=0x87, [pcie dma copy error].
[ERROR] RUNTIME(6845)mem async copy failed device_id=1, stream_id=534, task_id=831
[ERROR] RUNTIME(6845)copy_type=1, memcpy_type=0, copy_data_type=0, src_addr=dbc0, dst_addr=2c00000001, length=4
Traceback (most recent call last):...
RuntimeError: ACL stream synchronize failed.
THPModule_npu_shutdown success.
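
In case it helps with triage, here is a minimal sketch for pulling the key=value fields (device, stream, task, return code) out of the runtime log above. It only assumes the `key=value` layout visible in that log. The helper name `parse_runtime_error` is hypothetical and not part of any Ascend tooling.

```python
import re

# Hypothetical helper, not an Ascend API: it only pattern-matches key=value
# pairs in the text of the runtime log quoted in this post.
FIELD_RE = re.compile(r"(\w+)=(\w+)")


def parse_runtime_error(log_text):
    """Return a dict of the key=value fields found in the log text."""
    fields = {}
    for key, value in FIELD_RE.findall(log_text):
        # Keep the first occurrence if a key appears more than once.
        fields.setdefault(key, value)
    return fields


log = (
    "[ERROR] RUNTIME(6845)mem async copy error, retCode=0x87, [pcie dma copy error]."
    "[ERROR] RUNTIME(6845)mem async copy failed device_id=1, stream_id=534, task_id=831"
    "[ERROR] RUNTIME(6845)copy_type=1, memcpy_type=0, copy_data_type=0, "
    "src_addr=dbc0, dst_addr=2c00000001, length=4"
)

info = parse_runtime_error(log)
print(info["device_id"], info["stream_id"], info["task_id"], info["retCode"])
```

Values are kept as strings, since the log mixes decimal and hexadecimal fields (for example `retCode` and the addresses).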

I was using the ascend-pytorch-x86:21.0.1 Docker image. After this error, npu-smi no longer shows NPU chip device 1.


Why did I get this error, and how can I fix it? I look forward to your help with this issue.

Hello,
We're working on your problem. Please be patient.