#5121 【示例代码】新大跑智算NPU训练failed

Closed
created 3 months ago by wangj · 5 comments
wangj commented 3 months ago
新大(章鱼管理)跑的智算NPU训练任务failed。 日志报错:python: can't open file '/home/ma-user/davinci/train/davincirun.py': [Errno 2] No such file or directory” 任务名 wjtes202401162008825
wangj added this to the V20240116 milestone 3 months ago
wangj added the
bug
label 3 months ago
liuzx was assigned by wangj 3 months ago
wangj commented 3 months ago
Owner
镜像 mindtorch0.2_mindspore2.2.1_cann7.0rc1_train 问题。 正式环境用这个镜像跑智算NPU训练也会failed,一样的报错信息。
wangj added the
wait
label 3 months ago
wangj commented 3 months ago
Owner
重新跑了个任务,报错:FileNotFoundError: [Errno 2] No such file or directory: '/user/config/jobstart_hccl.json' 任务名 wjtes202401171163044
lewis was assigned by wangj 3 months ago
wangj commented 3 months ago
Owner
@lewis 新疆大学的NPU(非modelArts),跑训练failed。报错 : sh: line 0: cd: openi_cloudbrain_example: No such file or directory python: can't open file '/tmp/code/npu_mnist_example/train.py': [Errno 2] No such file or directory 任务名 wjtes2024011918t312073738
wangj commented 3 months ago
Owner
@liuzx 新大跑智算NPU训练示例代码failed,报错:ModuleNotFoundError: No module named 'easydict' 任务名 wjtes202401221612356
wangj commented 3 months ago
Owner
> @liuzx 新大跑智算NPU训练示例代码failed,报错:ModuleNotFoundError: No module named 'easydict' > 任务名 wjtes2024012216t324264750 在示例代码中,需要在import config之前添加代码行 :os.system("pip install easydict")
wangj closed this issue 3 months ago
wangj added the
test
label 3 months ago
Sign in to join this conversation.
No Milestone
No Assignees
1 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.