#5391 gpu训练的资源利用率曲线图重复显示gpuUtil、accCardUtil

Closed
created 3 weeks ago by wangj · 3 comments
wangj commented 3 weeks ago
启智混合智算集群的GPU训练任务,资源利用率曲线图重复显示了2组指标:gpuUtil、accCardUtil;gpuMemUsage、accCardMemUsage。
wangj added this to the V20240402 milestone 3 weeks ago
wangj added the
bug
label 3 weeks ago
liaowsh was assigned by wangj 3 weeks ago
ychao_1983 was assigned by wangj 3 weeks ago
wangj added the
grampus
label 3 weeks ago
wangj commented 3 weeks ago
Owner
需要兼容鹏城云脑1。该分中心安装的是旧版本章鱼,没有accCardUtil、accCardMemUsage指标。
wangj commented 2 weeks ago
Owner
仍然能复现。待虎鲸解决。 @liaowsh
wangj added the
test
label 2 weeks ago
wangj closed this issue 2 weeks ago
wangj commented 2 weeks ago
Owner
通过验证。 无论是鹏城云脑1 还是启智混合智算集群,智算GPU训练的资源利用率都展示accCardUtil、accCardMemUsage。不再展示gpuUtil、gpuMemUsage。
Sign in to join this conversation.
No Milestone
No Assignees
1 Participants
Notifications
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.