#4178 [LLM fine-tuning] High probability of FAILED when many tasks are deployed at the same time

Open
created 11 months ago by wangj · 2 comments
wangj commented 11 months ago
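When many tasks are deployed at the same time, there is a high probability that they end in FAILED. The failures fall into two categories: build failures and deployment failures.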
wangj added this to the V20230517 milestone 11 months ago
wangj added the need review label 11 months ago
wangj added the bug label 11 months ago
chenzh was assigned by wangj 11 months ago
chenzh commented 11 months ago
Owner
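Build failures: builds currently have no limit on how many can run at once, and when several run simultaneously some applications time out. The Cloud Brain 2 (云脑2) team investigated and attributed this to a concurrency problem in the Cloud Brain 2 service; the concurrency ceiling is not yet known. We need to follow up with the Cloud Brain 2 team and add a limit on the number of AI applications that can be created at the same time.

Deployment failures: deployments are already capped at 5 tasks, yet errors about insufficient resources still occur. The Cloud Brain 2 team traced this to the custom resource specification configuration, which has now been changed to the default resource specification.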
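A cap on simultaneous AI application builds, like the one proposed for the build path, could be sketched roughly as below. This is only an illustration using Python's `asyncio.Semaphore`, not the platform's actual code: `MAX_CONCURRENT_BUILDS`, `run_limited`, `fake_build`, and `demo` are hypothetical names, and the value 5 is an assumption borrowed from the existing deployment cap, since the real Cloud Brain 2 (云脑2) concurrency ceiling is still unknown.

```python
import asyncio

# Assumed value: mirrors the existing deployment cap of 5.
# The real Cloud Brain 2 concurrency ceiling is not known.
MAX_CONCURRENT_BUILDS = 5


async def run_limited(jobs, limit=MAX_CONCURRENT_BUILDS):
    """Run coroutine factories with at most `limit` of them active at once.

    Returns the results in submission order plus the peak number of
    jobs that were running simultaneously.
    """
    semaphore = asyncio.Semaphore(limit)
    running = 0
    peak = 0

    async def guarded(job):
        nonlocal running, peak
        # Jobs beyond the limit wait here instead of hitting the backend.
        async with semaphore:
            running += 1
            peak = max(peak, running)
            try:
                return await job()
            finally:
                running -= 1

    results = await asyncio.gather(*(guarded(job) for job in jobs))
    return list(results), peak


async def fake_build(app_id):
    # Hypothetical stand-in for the request that creates an AI application.
    await asyncio.sleep(0.01)
    return f"app-{app_id}: built"


def demo(n_jobs=12, limit=MAX_CONCURRENT_BUILDS):
    # Bind i as a default argument so each factory builds its own app id.
    jobs = [lambda i=i: fake_build(i) for i in range(n_jobs)]
    return asyncio.run(run_limited(jobs, limit))
```

Calling `demo(12, 5)` submits twelve builds but never lets more than five run at once; the remaining builds queue rather than time out against the backend. Once the actual ceiling is confirmed with the Cloud Brain 2 team, only the limit value would need to change.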
chenzh added the test label 11 months ago
wangj commented 11 months ago
Owner
Testing is blocked by #4191.
wangj modified the milestone from V20230517 to V20230531 11 months ago
wangj modified the milestone from V20230531 to V20230628 10 months ago
wangj removed the test label 10 months ago
wangj removed this from the V20230628 milestone 10 months ago