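Scenario description
Testing showed that when training task results are synced back, 虎鲸 sometimes returns NOT FOUND. The community platform currently treats this as "no result file", so the sync cannot be retried.
In practice, uploads to object storage may be asynchronous, so an object may not yet be reachable if it is accessed immediately after upload.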
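Expected behavior
When training task results are synced back and 虎鲸 returns NOT FOUND, the community platform no longer assumes outright that there is no result file. The frontend shows "Sync failed or no result file" and lets the user retry.
The backend performs the retry.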
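Besides training tasks, debug tasks must also be tested.
Example checkpoint: when no file is synced back, the result download tab is empty rather than showing "File sync failed".
This case is hard to construct. In my actual testing, for a debug task with no result file, the sync-status API could still return success. Currently, a 404 triggers a retry (after the retry, the status and step are updated at the next sync-status check; the check period is one minute). At most 2 retries are triggered; if the response is still 404, the result tab is treated as empty.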
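The retry rule described above can be sketched as a small state machine. This is only an illustration under stated assumptions, not the platform's actual code: the names `SyncState`, `ResultSync`, and `on_status_check` are hypothetical, and the handling of status codes other than 200 and 404 (left pending here) is a guess, since the thread does not cover them.

```python
from dataclasses import dataclass
from enum import Enum

# From the thread: at most 2 retries are triggered on 404.
MAX_RETRIES = 2


class SyncState(Enum):
    PENDING = "pending"      # still waiting for a definitive answer
    SUCCEEDED = "succeeded"  # result files were synced back
    EMPTY = "empty"          # treated as "no result files"


@dataclass
class ResultSync:
    """Hypothetical per-task record of the result sync-back progress."""
    retries: int = 0
    state: SyncState = SyncState.PENDING


def on_status_check(sync: ResultSync, http_status: int) -> ResultSync:
    """Handle one periodic sync-status check (one minute apart in the thread).

    http_status is the status code observed for the result sync-back.
    """
    if sync.state is not SyncState.PENDING:
        return sync  # already settled, nothing to do

    if http_status == 200:
        sync.state = SyncState.SUCCEEDED
    elif http_status == 404:
        if sync.retries < MAX_RETRIES:
            # Trigger a retry; the outcome is evaluated at the next check.
            sync.retries += 1
        else:
            # Still 404 after the allowed retries: show an empty result tab.
            sync.state = SyncState.EMPTY
    # Assumption: any other status code leaves the sync pending.
    return sync
```

With this sketch, three consecutive 404 checks end in `EMPTY` after two retries, while a 404 followed by a 200 ends in `SUCCEEDED` after one retry.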
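Test scenarios:
1. The debug/training task has result files and the sync-back succeeds; the result download tab shows the files.
2. The debug/training task has no result files; the result download tab is empty.
3. The debug/training task is stopped while in the waiting state; the result download tab is empty.
Scenarios 2 and 3 triggered the retry mechanism.
Scenario 1 with a retry triggered is hard to construct and has not been tested yet. The 404 NOT FOUND symptom used to occur frequently when running DCU debug tasks (if the c2net library's upload_output method is called inside the container, results can still be synced back after the debug task is stopped). This will be retested once the 虎鲸 side fixes 1132.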
Already released. Closing. We will keep monitoring.