You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
i-robot d1ad5dd7e8
!2764 【qwen】bugfix: mslite下Qwen-14B运行batch_size=10时报告内存不足
1 week ago
.gitee Initial commit 2 years ago
.jenkins update ms ci dependence 4 months ago
chat_web chatweb修复网页prompt文本框不能换行的问题,文档中增加对请求体的描述 3 months ago
configs bugfix 2 weeks ago
docs release note 1.0.2 1 week ago
mindformers 修复glm2_6b_ptuning2在增量推理时kvcache和attention mask的序列维度没有扩充prefix的问题 2 weeks ago
research [qwen] lite.ini: don't enable ge.externalWeight by default 2 weeks ago
scripts !2021 【bugfix】修改动态组网脚本中环境变量设置 4 months ago
tests tokenizer用例bug修复 3 months ago
.gitignore add nezha, checkpoint transformation 1 year ago
.readthedocs.yaml fix read the docs file. 6 months ago
LICENSE Initial commit 2 years ago
OWNERS update OWNERS. 3 months ago
README.md release note 1.0.2 1 week ago
build.sh 删除setuptool版本限制,修改构建时使用的python版本 7 months ago
requirements.txt update requirement opencv-python-headless 3 months ago
run_infer_main.py fixed 2148319 from https://gitee.com/renyujin/mindformers/pulls/2374 1 month ago
run_mindformer.py 修复策略文件保存路径问题 3 months ago
setup.py update setup.py. 3 weeks ago

MindSpore Transformers套件的目标是构建一个大模型训练、微调、评估、推理、部署的全流程开发套件: 提供业内主流的Transformer类预训练模型和SOTA下游任务应用,涵盖丰富的并行特性。期望帮助用户轻松的实现大模型训练和创新研发。

Jupyter Notebook Python Markdown Shell