You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
Hsshuai f61a2bb2fe fix qwenvl issues, including image padding, infer batch size, weights shape 1 month ago
..
baichuan update doc for atlas device 4 months ago
baichuan2 tokenizer base class modify 3 months ago
codegeex tokenizer base class modify 3 months ago
glm32k !2253 切换仓库中的BaseConfig基类为PretrainedConfig基类 2 months ago
internlm tokenizer base class modify 3 months ago
qwen Optimize lm_head matmul performance 1 month ago
qwenvl fix qwenvl issues, including image padding, infer batch size, weights shape 1 month ago
rewardmodel 补齐配置文件中src_strategy_path_or_dir字段 4 months ago
skywork update doc for atlas device 4 months ago
telechat 切换仓库中的BaseConfig基类为PretrainedConfig基类 3 months ago
visualglm !2253 切换仓库中的BaseConfig基类为PretrainedConfig基类 2 months ago
wizardcoder !2253 切换仓库中的BaseConfig基类为PretrainedConfig基类 2 months ago
ziya update doc for atlas device 4 months ago
README.md update research/README.md. 8 months ago
run_multinode.sh internlm启动脚本,预处理脚本,readme 9 months ago
run_singlenode.sh internlm启动脚本,预处理脚本,readme 9 months ago

MindSpore Transformers套件的目标是构建一个大模型训练、微调、评估、推理、部署的全流程开发套件: 提供业内主流的Transformer类预训练模型和SOTA下游任务应用,涵盖丰富的并行特性。期望帮助用户轻松的实现大模型训练和创新研发。

Jupyter Notebook Python Markdown Shell