t00852069 bcf364495b gpt2-13B 910B 7 months ago
run_gpt2.yaml Remove the optimizer_shard switch 7 months ago
run_gpt2_13b.yaml gpt2 13b seq length default 2048 7 months ago
run_gpt2_13b_910b.yaml gpt2-13B 910B 7 months ago
run_gpt2_52b.yaml Remove the optimizer_shard switch 7 months ago
run_gpt2_lora.yaml [Features]Add Parameters-Efficient-Tuning unified framework of LLMs. 7 months ago
run_gpt2_txtcls.yaml Remove the optimizer_shard switch 7 months ago
run_gpt2_xl.yaml Remove the optimizer_shard switch 7 months ago
run_gpt2_xl_lora.yaml [Features]Add Parameters-Efficient-Tuning unified framework of LLMs. 7 months ago

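The MindSpore Transformers suite aims to provide a full-pipeline toolkit for training, inference, and deployment of large models. It offers mainstream Transformer-based pretrained models and covers a rich set of parallelism features, with the goal of making large-model training easy for users. Documentation: https://mindformers.readthedocs.io/zh-cn/latest/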

Jupyter Notebook Python Markdown Shell