Lin 14d3f1b376
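!1555 [FIX] Adapt model building to the unified fine-tuning structure, unify fine-tuning model argument passing to fit all LLM models, and remove redundant code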
6 months ago
causal_language_modeling !1521 Add rule validation 6 months ago
contrastive_language_image_pretrain add Trainer.export 7 months ago
general_task_trainer add Trainer.export 7 months ago
image_classification add Trainer.export 7 months ago
image_to_text_generation fix multi_image infer with pipeline in blip2 doc and improve prompt and template 7 months ago
image_to_text_retrieval Add rule validation 6 months ago
masked_image_modeling trainer API rework (C-P) 7 months ago
masked_language_modeling trainer API rework (C-P) 7 months ago
question_answering update API description of some trainers 7 months ago
text_classfication update API description of some trainers 7 months ago
token_classification update API description of some trainers 7 months ago
translation update API description of some trainers 7 months ago
__init__.py add blip2 2-stage inference support 8 months ago
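base_trainer.py !1555 [FIX] Adapt model building to the unified fine-tuning structure, unify fine-tuning model argument passing to fit all LLM models, and remove redundant code 6 months ago
build_trainer.py Add BLIP2 stage 1 to the repository 9 months ago
config_args.py Remove per_epoch_size and replace it with sink_size 11 months ago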
optimizer_grouped_parameters.py add all parallel methods for GPT-2. 1 year ago
trainer.py [Fix]Build model adapts unify llm pet-tuning framework. 6 months ago
training_args.py Fix the single-card sample code in the high-level interface development examples 7 months ago
utils.py Support mindpet weights from earlier versions 6 months ago

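The MindSpore Transformers suite aims to provide a full-process toolkit for large-model training, inference, and deployment. It offers the industry's mainstream Transformer-based pretrained models and covers a rich set of parallel features, with the goal of helping users train large models easily. Documentation: https://mindformers.readthedocs.io/zh-cn/latest/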

Jupyter Notebook Python Markdown Shell