# ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
ConViT combines the strengths of convolutional architectures and Vision Transformers (ViTs). It introduces gated positional self-attention (GPSA), a form of positional self-attention that can be equipped with a "soft" convolutional inductive bias. The GPSA layers are initialized to mimic the locality of convolutional layers; each attention head is then free to escape locality by adjusting a gating parameter that regulates how much attention is paid to position versus content information. ConViT outperforms DeiT (Touvron et al., 2020) on ImageNet while offering much improved sample efficiency.
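As a rough illustration of the gating idea, here is a minimal NumPy sketch of a single GPSA head. This is not the MindCV implementation; the function and variable names are invented for exposition, and the positional scores are assumed to be precomputed from relative patch positions as in the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def gpsa_attention(q, k, pos_scores, lam):
    """One GPSA head (illustrative sketch).

    q, k:       (num_patches, head_dim) query/key projections
    pos_scores: (num_patches, num_patches) relative-position scores
    lam:        scalar gating parameter lambda_h for this head
    """
    d = q.shape[-1]
    content = softmax(q @ k.T / np.sqrt(d))   # standard content self-attention
    positional = softmax(pos_scores)          # locality-biased positional attention
    gate = 1.0 / (1.0 + np.exp(-lam))         # sigma(lambda_h)
    # lambda_h is initialized positive, so the gate starts close to 1 and the
    # head begins convolution-like (local); training can lower lambda_h to
    # shift attention toward content instead.
    return (1.0 - gate) * content + gate * positional
```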
| Model | Context | Top-1 (%) | Top-5 (%) | Params (M) | Train T. | Infer T. | Download | Config | Log |
|-------|---------|-----------|-----------|------------|----------|----------|----------|--------|-----|
| convit_tiny | D910x8-G | 73.66 | 91.72 | 6 | 243s/epoch | 50.7ms/step | model | cfg | log |
| convit_tiny_plus | D910x8-G | 77.00 | 93.60 | 10 | 246s/epoch | 40.9ms/step | model | cfg | log |
| convit_small | D910x8-G | 81.63 | 95.59 | 27 | 491s/epoch | 36.4ms/step | model | cfg | log |
| convit_small_plus | D910x8-G | 81.80 | 95.42 | 48 | 557s/epoch | 32.7ms/step | model | cfg | log |
| convit_base | D910x8-G | 82.10 | 95.52 | 86 | 880s/epoch | 32.8ms/step | model | cfg | log |
| convit_base_plus | D910x8-G | 81.96 | 95.04 | 152 | 1031s/epoch | 36.6ms/step | model | cfg | log |
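To experiment with one of the models above without training from scratch, you can load it through the MindCV model factory. A minimal sketch, assuming the registry name `convit_tiny` matches the configs above and that pretrained weights are available for download:

```python
from mindcv.models import create_model

# Build convit_tiny and download its pretrained ImageNet-1K weights.
model = create_model("convit_tiny", pretrained=True, num_classes=1000)
model.set_train(False)  # switch to inference mode
```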
Please refer to the installation instructions in MindCV.
Please download the ImageNet-1K dataset for model training and validation.
Hyper-parameters. The hyper-parameter configurations used to produce the reported results are stored in the yaml files in the `configs/convit` folder. For example, to train with one of these configurations, you can run:
```shell
# train convit_tiny on 8 Ascend devices
python train.py --config configs/convit/convit_tiny_ascend.yaml --data_dir /path/to/imagenet
```
Note that the number of GPU/Ascend devices and the batch size will influence the training results. To reproduce the reported results as closely as possible, it is recommended to use the same number of devices and the same batch size.
Detailed adjustable parameters and their default values can be seen in `config.py`.
To validate the model, you can use `validate.py`. Here is an example for convit_tiny to verify the accuracy of your trained model:
```shell
python validate.py --config configs/convit/convit_tiny_ascend.yaml --data_dir /path/to/imagenet --ckpt_path /path/to/convit_tiny.ckpt
```
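If you want to sanity-check a trained checkpoint outside of `validate.py`, the sketch below loads it and runs a single forward pass. The paths, the 224x224 input shape, and the dummy input are assumptions for illustration; adjust them to your setup.

```python
import numpy as np
import mindspore as ms
from mindcv.models import create_model

# Rebuild the network and restore the trained weights.
net = create_model("convit_tiny", num_classes=1000)
params = ms.load_checkpoint("/path/to/convit_tiny.ckpt")  # assumed path
ms.load_param_into_net(net, params)
net.set_train(False)

# Forward a dummy batch to confirm the checkpoint loads and produces logits.
dummy = ms.Tensor(np.random.rand(1, 3, 224, 224), ms.float32)
logits = net(dummy)
print(logits.shape)  # expected: (1, 1000)
```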
Please refer to the deployment tutorial in MindCV.