# ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
ConViT combines the strengths of convolutional architectures and Vision Transformers (ViTs). It introduces gated positional self-attention (GPSA), a form of positional self-attention that can be equipped with a "soft" convolutional inductive bias. The GPSA layers are initialized to mimic the locality of convolutional layers; each attention head is then free to escape locality by adjusting a gating parameter that regulates how much attention is paid to positional versus content information. ConViT outperforms DeiT (Touvron et al., 2020) on ImageNet while offering much improved sample efficiency.
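The gating idea above can be sketched in a few lines of NumPy. This is a simplified illustration, not the MindCV implementation: `gpsa_attention` and its arguments are hypothetical names, and the per-head gate `lambda` is blended through a sigmoid as described in the ConViT paper, mixing a content attention map with a positional one.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def gpsa_attention(content_scores, positional_scores, gate):
    """Sketch of gated positional self-attention (GPSA).

    content_scores:    raw query-key scores (patches x patches)
    positional_scores: raw scores from relative positional embeddings
    gate:              learnable per-head scalar (lambda in the paper)

    sigmoid(gate) in [0, 1] weights positional vs. content attention.
    A convex combination of two row-stochastic maps is itself
    row-stochastic, so no renormalization is needed.
    """
    sigma = 1.0 / (1.0 + np.exp(-gate))
    return (1.0 - sigma) * softmax(content_scores) + sigma * softmax(positional_scores)
```

Initializing `gate` so that `sigmoid(gate)` is close to 1, with local positional scores, makes the head start out convolution-like; training can then lower the gate to attend by content instead.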
| Context | Model | Top-1 (%) | Top-5 (%) | Pynative train (s/epoch) | Pynative infer (ms) | Graph train (s/epoch) | Graph infer (ms) | Download | Config |
|---|---|---|---|---|---|---|---|---|---|
| GPU | convit_tiny | | | | | | | | |
| Ascend | convit_tiny | 73.66 | 91.72 | 240 | | | | | |
| GPU | convit_tiny_plus | | | | | | | | |
| Ascend | convit_tiny_plus | 77.00 | 93.60 | 247 | | | | | |
| GPU | convit_small | | | | | | | | |
| Ascend | convit_small | 81.63 | 95.59 | 490 | | | | | |
| GPU | convit_small_plus | | | | | | | | |
| Ascend | convit_small_plus | | | | | | | | |
| GPU | convit_base | | | | | | | | |
| Ascend | convit_base | | | | | | | | |
| GPU | convit_base_plus | | | | | | | | |
| Ascend | convit_base_plus | | | | | | | | |
The yaml config files that yield competitive results on ImageNet for the different models are listed in the `config` folder. To launch distributed training with a preset yaml config:

```shell
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
mpirun -n 8 python train.py -c config/convit/convit_tiny_gpu.yaml --data_dir /path/to/imagenet
```
Here is an example of finetuning a pretrained convit_tiny on the CIFAR-10 dataset using the Adam optimizer:

```shell
python train.py --model=convit_tiny --pretrained --opt=adam --lr=0.001 --dataset=cifar10 --num_classes=10 --dataset_download
```

Detailed adjustable parameters and their default values can be found in `config.py`.
To validate a model, you can use `validate.py`. Here is an example for convit_tiny that verifies the accuracy of the pretrained weights:

```shell
python validate.py --model=convit_tiny --dataset=imagenet --val_split=val --pretrained
```
To validate the model after your own training, pass the checkpoint path to `validate.py`. Here is an example for convit_tiny:

```shell
python validate.py --model=convit_tiny --dataset=imagenet --val_split=val --ckpt_path='./ckpt/convit.ckpt'
```