Transfer of pre-trained representations improves sample efficiency and simplifies hyperparameter tuning when training deep neural networks for vision. We revisit the paradigm of pre-training on large supervised datasets and fine-tuning the model on a target task. We scale up pre-training, and propose a simple recipe that we call Big Transfer (BiT). By combining a few carefully selected components, and transferring using a simple heuristic, we achieve strong performance on over 20 datasets. BiT performs well across a surprisingly wide range of data regimes, from 1 example per class to 1M total examples. BiT achieves 87.5% top-1 accuracy on ILSVRC-2012, 99.4% on CIFAR-10, and 76.3% on the 19-task Visual Task Adaptation Benchmark (VTAB). On small datasets, BiT attains 76.8% on ILSVRC-2012 with 10 examples per class, and 97.0% on CIFAR-10 with 10 examples per class. We conduct a detailed analysis of the main components that lead to high transfer performance.
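Among the "carefully selected components", the paper's key architectural change is replacing Batch Normalization with Group Normalization plus Weight Standardization, so the recipe behaves well at both large pre-training and small fine-tuning batch sizes. Below is a minimal NumPy sketch of the weight-standardization step; the function name and tensor shapes are illustrative, not taken from this repository.

```python
import numpy as np

def standardize_weight(w, eps=1e-10):
    """Weight Standardization: zero mean, unit variance per output filter.

    BiT pairs this with GroupNorm in place of BatchNorm.
    w has shape (out_channels, in_channels, kH, kW).
    """
    mean = w.mean(axis=(1, 2, 3), keepdims=True)
    var = w.var(axis=(1, 2, 3), keepdims=True)
    return (w - mean) / np.sqrt(var + eps)

# Example: standardize a random 3x3 conv kernel with 64 output filters.
w = np.random.randn(64, 32, 3, 3).astype(np.float32)
w_std = standardize_weight(w)
print(w_std.mean(axis=(1, 2, 3))[:3])  # each ~0
print(w_std.std(axis=(1, 2, 3))[:3])   # each ~1
```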
| Model | Context | Top-1 (%) | Top-5 (%) | Params (M) | Train T. | Infer T. | Download | Config | Log |
|-------|---------|-----------|-----------|------------|----------|----------|----------|--------|-----|
| BiT50-S | D910x8-G | 76.81 | 93.17 | 25 | 652s/epoch | 189.8ms/step | model | cfg | log |
Please refer to the installation instructions in MindCV.
Please download the ImageNet-1K dataset for model training and validation.
Hyper-parameters. The hyper-parameter configurations for producing the reported results are stored in the yaml files in the mindcv/configs/BigTransfer folder. For example, to train with one of these configurations, you can run:
```shell
# train BiT on 8 NPUs
mpirun -n 8 python train.py -c configs/BigTransfer/BiT50_ascend.yaml --data_dir /path/to/imagenet
```
Note that the number of GPUs/Ascend devices and the global batch size will influence the training results. To reproduce the reported results as closely as possible, it is recommended to use the same number of GPUs/Ascend devices with the same batch size.
Detailed adjustable parameters and their default values can be seen in config.py.
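As a quick way to inspect those parameters, the config file can be loaded directly; this is a minimal sketch assuming the config is plain YAML readable with PyYAML (the keys printed depend on the file's actual contents):

```python
import yaml  # pip install pyyaml

# Load a training config and list every adjustable parameter it sets.
with open("configs/BigTransfer/BiT50_ascend.yaml") as f:
    cfg = yaml.safe_load(f)

for key, value in sorted(cfg.items()):
    print(f"{key}: {value}")
```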
To validate the trained model, you can use validate.py. Here is an example for BiT-50 to verify the accuracy of pretrained weights.
```shell
python validate.py -c configs/BigTransfer/BiT50_ascend.yaml --data_dir /path/to/imagenet --ckpt_path /path/to/ckpt
```
Please refer to the deployment tutorial in MindCV.
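For a quick sanity check outside the deployment pipeline, a pretrained model can be instantiated in Python. This is a hedged sketch: create_model is MindCV's standard model factory, but the registered name "BiTresnet50" used below is an assumption based on this folder's naming and should be checked against MindCV's model registry.

```python
import mindspore as ms
from mindcv.models import create_model

# Assumed model name "BiTresnet50"; verify against MindCV's model registry.
model = create_model("BiTresnet50", num_classes=1000, pretrained=True)
model.set_train(False)

# Dummy ImageNet-sized input: (batch, channels, height, width).
x = ms.ops.randn(1, 3, 224, 224)
logits = model(x)
print(logits.shape)  # (1, 1000)
```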