Paper: Mingxing Tan, Quoc V. Le. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. 2019.
The overall network architecture of EfficientNet-B0 is described in the paper above.
Datasets used: ImageNet, CIFAR-10
Note: Data will be processed in src/dataset.py
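For reference, below is a minimal sketch of the kind of input pipeline src/dataset.py typically builds with the MindSpore dataset API. The image size, augmentations, and normalization constants here are assumptions for illustration, not the repo's actual values:

```python
import mindspore.dataset as ds
import mindspore.dataset.transforms.c_transforms as C2
import mindspore.dataset.vision.c_transforms as C
from mindspore import dtype as mstype

def create_dataset(data_path, batch_size=128, training=True, workers=8):
    # ImageFolder layout: one subdirectory per class
    dataset = ds.ImageFolderDataset(data_path, num_parallel_workers=workers,
                                    shuffle=training)
    if training:
        trans = [
            C.RandomCropDecodeResize(224),   # assumed input size for B0
            C.RandomHorizontalFlip(prob=0.5),
        ]
    else:
        trans = [C.Decode(), C.Resize(256), C.CenterCrop(224)]
    trans += [
        # assumed ImageNet normalization constants (0-255 scale)
        C.Normalize(mean=[0.485 * 255, 0.456 * 255, 0.406 * 255],
                    std=[0.229 * 255, 0.224 * 255, 0.225 * 255]),
        C.HWC2CHW(),
    ]
    dataset = dataset.map(operations=trans, input_columns="image",
                          num_parallel_workers=workers)
    dataset = dataset.map(operations=C2.TypeCast(mstype.int32),
                          input_columns="label")
    return dataset.batch(batch_size, drop_remainder=training)
```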
After installing MindSpore via the official website, you can start training and evaluation as follows:
```bash
# run standalone training on GPU
cd scripts
bash run_standalone_train_gpu.sh [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [PRETRAINED_CKPT_PATH](optional)

# run evaluation on CPU
cd scripts
bash run_eval_cpu.sh [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [CHECKPOINT_PATH]
```
```
.
└─efficientnet
  ├─README.md
  ├─scripts
  │ ├─run_train_cpu.sh                    # launch training on CPU
  │ ├─run_standalone_train_gpu.sh         # launch standalone training on GPU
  │ ├─run_distribute_train_gpu.sh         # launch distributed training on GPU
  │ ├─run_eval_cpu.sh                     # launch evaluation on CPU
  │ └─run_eval_gpu.sh                     # launch evaluation on GPU
  ├─src
  │ ├─callbacks.py                        # callbacks for model evaluation during training
  │ ├─config.py                           # parameter configuration
  │ ├─dataset.py                          # data preprocessing
  │ ├─efficientnet.py                     # network definition
  │ ├─loss.py                             # customized loss function (see the sketch after this tree)
  │ ├─transform_utils.py                  # random augment utils
  │ └─transform.py                        # random augment class
  ├─efficientnet_b0_cifar10_config.yaml   # config for EfficientNet-B0 CIFAR-10 training
  ├─efficientnet_b0_imagenet_config.yaml  # config for EfficientNet-B0 ImageNet training
  ├─efficientnet_b1_imagenet_config.yaml  # config for EfficientNet-B1 ImageNet training
  ├─eval.py                               # evaluation script
  ├─train.py                              # training script
  ├─export.py                             # export script
  ├─mindspore_hub_conf.py                 # mindspore hub configuration file
  └─requirements.txt                      # pip requirements
```
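The customized loss in src/loss.py is a label-smoothing cross entropy (see the configs' `smoothing: 0.1` field and the performance tables below). As a rough sketch of how such a loss is commonly written in MindSpore (not the repo's exact code; conventions for the off-value vary):

```python
import mindspore as ms
import mindspore.nn as nn
import mindspore.ops as ops

class LabelSmoothingCrossEntropy(nn.Cell):
    """Cross entropy against smoothed one-hot targets."""
    def __init__(self, smoothing=0.1, num_classes=1000):
        super().__init__()
        self.num_classes = num_classes
        self.onehot = ops.OneHot()
        self.on_value = ms.Tensor(1.0 - smoothing, ms.float32)
        # one common convention; some implementations divide by num_classes
        self.off_value = ms.Tensor(smoothing / (num_classes - 1), ms.float32)
        self.ce = nn.SoftmaxCrossEntropyWithLogits(sparse=False, reduction="mean")

    def construct(self, logits, labels):
        # spread `smoothing` probability mass over the non-target classes
        soft_labels = self.onehot(labels, self.num_classes,
                                  self.on_value, self.off_value)
        return self.ce(logits, soft_labels)
```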
Parameters for both training and evaluation can be set in one of the configuration files:

- efficientnet_b0_cifar10_config.yaml
- efficientnet_b0_imagenet_config.yaml
- efficientnet_b1_imagenet_config.yaml
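These are plain YAML files, so a setting can be inspected or tweaked before launching a run, for example (assuming the keys are stored flat, mirroring the listings below):

```python
import yaml

# inspect the B0 ImageNet config
with open("efficientnet_b0_imagenet_config.yaml") as f:
    cfg = yaml.safe_load(f)

print(cfg["lr"], cfg["batch_size"], cfg["epochs"])

# tweak a value and write it back, e.g. to fit a smaller GPU
cfg["batch_size"] = 64
with open("efficientnet_b0_imagenet_config.yaml", "w") as f:
    yaml.safe_dump(cfg, f)
```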
Parameters for EfficientNet-B0 on ImageNet (efficientnet_b0_imagenet_config.yaml):

```
'random_seed': 1,            # fix random seed
'model': 'efficientnet_b0',  # model name
'drop': 0.2,                 # dropout rate
'drop_connect': 0.2,         # drop connect rate
'opt_eps': 0.001,            # optimizer epsilon
'lr': 0.064,                 # learning rate
'batch_size': 128,           # batch size
'decay_epochs': 2.4,         # epoch interval to decay LR
'warmup_epochs': 5,          # epochs to warm up LR
'decay_rate': 0.97,          # LR decay rate
'weight_decay': 1e-5,        # weight decay
'epochs': 600,               # number of epochs to train
'workers': 8,                # number of data processing processes
'amp_level': 'O0',           # amp level
'opt': 'rmsprop',            # optimizer
'num_classes': 1000,         # number of classes
'gp': 'avg',                 # type of global pool: "avg", "max", "avgmax", "avgmaxc"
'momentum': 0.9,             # optimizer momentum
'warmup_lr_init': 0.0001,    # initial warmup LR
'smoothing': 0.1,            # label smoothing factor
'bn_tf': False,              # use TensorFlow BatchNorm defaults
'keep_checkpoint_max': 10,   # max number of checkpoints to keep
'loss_scale': 1024,          # loss scale
'resume_start_epoch': 0,     # resume start epoch
```
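The `drop_connect` field above is the stochastic-depth ("drop connect") rate from the EfficientNet paper: during training, each sample's residual branch is dropped with this probability and the survivors are rescaled. A sketch of the operation, for illustration only (not the repo's code):

```python
import mindspore.ops as ops

def drop_connect(x, drop_rate=0.2, training=True):
    """Stochastic depth: zero a sample's whole residual branch with
    probability `drop_rate`, rescaling survivors by 1/keep_prob."""
    if not training or drop_rate == 0.0:
        return x
    keep_prob = 1.0 - drop_rate
    shape = (x.shape[0],) + (1,) * (len(x.shape) - 1)         # one draw per sample
    mask = ops.Floor()(keep_prob + ops.UniformReal()(shape))  # 1 with prob keep_prob
    return x / keep_prob * mask
```

Inside an MBConv block this wraps the residual branch, e.g. `out = identity + drop_connect(branch_out, drop_rate)`.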
Parameters for EfficientNet-B0 on CIFAR-10 (efficientnet_b0_cifar10_config.yaml):

```
'random_seed': 1,            # fix random seed
'model': 'efficientnet_b0',  # model name
'drop': 0.2,                 # dropout rate
'drop_connect': 0.2,         # drop connect rate
'opt_eps': 0.0001,           # optimizer epsilon
'lr': 0.0002,                # learning rate
'batch_size': 32,            # batch size
'decay_epochs': 2.4,         # epoch interval to decay LR
'warmup_epochs': 5,          # epochs to warm up LR
'decay_rate': 0.97,          # LR decay rate
'weight_decay': 1e-5,        # weight decay
'epochs': 150,               # number of epochs to train
'workers': 8,                # number of data processing processes
'amp_level': 'O0',           # amp level
'opt': 'rmsprop',            # optimizer
'num_classes': 10,           # number of classes
'gp': 'avg',                 # type of global pool: "avg", "max", "avgmax", "avgmaxc"
'momentum': 0.9,             # optimizer momentum
'warmup_lr_init': 0.0001,    # initial warmup LR
'smoothing': 0.1,            # label smoothing factor
'bn_tf': False,              # use TensorFlow BatchNorm defaults
'keep_checkpoint_max': 10,   # max number of checkpoints to keep
'loss_scale': 1024,          # loss scale
'resume_start_epoch': 0,     # resume start epoch
```
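Taken together, the `lr`, `warmup_lr_init`, `warmup_epochs`, `decay_epochs`, and `decay_rate` fields describe the usual EfficientNet schedule: linear warmup followed by staircase exponential decay. A sketch of how such a schedule can be computed from these fields (an illustration of their semantics, not necessarily the repo's exact implementation):

```python
def get_lr(global_step, steps_per_epoch, lr=0.064, warmup_lr_init=0.0001,
           warmup_epochs=5, decay_epochs=2.4, decay_rate=0.97):
    """Per-step LR: linear warmup, then decay by `decay_rate`
    every `decay_epochs` epochs (staircase)."""
    epoch = global_step / steps_per_epoch
    if epoch < warmup_epochs:
        # ramp linearly from warmup_lr_init up to lr
        return warmup_lr_init + (lr - warmup_lr_init) * epoch / warmup_epochs
    num_decays = (epoch - warmup_epochs) // decay_epochs
    return lr * decay_rate ** num_decays

# example: LR at epoch 100 with 1251 steps per epoch
# (ImageNet, batch 128 on 8 devices, matching the checkpoint name below)
print(get_lr(100 * 1251, 1251))
```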
```bash
# distributed training
bash run_distribute_train_gpu.sh [DEVICE_NUM] [VISIBLE_DEVICES(0,1,2,3,4,5,6,7)] [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [PRETRAINED_CKPT_PATH](optional)

# standalone training
bash run_standalone_train_gpu.sh [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [PRETRAINED_CKPT_PATH](optional)

# training on CPU
bash run_train_cpu.sh [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [PRETRAINED_CKPT_PATH](optional)
```

```bash
# distributed training example (8p) for GPU
cd scripts
bash run_distribute_train_gpu.sh 8 0,1,2,3,4,5,6,7 ImageNet ./dataset efficientnet_b0

# standalone training example for GPU
cd scripts
bash run_standalone_train_gpu.sh ImageNet ./dataset efficientnet_b0

# training example for CPU
cd scripts
bash run_train_cpu.sh ImageNet ./dataset efficientnet_b0
```
You can find the checkpoint files together with the results in the log.
```bash
bash run_eval_cpu.sh [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [CHECKPOINT_PATH]
bash run_eval_gpu.sh [DATASET_TYPE] [DATASET_PATH] [MODEL_NAME] [CHECKPOINT_PATH]
```
```bash
# evaluation with checkpoint for GPU
cd scripts
bash run_eval_gpu.sh ImageNet ./dataset efficientnet_b0 ./checkpoint/efficientnet_b0-600_1251.ckpt

# evaluation with checkpoint for CPU
cd scripts
bash run_eval_cpu.sh ImageNet ./dataset efficientnet_b0 ./checkpoint/efficientnet_b0-600_1251.ckpt
```
Evaluation results are stored under the scripts path; in the log you will find output like the following:

```
acc=76.96%(TOP1)
```
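Under the hood, eval.py presumably restores the checkpoint and runs the standard MindSpore evaluation flow. A minimal sketch of that flow; the `src.*` import names, constructor, and helper signatures below are assumptions about this repo's layout, not verified API:

```python
from mindspore.nn.metrics import Accuracy
from mindspore.train.model import Model
from mindspore.train.serialization import load_checkpoint, load_param_into_net

# assumed module/function names for this repo
from src.dataset import create_dataset
from src.efficientnet import efficientnet_b0
from src.loss import LabelSmoothingCrossEntropy

net = efficientnet_b0(num_classes=1000)
load_param_into_net(net, load_checkpoint("./checkpoint/efficientnet_b0-600_1251.ckpt"))

loss = LabelSmoothingCrossEntropy(smoothing=0.1, num_classes=1000)
val_dataset = create_dataset("./dataset/val", batch_size=128, training=False)

model = Model(net, loss_fn=loss, metrics={"acc": Accuracy()})
print(model.eval(val_dataset))   # e.g. {'acc': 0.7696} for Top-1
```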
Training performance on ImageNet:

| Parameters | efficientnet_b0 | efficientnet_b1 |
| --- | --- | --- |
| Resource | NV SMX2 V100-32G | GPU: Tesla V100-PCIE 32G; CPU: 2.70 GHz, 52 cores; RAM: 1.5 TB |
| Uploaded Date | 10/26/2020 | 10/22/2021 |
| MindSpore Version | 1.0.0 | 1.6.0.20211119 |
| Dataset | ImageNet | ImageNet |
| Training Parameters | src/config.py | src/config.py |
| Optimizer | rmsprop | rmsprop |
| Loss Function | LabelSmoothingCrossEntropy | LabelSmoothingCrossEntropy |
| Loss | 1.8886 | 1.8227 |
| Accuracy | 76.96% (Top-1) | 78.44% (Top-1) |
| Total time | 132 h (8p) | 164 h 28 m (8p: 252 ms/step; 1p: 239 ms/step) |
| Checkpoint for fine-tuning | 64 MB (.ckpt file) | 100 MB (.ckpt file) |
Inference performance on ImageNet:

| Parameters | efficientnet_b0 | efficientnet_b1 |
| --- | --- | --- |
| Resource | NV SMX2 V100-32G | RTX 3090 24G |
| Uploaded Date | 10/26/2020 | 10/22/2021 |
| MindSpore Version | 1.0.0 | 1.6.0.20211119 |
| Dataset | ImageNet | ImageNet |
| batch_size | 128 | 128 |
| outputs | probability | probability |
| Accuracy | 76.96% (Top-1) | 78.44% (Top-1) |
Training performance on CIFAR-10:

| Parameters | efficientnet_b0 |
| --- | --- |
| Resource | NV GTX 1080Ti-12G |
| Uploaded Date | 06/28/2021 |
| MindSpore Version | 1.3.0 |
| Dataset | CIFAR-10 |
| Training Parameters | src/config.py |
| Optimizer | rmsprop |
| Loss Function | LabelSmoothingCrossEntropy |
| Loss | 1.2773 |
| Accuracy | 97.75% (Top-1) |
| Total time | 2 h (4p) |
| Checkpoint for fine-tuning | 47 MB (.ckpt file) |
Inference performance on CIFAR-10:

| Parameters | efficientnet_b0 |
| --- | --- |
| Resource | NV GTX 1080Ti-12G |
| Uploaded Date | 06/28/2021 |
| MindSpore Version | 1.3.0 |
| Dataset | CIFAR-10 |
| batch_size | 128 |
| outputs | probability |
| Accuracy | 93.12% (Top-1) |
Please check the official homepage.
An EfficientNetV1 network implemented with MindSpore.