Paper: Barret Zoph, Vijay Vasudevan, Jonathon Shlens, Quoc V. Le. Learning Transferable Architectures for Scalable Image Recognition. 2017.
The overall network architecture of NASNet is shown in the original paper.
Dataset used: ImageNet
```
.
└─nasnet
  ├─README.md
  ├─README_CN.md
  ├─scripts
  │ ├─run_standalone_train_for_ascend.sh  # launch standalone training with Ascend platform (1p)
  │ ├─run_distribute_train_for_ascend.sh  # launch distributed training with Ascend platform (8p)
  │ ├─run_standalone_train_for_gpu.sh     # launch standalone training with GPU platform (1p)
  │ ├─run_distribute_train_for_gpu.sh     # launch distributed training with GPU platform (8p)
  │ ├─run_eval_for_ascend.sh              # launch evaluation with Ascend platform
  │ └─run_eval_for_gpu.sh                 # launch evaluation with GPU platform
  ├─src
  │ ├─config.py           # parameter configuration
  │ ├─dataset.py          # data preprocessing
  │ ├─loss.py             # customized CrossEntropy loss function
  │ ├─lr_generator.py     # learning rate generator
  │ └─nasnet_a_mobile.py  # network definition
  ├─eval.py               # evaluate the network
  ├─export.py             # convert the checkpoint for export
  └─train.py              # train the network
```
Parameters for both training and evaluation can be set in src/config.py.
Parameters for Ascend:

```python
'random_seed': 1,             # fix random seed
'rank': 0,                    # local rank of distributed training
'group_size': 1,              # world size of distributed training
'work_nums': 8,               # number of workers to read the data
'epoch_size': 600,            # total number of epochs
'keep_checkpoint_max': 30,    # max number of checkpoints to keep
'ckpt_path': './',            # path to save checkpoints
'is_save_on_master': 0,       # save checkpoints on rank 0 only (distributed)
'train_batch_size': 32,       # input batch size for training
'val_batch_size': 32,         # input batch size for validation
'image_size': 224,            # size of one input image
'num_classes': 1000,          # number of dataset classes
'label_smooth_factor': 0.1,   # label smoothing factor
'aux_factor': 0.4,            # loss weight of the auxiliary logits
'lr_init': 0.04*8,            # initial learning rate
'lr_decay_rate': 0.97,        # decay rate of the learning rate
'num_epoch_per_decay': 2.4,   # number of epochs per decay
'weight_decay': 0.00004,      # weight decay
'momentum': 0.9,              # momentum
'opt_eps': 1.0,               # optimizer epsilon
'rmsprop_decay': 0.9,         # rmsprop decay
'loss_scale': 1,              # loss scale
'cutout': True,               # whether to apply Cutout to the training data (see the sketch below)
'coutout_leng': 56,           # side length of the Cutout patch when cutout is True
```
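The 'cutout' options enable the Cutout augmentation (DeVries & Taylor, 2017), which zeroes out one random square patch of each training image. As a rough illustration only (the repository's actual implementation lives in src/dataset.py and may differ in details such as patch clipping):

```python
import numpy as np

def cutout(image, length=56):
    """Illustrative sketch of Cutout: zero out one random
    length x length patch of an HWC image."""
    h, w, _ = image.shape
    cy, cx = np.random.randint(h), np.random.randint(w)
    y1, y2 = max(cy - length // 2, 0), min(cy + length // 2, h)
    x1, x2 = max(cx - length // 2, 0), min(cx + length // 2, w)
    out = image.copy()
    out[y1:y2, x1:x2, :] = 0  # patches crossing the border are clipped
    return out
```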
Parameters for GPU:

```python
'random_seed': 1,             # fix random seed
'rank': 0,                    # local rank of distributed training
'group_size': 1,              # world size of distributed training
'work_nums': 8,               # number of workers to read the data
'epoch_size': 600,            # total number of epochs
'keep_checkpoint_max': 100,   # max number of checkpoints to keep
'ckpt_path': './checkpoint/', # path to save checkpoints
'is_save_on_master': 0,       # save checkpoints on rank 0 only (distributed)
'train_batch_size': 32,       # input batch size for training
'val_batch_size': 32,         # input batch size for validation
'image_size': 224,            # size of one input image
'num_classes': 1000,          # number of dataset classes
'label_smooth_factor': 0.1,   # label smoothing factor
'aux_factor': 0.4,            # loss weight of the auxiliary logits
'lr_init': 0.04*8,            # initial learning rate
'lr_decay_rate': 0.97,        # decay rate of the learning rate
'num_epoch_per_decay': 2.4,   # number of epochs per decay
'weight_decay': 0.00004,      # weight decay
'momentum': 0.9,              # momentum
'opt_eps': 1.0,               # optimizer epsilon
'rmsprop_decay': 0.9,         # rmsprop decay
'loss_scale': 1,              # loss scale
'cutout': False,              # whether to apply Cutout to the training data
'coutout_leng': 56,           # side length of the Cutout patch when cutout is True
```
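Together, lr_init, lr_decay_rate, and num_epoch_per_decay describe an exponentially decaying learning-rate schedule. A minimal sketch of the schedule presumably built by src/lr_generator.py (assuming smooth per-step decay; the actual generator may use staircase decay instead):

```python
import numpy as np

def generate_exp_decay_lr(lr_init, lr_decay_rate, num_epoch_per_decay,
                          total_epochs, steps_per_epoch):
    """Per-step exponential decay:
    lr(step) = lr_init * lr_decay_rate ** (step / decay_steps)."""
    decay_steps = steps_per_epoch * num_epoch_per_decay
    total_steps = int(steps_per_epoch * total_epochs)
    steps = np.arange(total_steps, dtype=np.float64)
    return (lr_init * lr_decay_rate ** (steps / decay_steps)).astype(np.float32)

# With the values above (steps_per_epoch depends on dataset size and
# global batch size):
# lr = generate_exp_decay_lr(0.04 * 8, 0.97, 2.4, 600, steps_per_epoch)
```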
To train the model, run the corresponding launch script.

Ascend:

```bash
# distributed training example (8p)
bash run_distribute_train_for_ascend.sh DATA_DIR
# standalone training
bash run_standalone_train_for_ascend.sh DEVICE_ID DATA_DIR
```

GPU:

```bash
# distributed training example (8p)
bash run_distribute_train_for_gpu.sh DATA_DIR
# standalone training
bash run_standalone_train_for_gpu.sh DEVICE_ID DATA_DIR
```
For example:

```bash
# distributed training example (8p) for Ascend
bash scripts/run_distribute_train_for_ascend.sh /dataset
# standalone training example for Ascend
bash scripts/run_standalone_train_for_ascend.sh 0 /dataset
# distributed training example (8p) for GPU
bash scripts/run_distribute_train_for_gpu.sh /dataset/train
# standalone training example for GPU
bash scripts/run_standalone_train_for_gpu.sh 0 /dataset/train
```
You can find the checkpoint files together with the results in the log.
To evaluate a trained checkpoint, run the evaluation script for your platform:

```bash
# Evaluation
bash run_eval_for_ascend.sh DEVICE_ID DATA_DIR PATH_CHECKPOINT
bash run_eval_for_gpu.sh DEVICE_ID DATA_DIR PATH_CHECKPOINT
```
```bash
# Evaluation with checkpoint
bash scripts/run_eval_for_ascend.sh 0 /dataset ./checkpoint/nasnet-a-mobile-rank0-248_10009.ckpt
bash scripts/run_eval_for_gpu.sh 0 /dataset/val ./checkpoint/nasnet-a-mobile-rank0-248_10009.ckpt
```
The evaluation result will be stored under the scripts path, where you can find results like the following in the log:

```
acc=74.0% (TOP1, Ascend)
acc=73.5% (TOP1, GPU)
```
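The TOP1 figures are plain top-1 accuracy over the validation set. For reference, a minimal NumPy sketch of the metric (eval.py presumably uses MindSpore's built-in accuracy metric rather than this):

```python
import numpy as np

def top1_accuracy(logits, labels):
    """Fraction of samples whose argmax prediction equals the label."""
    return float(np.mean(np.argmax(logits, axis=1) == labels))
```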
Training performance:

Parameters | Ascend 910 | GPU |
---|---|---|
Model Version | NASNet | NASNet |
Resource | Ascend 910 | NV SXM2 V100-32G |
Uploaded Date | 09/08/2021 (month/day/year) | 09/24/2020 (month/day/year) |
MindSpore Version | 1.2.0 | 1.0.0 |
Dataset | ImageNet | ImageNet |
Training Parameters | src/config.py | src/config.py |
Optimizer | RMSProp | RMSProp |
Loss Function | SoftmaxCrossEntropyWithLogits | SoftmaxCrossEntropyWithLogits |
Loss | 1.9598 | 1.8965 |
Total time | 564 h (8p) | 144 h (8p) |
Checkpoint for Fine tuning | 89 M (.ckpt file) | 89 M (.ckpt file) |
Evaluation performance:

Parameters | Ascend 910 | GPU |
---|---|---|
Model Version | NASNet | NASNet |
Resource | Ascend 910 | NV SXM2 V100-32G |
Uploaded Date | 09/08/2021 (month/day/year) | 09/24/2020 (month/day/year) |
MindSpore Version | 1.2.0 | 1.0.0 |
Dataset | ImageNet | ImageNet |
batch_size | 32 | 32 |
outputs | probability | probability |
Accuracy | 74.0% (TOP1) | 73.5% (TOP1) |
For more details, please check the official ModelZoo homepage.