!19341 ghostnet

Merge pull request !19341 from despicablemme/ghost
2021-07-15 07:52:49 +00:00 · 2021-07-15 07:52:49 +00:00 · 815760cd35
parent 5ee72fff99 ab3796931f
commit 815760cd35
14 changed files with 814 additions and 336 deletions
--- a/model_zoo/research/cv/ghostnet/README_CN.md
+++ b/model_zoo/research/cv/ghostnet/README_CN.md
@ -0,0 +1,256 @@
+# 目录
+
+<!-- TOC -->
+
+- [目录](#目录)
+    - [概述](#概述)
+    - [论文](#论文)
+- [模型架构](#模型架构)
+- [数据集](#数据集)
+- [环境要求](#环境要求)
+- [快速入门](#快速入门)
+- [脚本说明](#脚本说明)
+    - [脚本结构与说明](#脚本结构与说明)
+- [脚本参数](#脚本参数)
+- [训练过程](#训练过程)
+    - [用法](#用法)
+        - [Ascend处理器环境运行](#Ascend处理器环境运行)
+    - [结果](#结果)
+- [评估过程](#评估过程)
+    - [用法](#用法-1)
+        - [Ascend处理器环境运行](#Ascend处理器环境运行-1)
+    - [结果](#结果-1)
+- [推理过程](#推理过程)
+    - [导出MindIR](#导出MindIR)
+    - [结果](#结果)
+- [模型描述](#模型描述)
+    - [性能](#性能)
+        - [评估性能](#评估性能)
+- [随机情况说明](#随机情况说明)
+- [ModelZoo主页](#modelzoo主页)
+
+<!-- /TOC -->
+
+# GhostNet描述
+
+## 概述
+
+GhostNet由华为诺亚方舟实验室在2020年提出，此网络提供了一个全新的Ghost模块，旨在通过廉价操作生成更多的特征图。基于一组原始的特征图，作者应用一系列线性变换，以很小的代价生成许多能从原始特征发掘所需信息的“幻影”特征图（Ghost feature maps）。该Ghost模块即插即用，通过堆叠Ghost模块得出Ghost bottleneck，进而搭建轻量级神经网络——GhostNet。该架构可以在同样精度下，速度和计算量均少于SOTA算法。
+
+如下为MindSpore使用ImageNet2012数据集对GhostNet进行训练的示例。
+
+## 论文
+
+1. [论文](https://arxiv.org/pdf/1911.11907.pdf): Kai Han, Yunhe Wang, Qi Tian."GhostNet: More Features From Cheap Operations"
+
+# 模型架构
+
+GhostNet的总体网络架构如下：[链接](https://arxiv.org/pdf/1911.11907.pdf)
+
+# 数据集
+
+使用的数据集：[ImageNet2012](http://www.image-net.org/)
+
+- 数据集大小：共1000个类、224*224彩色图像
+    - 训练集：共1,281,167张图像
+    - 测试集：共50,000张图像
+- 数据格式：JPEG
+    - 注：数据在dataset.py中处理。
+- 下载数据集，目录结构如下：
+
+```text
+└─dataset
+    ├─ilsvrc                  # 训练数据集
+    └─validation_preprocess   # 评估数据集
+```
+
+# 环境要求
+
+- 硬件
+    - 准备Ascend处理器搭建硬件环境。如需试用昇腾处理器，请发送[申请表](https://obs-9be7.obs.cn-east-2.myhuaweicloud.com/file/other/Ascend%20Model%20Zoo%E4%BD%93%E9%AA%8C%E8%B5%84%E6%BA%90%E7%94%B3%E8%AF%B7%E8%A1%A8.docx)至ascend@huawei.com，审核通过即可获得资源。
+- 框架
+    - [MindSpore](https://www.mindspore.cn/install/en)
+- 如需查看详情，请参见如下资源：
+    - [MindSpore教程](https://www.mindspore.cn/tutorial/training/zh-CN/master/index.html)
+    - [MindSpore Python API](https://www.mindspore.cn/doc/api_python/zh-CN/master/index.html)
+
+# 快速入门
+
+通过官方网站安装MindSpore后，您可以按照如下步骤进行训练和评估：
+
+- Ascend处理器环境运行
+
+```Shell
+# 分布式训练
+用法：sh run_distribute_train.sh [RANK_TABLE_FILE] [DATASET_PATH] [PRETRAINED_CKPT_PATH]（可选）
+
+# 单机训练
+用法：sh run_standalone_train.sh [DATASET_PATH] [PRETRAINED_CKPT_PATH]（可选）
+
+# 运行评估示例
+用法：sh run_eval.sh [DATASET_PATH] [CHECKPOINT_PATH]
+```
+
+# 脚本说明
+
+## 脚本结构与说明
+
+```text
+└──ghostnet
+  ├── README.md
+  ├── scripts
+    ├── run_distribute_train.sh            # 启动Ascend分布式训练（8卡）
+    ├── run_eval.sh                        # 启动Ascend评估
+    └── run_standalone_train.sh            # 启动Ascend单机训练（单卡）
+  ├── src
+    ├── config.py                          # 参数配置
+    ├── dataset.py                         # 数据预处理
+    ├── CrossEntropySmooth.py              # ImageNet2012数据集的损失定义
+    ├── lr_generator.py                    # 生成每个步骤的学习率
+    └── ghostnet.py                          # ghostnet网络
+  ├── eval.py                              # 评估网络
+  └── train.py                             # 训练网络
+```
+
+# 脚本参数
+
+在config.py中可以同时配置训练参数和评估参数。
+
+- 配置GhostNet和ImageNet2012数据集。
+
+```Python
+"num_classes": 1000,           # 数据集类数
+"batch_size": 128,             # 输入张量的批次大小
+"epoch_size": 500,             # 训练周期大小
+"warmup_epochs": 20,           # 热身周期数
+"lr_init": 0.1,                # 基础学习率
+"lr_max": 0.4,                 # 最大学习率
+'lr_end': 1e-6,                # 最终学习率
+'lr_decay_mode': 'cosine',     # 用于生成学习率的衰减模式
+"momentum": 0.9,               # 动量优化器
+"weight_decay": 4e-5,          # 权重衰减
+"label_smooth": 0.1,           # 标签平滑因子
+"loss_scale": 128,             # 损失等级
+"use_label_smooth": True,      # 标签平滑
+"label_smooth_factor": 0.1,    # 标签平滑因子
+"save_checkpoint": True,       # 是否保存检查点
+"save_checkpoint_epochs": 20,  # 两个检查点之间的周期间隔；默认情况下，最后一个检查点将在最后一个周期完成后保存
+"keep_checkpoint_max": 10,     # 只保存最后一个keep_checkpoint_max检查点
+"save_checkpoint_path": "./",  # 检查点相对于执行路径的保存路径
+```
+
+# 训练过程
+
+## 用法
+
+### Ascend处理器环境运行
+
+```Shell
+# 分布式训练
+用法：sh run_distribute_train.sh [RANK_TABLE_FILE] [DATASET_PATH] [PRETRAINED_CKPT_PATH]（可选）
+
+# 单机训练
+用法：sh run_standalone_train.sh [DATASET_PATH] [PRETRAINED_CKPT_PATH]（可选）
+
+```
+
+分布式训练需要提前创建JSON格式的HCCL配置文件。
+
+具体操作，参见[hccn_tools](https://gitee.com/mindspore/mindspore/tree/master/model_zoo/utils/hccl_tools)中的说明。
+
+训练结果保存在示例路径中，文件夹名称以“train”或“train_parallel”开头。您可在此路径下的日志中找到检查点文件以及结果，如下所示。
+
+## 结果
+
+- 使用ImageNet2012数据集训练GhostNet
+
+```text
+# 分布式训练结果（8P）
+epoch: 1 step: 1251, loss is 5.001419
+epoch time: 457012.100 ms, per step time: 365.317 ms
+epoch: 2 step: 1251, loss is 4.275552
+epoch time: 280175.784 ms, per step time: 223.961 ms
+epoch: 3 step: 1251, loss is 4.0788813
+epoch time: 280134.943 ms, per step time: 223.929 ms
+epoch: 4 step: 1251, loss is 4.0310946
+epoch time: 280161.342 ms, per step time: 223.950 ms
+epoch: 5 step: 1251, loss is 3.7326777
+epoch time: 280178.602 ms, per step time: 223.964 ms
+...
+```
+
+# 评估过程
+
+## 用法
+
+### Ascend处理器环境运行
+
+```Shell
+# 评估
+Usage: sh run_eval.sh [DATASET_PATH] [CHECKPOINT_PATH]
+```
+
+```Shell
+# 评估示例
+sh  run_eval.sh  /data/dataset/ImageNet/imagenet_original  ghostnet-500_1251.ckpt
+```
+
+训练过程中可以生成检查点。
+
+## 结果
+
+评估结果保存在示例路径中，文件夹名为“eval”。您可在此路径下的日志找到如下结果：
+
+- 使用ImageNet2012数据集评估GhostNet
+
+```text
+result: {'top_5_accuracy': 0.9162371134020618, 'top_1_accuracy': 0.739368556701031}
+ckpt = /home/lzu/ghost_Mindspore/scripts/device0/ghostnet-500_1251.ckpt
+```
+
+# 推理过程
+
+## [导出MindIR](#contents)
+
+```shell
+python export.py --ckpt_file [CKPT_PATH] --file_name [FILE_NAME] --file_format [FILE_FORMAT]
+```
+
+参数ckpt_file为必填项，
+`EXPORT_FORMAT` 必须在 ["AIR", "MINDIR"]中选择。
+
+## 结果
+
+导出“.mindir”文件可在当前目录查看
+
+# 模型描述
+
+## 性能
+
+### 评估性能
+
+| 参数 | Ascend 910  |
+|---|---|
+| 模型版本  | GhostNet |
+| 资源  |  Ascend 910；CPU：2.60GHz，192核；内存：755G |
+| 上传日期  |2021-06-22 ;  |
+| MindSpore版本  | 1.0.1 |
+| 数据集  |  ImageNet2012 |
+| 训练参数  | epoch=500, steps per epoch=1251, batch_size = 128  |
+| 优化器  | Momentum  |
+| 损失函数  |Softmax交叉熵  |
+| 输出  | 概率 |
+|  损失 | 1.7887309  |
+|速度|223.92毫秒/步（8卡） |
+|总时长   |  39小时 |
+|参数(M)   | 5.18 |
+|  微调检查点 | 42.05M（.ckpt文件）  |
+| 脚本  | [链接](https://gitee.com/alreadyhad/mindspore/tree/master/model_zoo/research/cv/ghostnet)  |
+
+# 随机情况说明
+
+dataset.py中设置了“create_dataset”函数内的种子，同时还使用了train.py中的随机种子。
+
+# ModelZoo主页
+
+请浏览官网[主页](https://gitee.com/mindspore/mindspore/tree/master/model_zoo)。
--- a/model_zoo/research/cv/ghostnet/Readme.md
+++ b/model_zoo/research/cv/ghostnet/Readme.md
@ -1,145 +0,0 @@
-# Contents
-
- [GhostNet Description](#ghostnet-description)
- [Model Architecture](#model-architecture)
- [Dataset](#dataset)
- [Environment Requirements](#environment-requirements)
- [Script Description](#script-description)
-    - [Script and Sample Code](#script-and-sample-code)
-        - [Training Process](#training-process)
-        - [Evaluation Process](#evaluation-process)
-            - [Evaluation](#evaluation)
- [Model Description](#model-description)
-    - [Performance](#performance)  
-        - [Training Performance](#evaluation-performance)
-        - [Inference Performance](#evaluation-performance)
- [Description of Random Situation](#description-of-random-situation)
- [ModelZoo Homepage](#modelzoo-homepage)
-
-## [GhostNet Description](#contents)
-
-The GhostNet architecture is based on an Ghost module structure which generate more features from cheap operations. Based on a set of intrinsic feature maps, a series of cheap operations are applied to generate many ghost feature maps that could fully reveal information underlying intrinsic features.
-
-[Paper](https://openaccess.thecvf.com/content_CVPR_2020/papers/Han_GhostNet_More_Features_From_Cheap_Operations_CVPR_2020_paper.pdf): Kai Han, Yunhe Wang, Qi Tian, Jianyuan Guo, Chunjing Xu, Chang Xu. GhostNet: More Features from Cheap Operations. CVPR 2020.
-
-## [Model architecture](#contents)
-
-The overall network architecture of GhostNet is show below:
-
-[Link](https://openaccess.thecvf.com/content_CVPR_2020/papers/Han_GhostNet_More_Features_From_Cheap_Operations_CVPR_2020_paper.pdf)
-
-## [Dataset](#contents)
-
-Dataset used: [Oxford-IIIT Pet](https://www.robots.ox.ac.uk/~vgg/data/pets/)
-
- Dataset size: 7049 colorful images in 1000 classes
-    - Train:  3680 images
-    - Test: 3369 images
- Data format: RGB images.
-    - Note: Data will be processed in src/dataset.py
-
-## [Environment Requirements](#contents)
-
- Hardware（Ascend/GPU)
-    - Prepare hardware environment with Ascend or GPU.
- Framework
-    - [MindSpore](https://www.mindspore.cn/install/en)
- For more information, please check the resources below：
-    - [MindSpore Tutorials](https://www.mindspore.cn/tutorials/en/master/index.html)
-    - [MindSpore Python API](https://www.mindspore.cn/docs/api/en/master/index.html)
-
-## [Script description](#contents)
-
-### [Script and sample code](#contents)
-
-```python
-├── GhostNet
-  ├── Readme.md     # descriptions about ghostnet   # shell script for evaluation with CPU, GPU or Ascend
-  ├── src
-  │   ├──config.py      # parameter configuration
-  │   ├──dataset.py     # creating dataset
-  │   ├──launch.py      # start python script
-  │   ├──lr_generator.py     # learning rate config
-  │   ├──ghostnet.py      # GhostNet architecture
-  │   ├──ghostnet600.py      # GhostNet-600M architecture
-  ├── eval.py       # evaluation script
-  ├── mindspore_hub_conf.py       # export model for hub
-```
-
-## [Training process](#contents)
-
-To Be Done
-
-## [Eval process](#contents)
-
-### Usage
-
-After installing MindSpore via the official website, you can start evaluation as follows:
-
-### Launch
-
-```bash
-# infer example
-
-  Ascend: python eval.py --model [ghostnet/ghostnet-600] --dataset_path ~/Pets/test.mindrecord --platform Ascend --checkpoint_path [CHECKPOINT_PATH]
-  GPU: python eval.py --model [ghostnet/ghostnet-600] --dataset_path ~/Pets/test.mindrecord --platform GPU --checkpoint_path [CHECKPOINT_PATH]
-```
-
-> checkpoint can be produced in training process.
-
-### Result
-
-```bash
-result: {'acc': 0.8113927500681385} ckpt= ./ghostnet_nose_1x_pets.ckpt
-result: {'acc': 0.824475333878441} ckpt= ./ghostnet_1x_pets.ckpt
-result: {'acc': 0.8691741618969746} ckpt= ./ghostnet600M_pets.ckpt
-```
-
-## [Model Description](#contents)
-
-### [Performance](#contents)
-
-#### Evaluation Performance
-
-##### GhostNet on ImageNet2012
-
-| Parameters                 |                                        |   |
-| -------------------------- | -------------------------------------- |---------------------------------- |
-| Model Version              | GhostNet                                             |GhostNet-600|
-| uploaded Date              | 09/08/2020 (month/day/year)  ；                        | 09/08/2020 (month/day/year) |
-| MindSpore Version          | 0.6.0-alpha                                                       |0.6.0-alpha   |
-| Dataset                    | ImageNet2012                                                    | ImageNet2012|
-| Parameters (M)             | 5.2                                                   | 11.9 |
-| FLOPs (M) | 142 | 591 |
-| Accuracy (Top1) | 73.9 |80.2   |
-
-###### GhostNet on Oxford-IIIT Pet
-
-| Parameters                 |                                        |   |
-| -------------------------- | -------------------------------------- |---------------------------------- |
-| Model Version              | GhostNet                                             |GhostNet-600|
-| uploaded Date              | 09/08/2020 (month/day/year)  ；                        | 09/08/2020 (month/day/year) |
-| MindSpore Version          | 0.6.0-alpha                                                       |0.6.0-alpha   |
-| Dataset                    | Oxford-IIIT Pet                                                   | Oxford-IIIT Pet|
-| Parameters (M)             | 3.9                                                    | 10.6 |
-| FLOPs (M) | 140 | 590 |
-| Accuracy (Top1) |            82.4              |86.9   |
-
-###### Comparison with other methods on Oxford-IIIT Pet
-
-|Model|FLOPs (M)|Latency (ms)*|Accuracy (Top1)|
-|-|-|-|-|
-|MobileNetV2-1x|300|28.2|78.5|
-|Ghost-1x w\o SE|138|19.1|81.1|
-|Ghost-1x|140|25.3|82.4|
-|Ghost-600|590|-|86.9|
-
-*The latency is measured on Huawei Kirin 990 chip under single-threaded mode with batch size 1.
-
-## [Description of Random Situation](#contents)
-
-In dataset.py, we set the seed inside “create_dataset" function. We also use random seed in train.py.
-
-## [ModelZoo Homepage](#contents)
-
-Please check the official [homepage](https://gitee.com/mindspore/mindspore/tree/master/model_zoo).
--- a/model_zoo/research/cv/ghostnet/eval.py
+++ b/model_zoo/research/cv/ghostnet/eval.py
@ -1,4 +1,4 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
+# Copyright 2021 Huawei Technologies Co., Ltd
 #
 # Licensed under the Apache License, Version 2.0 (the "License");
 # you may not use this file except in compliance with the License.
@ -21,56 +21,25 @@ from mindspore import context
 from mindspore import nn
 from mindspore.train.model import Model
 from mindspore.train.serialization import load_checkpoint, load_param_into_net
-from mindspore.common import dtype as mstype
 from src.dataset import create_dataset
-from src.config import config_ascend, config_gpu
-from src.ghostnet import ghostnet_1x, ghostnet_nose_1x
-from src.ghostnet600 import ghostnet_600m
-
+from src.ghostnet import ghostnet_1x

 parser = argparse.ArgumentParser(description='Image classification')
 parser.add_argument('--checkpoint_path', type=str, default=None, help='Checkpoint file path')
-parser.add_argument('--dataset_path', type=str, default=None, help='Dataset path')
-parser.add_argument('--platform', type=str, default=None, help='run platform')
-parser.add_argument('--model', type=str, default=None, help='ghostnet')
+parser.add_argument('--data_url', type=str, default=None, help='Dataset path')
 args_opt = parser.parse_args()


 if __name__ == '__main__':
-    config_platform = None
-    if args_opt.platform == "Ascend":
-        config_platform = config_ascend
-        device_id = int(os.getenv('DEVICE_ID'))
-        context.set_context(mode=context.GRAPH_MODE, device_target="Ascend",
-                            device_id=device_id, save_graphs=False)
-    elif args_opt.platform == "GPU":
-        config_platform = config_gpu
-        context.set_context(mode=context.GRAPH_MODE,
-                            device_target="GPU", save_graphs=False)
-    else:
-        raise ValueError("Unsupported platform.")
+    device_id = int(os.getenv('DEVICE_ID'))
+    context.set_context(mode=context.GRAPH_MODE, device_target="Ascend",
+                        device_id=device_id, save_graphs=False)

    loss = nn.SoftmaxCrossEntropyWithLogits(sparse=True, reduction='mean')

-    if args_opt.model == 'ghostnet':
-        net = ghostnet_1x(num_classes=config_platform.num_classes)
-    elif args_opt.model == 'ghostnet_nose':
-        net = ghostnet_nose_1x(num_classes=config_platform.num_classes)
-    elif args_opt.model == 'ghostnet-600':
-        net = ghostnet_600m(num_classes=config_platform.num_classes)
+    net = ghostnet_1x()

-    if args_opt.platform == "Ascend":
-        net.to_float(mstype.float16)
-        for _, cell in net.cells_and_names():
-            if isinstance(cell, nn.Dense):
-                cell.to_float(mstype.float32)
-
-    dataset = create_dataset(dataset_path=args_opt.dataset_path,
-                             do_train=False,
-                             config=config_platform,
-                             platform=args_opt.platform,
-                             batch_size=config_platform.batch_size,
-                             model=args_opt.model)
+    dataset = create_dataset(dataset_path=args_opt.data_url, do_train=False)
    step_size = dataset.get_dataset_size()

    if args_opt.checkpoint_path:
@ -78,6 +47,6 @@ if __name__ == '__main__':
        load_param_into_net(net, param_dict)
    net.set_train(False)

-    model = Model(net, loss_fn=loss, metrics={'acc'})
+    model = Model(net, loss_fn=loss, metrics={'top_1_accuracy', 'top_5_accuracy'})
    res = model.eval(dataset)
    print("result:", res, "ckpt=", args_opt.checkpoint_path)
--- a/model_zoo/research/cv/ghostnet/export.py
+++ b/model_zoo/research/cv/ghostnet/export.py
@ -0,0 +1,43 @@
+# Copyright 2021 Huawei Technologies Co., Ltd
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ============================================================================
+""" export MINDIR """
+import argparse as arg
+import numpy as np
+import mindspore as ms
+from mindspore import context, Tensor, export, load_checkpoint
+from src.ghostnet import ghostnet_1x
+from src.config import config
+
+
+if __name__ == '__main__':
+    parser = arg.ArgumentParser(description='SID export')
+    parser.add_argument('--device_target', type=str, choices=['Ascend', 'GPU', 'CPU'], default='Ascend',
+                        help='device where the code will be implemented')
+    parser.add_argument('--device_id', type=int, default=0, help='device id')
+    parser.add_argument('--file_format', type=str, choices=['AIR', 'MINDIR'], default='MINDIR',
+                        help='file format')
+    parser.add_argument('--checkpoint_path', required=True, default=None, help='ckpt file path')
+    args = parser.parse_args()
+    context.set_context(mode=context.GRAPH_MODE, device_target=args.device_target)
+    if args.device_target == 'Ascend':
+        context.set_context(device_id=args.device_id)
+
+    ckpt_dir = args.checkpoint_path
+    net = ghostnet_1x(num_classes=config.num_classes)
+    load_checkpoint(ckpt_dir, net=net)
+    net.set_train(False)
+
+    input_data = Tensor(np.zeros([1, 3, 224, 224]), ms.float32)
+    export(net, input_data, file_name='ghost', file_format=args.file_format)
--- a/model_zoo/research/cv/ghostnet/mindpsore_hub_conf.py
+++ b/model_zoo/research/cv/ghostnet/mindpsore_hub_conf.py
@ -1,27 +0,0 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
-#
-# Licensed under the Apache License, Version 2.0 (the "License");
-# you may not use this file except in compliance with the License.
-# You may obtain a copy of the License at
-#
-# http://www.apache.org/licenses/LICENSE-2.0
-#
-# Unless required by applicable law or agreed to in writing, software
-# distributed under the License is distributed on an "AS IS" BASIS,
-# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-# See the License for the specific language governing permissions and
-# limitations under the License.
-# ============================================================================
-"""hub config."""
-from src.ghostnet import ghostnet_1x, ghostnet_nose_1x
-from src.ghostnet600 import ghostnet_600m
-
-
-def create_network(name, *args, **kwargs):
-    if name == 'ghostnet':
-        return ghostnet_1x(*args, **kwargs)
-    if name == 'ghostnet_nose':
-        return ghostnet_nose_1x(*args, **kwargs)
-    if name == 'ghostnet-600':
-        return ghostnet_600m(*args, **kwargs)
-    raise NotImplementedError(f"{name} is not implemented in the repo")
--- a/model_zoo/research/cv/ghostnet/scripts/run_distribute_train.sh
+++ b/model_zoo/research/cv/ghostnet/scripts/run_distribute_train.sh
@ -0,0 +1,90 @@
+#!/bin/bash
+# Copyright 2021 Huawei Technologies Co., Ltd
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ============================================================================
+echo "=============================================================================================================="
+echo "Please run the script as: "
+echo "bash run_distribute_train.sh RANK_TABLE_FILE DATA_PATH PRETRAINED_CKPT_PATH](optional)"
+echo "For example: bash run_distribute_train.sh hccl_8p_01234567_127.0.0.1.json /path/dataset"
+echo "It is better to use the absolute path."
+echo "=============================================================================================================="
+
+get_real_path(){
+  if [ "${1:0:1}" == "/" ]; then
+    echo "$1"
+  else
+    echo "$(realpath -m $PWD/$1)"
+  fi
+}
+
+PATH1=$(get_real_path $1)
+PATH2=$(get_real_path $2)
+
+if [ $# == 3 ]
+then 
+    PATH3=$(get_real_path $3)
+fi
+
+if [ ! -f $PATH1 ]
+then 
+    echo "error: RANK_TABLE_FILE=$PATH1 is not a file"
+exit 1
+fi 
+
+if [ ! -d $PATH2 ]
+then 
+    echo "error: DATA_PATH=$PATH2 is not a directory"
+exit 1
+fi 
+
+if [ $# == 3 ] && [ ! -f $PATH3 ]
+then
+    echo "error: PRETRAINED_CKPT_PATH=$PATH3 is not a file"
+exit 1
+fi
+
+ulimit -u unlimited
+export DEVICE_NUM=8
+export RANK_SIZE=8
+export RANK_TABLE_FILE=$PATH1
+export MINDSPORE_HCCL_CONFIG_PATH=$PATH1
+
+DATA_PATH=$2
+export DATA_PATH=${DATA_PATH}
+
+for((i=0;i<${RANK_SIZE};i++))
+do
+    rm -rf device$i
+    mkdir device$i
+    cp ../*.py ./device$i
+    cp *.sh ./device$i
+    cp -r ../src ./device$i
+    cd ./device$i
+    export DEVICE_ID=$i
+    export RANK_ID=$i
+    echo "start training for device $i"
+    env > env$i.log
+
+    if [ $# == 2 ]
+    then
+        python train.py --run_distribute=True  --data_url=$PATH2   &> train.log &
+    fi
+    
+    if [ $# == 3 ]
+    then
+        python train.py --run_distribute=True  --data_url=$PATH2 --pre_trained=$PATH3 &> train.log &
+    fi
+
+    cd ../
+done
--- a/model_zoo/research/cv/ghostnet/scripts/run_eval.sh
+++ b/model_zoo/research/cv/ghostnet/scripts/run_eval.sh
@ -0,0 +1,64 @@
+#!/bin/bash
+# Copyright 2021 Huawei Technologies Co., Ltd
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ============================================================================
+echo "=============================================================================================================="
+echo "Please run the script as: "
+echo "bash run_eval.sh DATA_PATH CHECKPOINT_PATH "
+echo "For example: bash run.sh /path/dataset ghostnet-500_1251.ckpt"
+echo "It is better to use the absolute path."
+echo "=============================================================================================================="
+
+get_real_path(){
+  if [ "${1:0:1}" == "/" ]; then
+    echo "$1"
+  else
+    echo "$(realpath -m $PWD/$1)"
+  fi
+}
+
+PATH1=$(get_real_path $1)
+PATH2=$(get_real_path $2)
+
+if [ ! -d $PATH1 ]
+then
+    echo "error: DATASET_PATH=$PATH1 is not a directory"
+exit 1
+fi
+
+if [ ! -f $PATH2 ]
+then
+    echo "error: CHECKPOINT_PATH=$PATH2 is not a file"
+exit 1
+fi 
+
+ulimit -u unlimited
+export DEVICE_NUM=1
+export DEVICE_ID=0
+export RANK_SIZE=$DEVICE_NUM
+export RANK_ID=0
+
+if [ -d "eval" ];
+then
+    rm -rf ./eval
+fi
+mkdir ./eval
+cp ../*.py ./eval
+cp *.sh ./eval
+cp -r ../src ./eval
+cd ./eval 
+env > env.log
+echo "start evaluation for device $DEVICE_ID"
+python eval.py --data_url=$PATH1 --checkpoint_path=$PATH2 &> eval.log &
+cd ..
--- a/model_zoo/research/cv/ghostnet/scripts/run_standalone_train.sh
+++ b/model_zoo/research/cv/ghostnet/scripts/run_standalone_train.sh
@ -0,0 +1,77 @@
+#!/bin/bash
+# Copyright 2021 Huawei Technologies Co., Ltd
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ============================================================================
+echo "=============================================================================================================="
+echo "Please run the script as: "
+echo "bash run_standalone_train.sh DATA_PATH PRETRAINED_CKPT_PATH(optional)"
+echo "For example: bash run_standalone_train.sh /path/dataset"
+echo "It is better to use the absolute path."
+echo "=============================================================================================================="
+
+get_real_path(){
+  if [ "${1:0:1}" == "/" ]; then
+    echo "$1"
+  else
+    echo "$(realpath -m $PWD/$1)"
+  fi
+}
+
+PATH1=$(get_real_path $1)
+if [ $# == 2 ]
+then
+    PATH2=$(get_real_path $2)
+fi
+
+if [ ! -d $PATH1 ]
+then 
+    echo "error: DATASET_PATH=$PATH1 is not a directory"
+exit 1
+fi
+
+if [ $# == 2 ] && [ ! -f $PATH2 ]
+then
+    echo "error: PRETRAINED_CKPT_PATH=$PATH2 is not a file"
+exit 1
+fi
+
+ulimit -u unlimited
+export DEVICE_NUM=1
+export DEVICE_ID=0
+export RANK_SIZE=$DEVICE_NUM
+export RANK_ID=0
+
+if [ -d "train" ];
+then
+    rm -rf ./train
+fi
+mkdir ./train
+cp ../*.py ./train
+cp *.sh ./train
+cp -r ../src ./train
+cd ./train 
+echo "start training for device $DEVICE_ID"
+env > env.log
+if [ $# == 1 ]
+then
+    python train.py  --data_url=$PATH1 &> train.log &
+fi
+
+if [ $# == 2 ]
+then
+    python train.py  --data_url=$PATH1 --pre_trained=$PATH2 &> train.log &
+fi
+cd ..
+
+
--- a/model_zoo/research/cv/ghostnet/src/CrossEntropySmooth.py
+++ b/model_zoo/research/cv/ghostnet/src/CrossEntropySmooth.py
@ -0,0 +1,38 @@
+# Copyright 2021 Huawei Technologies Co., Ltd
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ============================================================================
+"""define loss function for network"""
+import mindspore.nn as nn
+from mindspore import Tensor
+from mindspore.common import dtype as mstype
+from mindspore.nn.loss.loss import _Loss
+from mindspore.ops import functional as F
+from mindspore.ops import operations as P
+
+
+class CrossEntropySmooth(_Loss):
+    """CrossEntropy"""
+    def __init__(self, sparse=True, reduction='mean', smooth_factor=0., num_classes=1000):
+        super(CrossEntropySmooth, self).__init__()
+        self.onehot = P.OneHot()
+        self.sparse = sparse
+        self.on_value = Tensor(1.0 - smooth_factor, mstype.float32)
+        self.off_value = Tensor(1.0 * smooth_factor / (num_classes - 1), mstype.float32)
+        self.ce = nn.SoftmaxCrossEntropyWithLogits(reduction=reduction)
+
+    def construct(self, logit, label):
+        if self.sparse:
+            label = self.onehot(label, F.shape(logit)[1], self.on_value, self.off_value)
+        loss = self.ce(logit, label)
+        return loss
--- a/model_zoo/research/cv/ghostnet/src/config.py
+++ b/model_zoo/research/cv/ghostnet/src/config.py
@ -1,4 +1,4 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
+# Copyright 2021 Huawei Technologies Co., Ltd
 #
 # Licensed under the Apache License, Version 2.0 (the "License");
 # you may not use this file except in compliance with the License.
@ -17,38 +17,23 @@ network config setting, will be used in train.py and eval.py
 """
 from easydict import EasyDict as ed

-config_ascend = ed({
-    "num_classes": 37,
-    "image_height": 224,
-    "image_width": 224,
-    "batch_size": 256,
-    "epoch_size": 200,
-    "warmup_epochs": 4,
-    "lr": 0.4,
+config = ed({
+    "num_classes": 1000,
+    "batch_size": 128,
+    "epoch_size": 500,
+    "warmup_epochs": 20,
+    "lr_init": 0.1,
+    "lr_max": 0.4,
+    'lr_end': 1e-6,
+    'lr_decay_mode': 'cosine',
    "momentum": 0.9,
    "weight_decay": 4e-5,
    "label_smooth": 0.1,
-    "loss_scale": 1024,
+    "loss_scale": 128,
+    "use_label_smooth": True,
+    "label_smooth_factor": 0.1,
    "save_checkpoint": True,
-    "save_checkpoint_epochs": 1,
-    "keep_checkpoint_max": 200,
-    "save_checkpoint_path": "./checkpoint",
-})
-
-config_gpu = ed({
-    "num_classes": 37,
-    "image_height": 224,
-    "image_width": 224,
-    "batch_size": 3,
-    "epoch_size": 370,
-    "warmup_epochs": 4,
-    "lr": 0.4,
-    "momentum": 0.9,
-    "weight_decay": 4e-5,
-    "label_smooth": 0.1,
-    "loss_scale": 1024,
-    "save_checkpoint": True,
-    "save_checkpoint_epochs": 1,
-    "keep_checkpoint_max": 500,
-    "save_checkpoint_path": "./checkpoint",
+    "save_checkpoint_epochs": 20,
+    "keep_checkpoint_max": 10,
+    "save_checkpoint_path": "./",
 })
--- a/model_zoo/research/cv/ghostnet/src/dataset.py
+++ b/model_zoo/research/cv/ghostnet/src/dataset.py
@ -1,4 +1,4 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
+# Copyright 2021 Huawei Technologies Co., Ltd
 #
 # Licensed under the Apache License, Version 2.0 (the "License");
 # you may not use this file except in compliance with the License.
@ -12,99 +12,83 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 # ============================================================================
-"""
-create train or eval dataset.
-"""
+"""Data operations, will be used in train.py and eval.py"""
 import os
+from src.config import config
 import mindspore.common.dtype as mstype
-import mindspore.dataset as ds
-import mindspore.dataset.transforms.vision.c_transforms as C
-import mindspore.dataset.transforms.vision.py_transforms as P
+import mindspore.dataset.engine as de
 import mindspore.dataset.transforms.c_transforms as C2
-from mindspore.dataset.transforms.vision import Inter
+import mindspore.dataset.vision.c_transforms as C
+from mindspore.communication.management import get_rank, get_group_size


-def create_dataset(dataset_path, do_train, config, platform, repeat_num=1, batch_size=100, model='ghsotnet'):
+def create_dataset(dataset_path, do_train, target="Ascend"):
    """
    create a train or eval dataset

    Args:
        dataset_path(string): the path of dataset.
        do_train(bool): whether dataset is used for train or eval.
-        repeat_num(int): the repeat times of dataset. Default: 1
-        batch_size(int): the batch size of dataset. Default: 32
+        rank (int): The shard ID within num_shards (default=None).
+        group_size (int): Number of shards that the dataset should be divided into (default=None).
+        repeat_num(int): the repeat times of dataset. Default: 1.

    Returns:
        dataset
    """
-    if platform == "Ascend":
-        rank_size = int(os.getenv("RANK_SIZE"))
-        rank_id = int(os.getenv("RANK_ID"))
-        if rank_size == 1:
-            data_set = ds.MindDataset(
-                dataset_path, num_parallel_workers=8, shuffle=True)
-        else:
-            data_set = ds.MindDataset(dataset_path, num_parallel_workers=8, shuffle=True,
-                                      num_shards=rank_size, shard_id=rank_id)
-    elif platform == "GPU":
-        if do_train:
-            from mindspore.communication.management import get_rank, get_group_size
-            data_set = ds.MindDataset(dataset_path, num_parallel_workers=8, shuffle=True,
-                                      num_shards=get_group_size(), shard_id=get_rank())
-        else:
-            data_set = ds.MindDataset(
-                dataset_path, num_parallel_workers=8, shuffle=True)
+    if not do_train:
+        dataset_path = os.path.join(dataset_path, 'val')
    else:
-        raise ValueError("Unsupported platform.")
+        dataset_path = os.path.join(dataset_path, 'train')
+    if target == "Ascend":
+        device_num, rank_id = _get_rank_info()

-    resize_height = config.image_height
-    buffer_size = 1000
+    if device_num == 1:
+        ds = de.ImageFolderDataset(dataset_path, num_parallel_workers=8, shuffle=True)
+    else:
+        ds = de.ImageFolderDataset(dataset_path, num_parallel_workers=8, shuffle=True,
+                                   num_shards=device_num, shard_id=rank_id)

+    mean = [0.485 * 255, 0.456 * 255, 0.406 * 255]
+    std = [0.229 * 255, 0.224 * 255, 0.225 * 255]
    # define map operations
-    resize_crop_op = C.RandomCropDecodeResize(
-        resize_height, scale=(0.08, 1.0), ratio=(0.75, 1.333))
-    horizontal_flip_op = C.RandomHorizontalFlip(prob=0.5)
-
-    color_op = C.RandomColorAdjust(
-        brightness=0.4, contrast=0.4, saturation=0.4)
-    rescale_op = C.Rescale(1 / 255.0, 0)
-    normalize_op = C.Normalize(
-        mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
-    change_swap_op = C.HWC2CHW()
-
-    # define python operations
-    decode_p = P.Decode()
-    if model == 'ghostnet-600':
-        s = 274
-        c = 240
-    else:
-        s = 256
-        c = 224
-    resize_p = P.Resize(s, interpolation=Inter.BICUBIC)
-    center_crop_p = P.CenterCrop(c)
-    totensor = P.ToTensor()
-    normalize_p = P.Normalize((0.485, 0.456, 0.406), (0.229, 0.224, 0.225))
-    composeop = P.ComposeOp(
-        [decode_p, resize_p, center_crop_p, totensor, normalize_p])
    if do_train:
-        trans = [resize_crop_op, horizontal_flip_op, color_op,
-                 rescale_op, normalize_op, change_swap_op]
+        trans = [
+            C.RandomCropDecodeResize(224),
+            C.RandomHorizontalFlip(prob=0.5),
+            C.RandomColorAdjust(brightness=0.4, contrast=0.4, saturation=0.4)
+        ]
    else:
-        trans = composeop()
+        trans = [
+            C.Decode(),
+            C.Resize(256),
+            C.CenterCrop(224),
+        ]
+    trans += [
+        C.Normalize(mean=mean, std=std),
+        C.HWC2CHW(),
+    ]
+
    type_cast_op = C2.TypeCast(mstype.int32)
-
-    data_set = data_set.map(input_columns="image", operations=trans,
-                            num_parallel_workers=8)
-    data_set = data_set.map(input_columns="label_list",
-                            operations=type_cast_op, num_parallel_workers=8)
-
-    # apply shuffle operations
-    data_set = data_set.shuffle(buffer_size=buffer_size)
+    ds = ds.map(input_columns="image", operations=trans, num_parallel_workers=8)
+    ds = ds.map(input_columns="label", operations=type_cast_op, num_parallel_workers=8)

    # apply batch operations
-    data_set = data_set.batch(batch_size, drop_remainder=True)
+    ds = ds.batch(config.batch_size, drop_remainder=True)
+    return ds

-    # apply dataset repeat operation
-    data_set = data_set.repeat(repeat_num)

-    return data_set
+def _get_rank_info():
+    """
+    get rank size and rank id
+    """
+    rank_size = int(os.environ.get("RANK_SIZE", 1))
+
+    if rank_size > 1:
+        rank_size = get_group_size()
+        rank_id = get_rank()
+    else:
+        rank_size = 1
+        rank_id = 0
+
+    return rank_size, rank_id
--- a/model_zoo/research/cv/ghostnet/src/ghostnet.py
+++ b/model_zoo/research/cv/ghostnet/src/ghostnet.py
@ -1,4 +1,4 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
+# Copyright 2021 Huawei Technologies Co., Ltd
 #
 # Licensed under the Apache License, Version 2.0 (the "License");
 # you may not use this file except in compliance with the License.
@ -46,6 +46,7 @@ class MyHSigmoid(nn.Cell):
        self.relu6 = nn.ReLU6()

    def construct(self, x):
+        """ construct """
        return self.relu6(x + 3.) * 0.16666667


@ -74,6 +75,7 @@ class Activation(nn.Cell):
            raise NotImplementedError

    def construct(self, x):
+        """ construct """
        return self.act(x)


@ -95,6 +97,7 @@ class GlobalAvgPooling(nn.Cell):
        self.mean = P.ReduceMean(keep_dims=keep_dims)

    def construct(self, x):
+        """ construct """
        x = self.mean(x, (2, 3))
        return x

@ -127,6 +130,7 @@ class SE(nn.Cell):
        self.mul = P.Mul()

    def construct(self, x):
+        """ construct of SE module """
        out = self.pool(x)
        out = self.conv_reduce(out)
        out = self.act1(out)
@ -173,6 +177,7 @@ class ConvUnit(nn.Cell):
        self.act = Activation(act_type) if use_act else None

    def construct(self, x):
+        """ construct of conv unit """
        out = self.conv(x)
        out = self.bn(out)
        if self.use_act:
@ -209,12 +214,14 @@ class GhostModule(nn.Cell):
        new_channels = init_channels * (ratio - 1)

        self.primary_conv = ConvUnit(num_in, init_channels, kernel_size=kernel_size, stride=stride, padding=padding,
-                                     num_groups=1, use_act=use_act, act_type='relu')
-        self.cheap_operation = ConvUnit(init_channels, new_channels, kernel_size=dw_size, stride=1, padding=dw_size//2,
-                                        num_groups=init_channels, use_act=use_act, act_type='relu')
+                                     num_groups=1, use_act=use_act, act_type=act_type)
+        self.cheap_operation = ConvUnit(init_channels, new_channels, kernel_size=dw_size, stride=1,
+                                        padding=dw_size // 2, num_groups=init_channels,
+                                        use_act=use_act, act_type=act_type)
        self.concat = P.Concat(axis=1)

    def construct(self, x):
+        """ ghost module construct """
        x1 = self.primary_conv(x)
        x2 = self.cheap_operation(x1)
        return self.concat((x1, x2))
@ -269,10 +276,10 @@ class GhostBottleneck(nn.Cell):
                ConvUnit(num_in, num_out, kernel_size=1, stride=1,
                         padding=0, num_groups=1, use_act=False),
            ])
-        self.add = P.Add()
+        self.add = P.TensorAdd()

    def construct(self, x):
-        r"""construct of ghostnet"""
+        """ construct of ghostnet """
        shortcut = x
        out = self.ghost1(x)
        if self.use_dw:
@ -318,7 +325,7 @@ class GhostNet(nn.Cell):
        >>> GhostNet(num_classes=1000)
    """

-    def __init__(self, model_cfgs, num_classes=1000, multiplier=1., final_drop=0., round_nearest=8):
+    def __init__(self, model_cfgs, num_classes=1000, multiplier=1., final_drop=0.):
        super(GhostNet, self).__init__()
        self.cfgs = model_cfgs['cfg']
        self.inplanes = 16
@ -365,7 +372,7 @@ class GhostNet(nn.Cell):
        self._initialize_weights()

    def construct(self, x):
-        r"""construct of GhostNet"""
+        """ construct of GhostNet """
        x = self.conv_stem(x)
        x = self.bn1(x)
        x = self.act1(x)
@ -403,21 +410,21 @@ class GhostNet(nn.Cell):
        for _, m in self.cells_and_names():
            if isinstance(m, (nn.Conv2d)):
                n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
-                m.weight.set_parameter_data(Tensor(np.random.normal(0, np.sqrt(2. / n),
-                                                                    m.weight.data.shape).astype("float32")))
+                m.weight.set_data(Tensor(np.random.normal(0, np.sqrt(2. / n),
+                                                          m.weight.data.shape).astype("float32")))
                if m.bias is not None:
-                    m.bias.set_parameter_data(
+                    m.bias.set_data(
                        Tensor(np.zeros(m.bias.data.shape, dtype="float32")))
            elif isinstance(m, nn.BatchNorm2d):
-                m.gamma.set_parameter_data(
+                m.gamma.set_data(
                    Tensor(np.ones(m.gamma.data.shape, dtype="float32")))
-                m.beta.set_parameter_data(
+                m.beta.set_data(
                    Tensor(np.zeros(m.beta.data.shape, dtype="float32")))
            elif isinstance(m, nn.Dense):
-                m.weight.set_parameter_data(Tensor(np.random.normal(
+                m.weight.set_data(Tensor(np.random.normal(
                    0, 0.01, m.weight.data.shape).astype("float32")))
                if m.bias is not None:
-                    m.bias.set_parameter_data(
+                    m.bias.set_data(
                        Tensor(np.zeros(m.bias.data.shape, dtype="float32")))


--- a/model_zoo/research/cv/ghostnet/src/lr_generator.py
+++ b/model_zoo/research/cv/ghostnet/src/lr_generator.py
@ -1,4 +1,4 @@
-# Copyright 2020 Huawei Technologies Co., Ltd
+# Copyright 2021 Huawei Technologies Co., Ltd
 #
 # Licensed under the Apache License, Version 2.0 (the "License");
 # you may not use this file except in compliance with the License.
@ -16,8 +16,7 @@
 import math
 import numpy as np

-
-def get_lr(global_step, lr_init, lr_end, lr_max, warmup_epochs, total_epochs, steps_per_epoch):
+def get_lr(lr_init, lr_end, lr_max, warmup_epochs, total_epochs, steps_per_epoch):
    """
    generate learning rate array

@ -47,9 +46,6 @@ def get_lr(global_step, lr_init, lr_end, lr_max, warmup_epochs, total_epochs, st
        if lr < 0.0:
            lr = 0.0
        lr_each_step.append(lr)
-
-    current_step = global_step
    lr_each_step = np.array(lr_each_step).astype(np.float32)
-    learning_rate = lr_each_step[current_step:]

-    return learning_rate
+    return lr_each_step
--- a/model_zoo/research/cv/ghostnet/train.py
+++ b/model_zoo/research/cv/ghostnet/train.py
@ -0,0 +1,141 @@
+# Copyright 2021 Huawei Technologies Co., Ltd
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+# http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+# ============================================================================
+"""
+train.
+"""
+import os
+import argparse
+import ast
+import mindspore.common.initializer as weight_init
+
+from mindspore import context
+from mindspore import nn
+from mindspore import Tensor
+from mindspore.train.model import Model
+from mindspore.train.loss_scale_manager import FixedLossScaleManager
+from mindspore.train.callback import ModelCheckpoint, CheckpointConfig, LossMonitor, TimeMonitor
+from mindspore.train.serialization import load_checkpoint, load_param_into_net
+from mindspore.common import dtype as mstype
+from mindspore.common import set_seed
+from mindspore.nn.optim.momentum import Momentum
+from mindspore.communication.management import init, get_rank
+from mindspore.context import ParallelMode
+from src.lr_generator import get_lr
+from src.CrossEntropySmooth import CrossEntropySmooth
+from src.dataset import create_dataset
+from src.config import config
+from src.ghostnet import ghostnet_1x
+
+
+parser = argparse.ArgumentParser(description='Image classification--GhostNet')
+parser.add_argument('--data_url', type=str, default=None, help='Dataset path')
+parser.add_argument('--run_distribute', type=ast.literal_eval, default=False, help='Run distribute')
+parser.add_argument('--pre_trained', type=str, default=None, help='Pretrained checkpoint path')
+parser.add_argument('--rank', type=int, default=0, help='local rank of distributed')
+parser.add_argument('--is_save_on_master', type=int, default=1, help='save ckpt on master or all rank')
+args_opt = parser.parse_args()
+
+set_seed(1)
+
+if __name__ == '__main__':
+    # init context
+    context.set_context(mode=context.GRAPH_MODE, device_target="Ascend",
+                        save_graphs=False)
+
+    if args_opt.run_distribute:
+        device_id = int(os.getenv('DEVICE_ID'))
+        rank_size = int(os.environ.get("RANK_SIZE", 1))
+        print(rank_size)
+        device_num = rank_size
+        context.set_context(device_id=device_id, enable_auto_mixed_precision=True)
+        context.set_auto_parallel_context(device_num=device_num, parallel_mode=ParallelMode.DATA_PARALLEL,
+                                          gradients_mean=True)
+        init()
+        args_opt.rank = get_rank()
+
+    # select for master rank save ckpt or all rank save, compatible for model parallel
+    args_opt.rank_save_ckpt_flag = 0
+    if args_opt.is_save_on_master:
+        if args_opt.rank == 0:
+            args_opt.rank_save_ckpt_flag = 1
+    else:
+        args_opt.rank_save_ckpt_flag = 1
+
+    # define net
+    net = ghostnet_1x(num_classes=config.num_classes)
+    net.to_float(mstype.float16)
+    for _, cell in net.cells_and_names():
+        if isinstance(cell, nn.Dense):
+            cell.to_float(mstype.float32)
+
+    local_data_path = args_opt.data_url
+    print('Download data:')
+    dataset = create_dataset(dataset_path=local_data_path,
+                             do_train=True,
+                             target="Ascend")
+
+    step_size = dataset.get_dataset_size()
+    print('steps:', step_size)
+
+    # init weight
+    if args_opt.pre_trained:
+        param_dict = load_checkpoint(args_opt.pre_trained)
+        load_param_into_net(net, param_dict)
+    else:
+        for _, cell in net.cells_and_names():
+            if isinstance(cell, nn.Conv2d):
+                cell.weight.set_data(weight_init.initializer(weight_init.HeUniform(),
+                                                             cell.weight.shape,
+                                                             cell.weight.dtype))
+            if isinstance(cell, nn.Dense):
+                cell.weight.set_data(weight_init.initializer(weight_init.HeNormal(),
+                                                             cell.weight.shape,
+                                                             cell.weight.dtype))
+
+    # init lr
+    lr = get_lr(lr_init=config.lr_init, lr_end=config.lr_end,
+                lr_max=config.lr_max, warmup_epochs=config.warmup_epochs,
+                total_epochs=config.epoch_size, steps_per_epoch=step_size)
+    lr = Tensor(lr)
+
+    if not config.use_label_smooth:
+        config.label_smooth_factor = 0.0
+
+    loss = CrossEntropySmooth(sparse=True, reduction="mean",
+                              smooth_factor=config.label_smooth_factor, num_classes=config.num_classes)
+
+    opt = Momentum(net.trainable_params(), lr, config.momentum, loss_scale=config.loss_scale,
+                   weight_decay=config.weight_decay)
+
+    loss_scale = FixedLossScaleManager(config.loss_scale, drop_overflow_update=False)
+
+    model = Model(net, loss_fn=loss, optimizer=opt, loss_scale_manager=loss_scale,
+                  metrics={'top_1_accuracy', 'top_5_accuracy'},
+                  amp_level="O3", keep_batchnorm_fp32=False)
+
+    # define callbacks
+    time_cb = TimeMonitor(data_size=step_size)
+    loss_cb = LossMonitor()
+    cb = [time_cb, loss_cb]
+    if config.save_checkpoint:
+        if args_opt.rank_save_ckpt_flag:
+            config_ck = CheckpointConfig(save_checkpoint_steps=config.save_checkpoint_epochs * step_size,
+                                         keep_checkpoint_max=config.keep_checkpoint_max)
+            ckpt_cb = ModelCheckpoint(prefix="ghostnet", directory=config.save_checkpoint_path, config=config_ck)
+            cb += [ckpt_cb]
+
+    # train model
+    model.train(config.epoch_size, dataset, callbacks=cb,
+                sink_size=dataset.get_dataset_size())