!21353 Add faq in bert
Merge pull request !21353 from chenhaozhe/code_docs_add_faq_in_bert
commit ca405f1783
@@ -786,3 +786,15 @@ In run_pretrain.py, we set a random seed to make sure that each node has the same
# [ModelZoo Homepage](#contents)
Please check the official [homepage](https://gitee.com/mindspore/mindspore/tree/master/model_zoo).
# FAQ
- **Q: How do I resolve continual overflow during training?**
**A**: Continual overflow is usually caused by a learning rate that is too high.
In the yaml config, you can lower `learning_rate` to reduce the base learning rate, or raise `power` to make the learning rate decay faster.
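For illustration, a minimal sketch of such a change; `learning_rate` and `power` are the parameters named above, but the surrounding key nesting is an assumption and may differ in your yaml config:

```yaml
# Hypothetical excerpt of the pretraining yaml config; nesting may differ.
optimizer_cfg:
  Lamb:
    learning_rate: 1.0e-4   # lowered base learning rate to avoid overflow
    power: 2.0              # a larger power makes the polynomial decay faster
```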
- **Q: Why does the training process fail with a shape-mismatch error?**
**A**: This is usually caused by the model's `seq_length` config not matching the dataset. Check and modify `seq_length` in the yaml config according to the dataset you are using.
The model parameters do not change with `seq_length`; their shapes depend only on the `max_position_embeddings` config.
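Below is a hedged sketch showing the two settings together; the key names come from this answer, while the enclosing block structure is an assumption about the yaml layout:

```yaml
# Hypothetical excerpt; seq_length must match the dataset samples
# and must not exceed max_position_embeddings.
bert_network:
  seq_length: 128                # e.g. set to 512 if your dataset was built with length 512
  max_position_embeddings: 512   # fixes parameter shapes; independent of seq_length
```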
@@ -744,3 +744,11 @@ In run_pretrain.py, a random seed is set to ensure that each node in distributed training
# ModelZoo Homepage
Please check the official [homepage](https://gitee.com/mindspore/mindspore/tree/master/model_zoo).
# FAQ
- **Q: What should I do about continual overflow during training?**
**A**: Continual overflow is usually caused by a learning rate that is too high, which keeps training from converging. Consider adjusting the parameters in the yaml config file: lower `learning_rate` to reduce the initial learning rate, or raise `power` to speed up the learning-rate decay.
- **Q: What causes a shape-mismatch error at runtime?**
**A**: A shape mismatch in the BERT model usually means the model parameter configuration does not match the specification of the dataset being used, most often the sequence length. Consider modifying the `seq_length` parameter to match your specific dataset. Changing this parameter does not affect the weight shapes; those depend only on the `max_position_embeddings` parameter.