!1647 dataset: re-fix some doc format problem in take and split

Merge pull request !1647 from ms_yan/doc_format
mindspore-ci-bot 2020-05-29 10:45:57 +08:00 committed by Gitee
commit 63765cf184
1 changed files with 10 additions and 10 deletions

@@ -1256,7 +1256,7 @@ class MappableDataset(SourceDataset):
 1. Dataset should not be sharded if split is going to be called. Instead, create a
 DistributedSampler and specify a split to shard after splitting. If dataset is
 sharded after a split, it is strongly recommended to set the same seed in each instance
-of execution, otherwise each shard may not be part of the same split (see Examples)
+of execution, otherwise each shard may not be part of the same split (see Examples).
 2. It is strongly recommended to not shuffle the dataset, but use randomize=True instead.
 Shuffling the dataset may not be deterministic, which means the data in each split
 will be different in each epoch. Furthermore, if sharding occurs after split, each