forked from mindspore-Ecosystem/mindspore
!1647 dataset: re-fix some doc format problem in take and split
Merge pull request !1647 from ms_yan/doc_format
This commit is contained in:
commit
63765cf184
|
@ -1256,7 +1256,7 @@ class MappableDataset(SourceDataset):
|
||||||
1. Dataset should not be sharded if split is going to be called. Instead, create a
|
1. Dataset should not be sharded if split is going to be called. Instead, create a
|
||||||
DistributedSampler and specify a split to shard after splitting. If dataset is
|
DistributedSampler and specify a split to shard after splitting. If dataset is
|
||||||
sharded after a split, it is strongly recommended to set the same seed in each instance
|
sharded after a split, it is strongly recommended to set the same seed in each instance
|
||||||
of execution, otherwise each shard may not be part of the same split (see Examples)
|
of execution, otherwise each shard may not be part of the same split (see Examples).
|
||||||
2. It is strongly recommended to not shuffle the dataset, but use randomize=True instead.
|
2. It is strongly recommended to not shuffle the dataset, but use randomize=True instead.
|
||||||
Shuffling the dataset may not be deterministic, which means the data in each split
|
Shuffling the dataset may not be deterministic, which means the data in each split
|
||||||
will be different in each epoch. Furthermore, if sharding occurs after split, each
|
will be different in each epoch. Furthermore, if sharding occurs after split, each
|
||||||
|
|
Loading…
Reference in New Issue