mindspore/docs/api/api_python/mindspore.dataset.rst

175 lines
5.1 KiB
ReStructuredText
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

mindspore.dataset
=================
该模块提供了加载和处理各种通用数据集的API如MNIST、CIFAR-10、CIFAR-100、VOC、COCO、ImageNet、CelebA、CLUE等
也支持加载业界标准格式的数据集包括MindRecord、TFRecord、Manifest等。此外用户还可以使用此模块定义和加载自己的数据集。
该模块还提供了在加载时进行数据采样的API如SequentialSample、RandomSampler、DistributedSampler等。
大多数数据集可以通过指定参数 `cache` 启用缓存服务,以提升整体数据处理效率。
请注意Windows平台上还不支持缓存服务因此在Windows上加载和处理数据时请勿使用。更多介绍和限制
请参考 `Single-Node Tensor Cache <https://www.mindspore.cn/tutorials/experts/zh-CN/master/dataset/cache.html>`_
在API示例中常用的模块导入方法如下
.. code-block::
import mindspore.dataset as ds
from mindspore.dataset.transforms import c_transforms
常用数据集术语说明如下:
- Dataset所有数据集的基类提供了数据处理方法来帮助预处理数据。
- SourceDataset一个抽象类表示数据集管道的来源从文件和数据库等数据源生成数据。
- MappableDataset一个抽象类表示支持随机访问的源数据集。
- Iterator用于枚举元素的数据集迭代器的基类。
视觉
-----
.. mscnautosummary::
:toctree: dataset
:nosignatures:
:template: classtemplate_inherited.rst
mindspore.dataset.Caltech101Dataset
mindspore.dataset.Caltech256Dataset
mindspore.dataset.CelebADataset
mindspore.dataset.Cifar10Dataset
mindspore.dataset.Cifar100Dataset
mindspore.dataset.CityscapesDataset
mindspore.dataset.CocoDataset
mindspore.dataset.DIV2KDataset
mindspore.dataset.EMnistDataset
mindspore.dataset.FakeImageDataset
mindspore.dataset.FashionMnistDataset
mindspore.dataset.FlickrDataset
mindspore.dataset.Flowers102Dataset
mindspore.dataset.ImageFolderDataset
mindspore.dataset.KMnistDataset
mindspore.dataset.ManifestDataset
mindspore.dataset.MnistDataset
mindspore.dataset.PhotoTourDataset
mindspore.dataset.Places365Dataset
mindspore.dataset.QMnistDataset
mindspore.dataset.SBDataset
mindspore.dataset.SBUDataset
mindspore.dataset.SemeionDataset
mindspore.dataset.STL10Dataset
mindspore.dataset.SVHNDataset
mindspore.dataset.USPSDataset
mindspore.dataset.VOCDataset
mindspore.dataset.WIDERFaceDataset
文本
----
.. mscnautosummary::
:toctree: dataset
:nosignatures:
:template: classtemplate_inherited.rst
mindspore.dataset.AGNewsDataset
mindspore.dataset.AmazonReviewDataset
mindspore.dataset.CLUEDataset
mindspore.dataset.CoNLL2000Dataset
mindspore.dataset.DBpediaDataset
mindspore.dataset.EnWik9Dataset
mindspore.dataset.IMDBDataset
mindspore.dataset.IWSLT2016Dataset
mindspore.dataset.IWSLT2017Dataset
mindspore.dataset.PennTreebankDataset
mindspore.dataset.SogouNewsDataset
mindspore.dataset.TextFileDataset
mindspore.dataset.UDPOSDataset
mindspore.dataset.WikiTextDataset
mindspore.dataset.YahooAnswersDataset
mindspore.dataset.YelpReviewDataset
音频
------
.. mscnautosummary::
:toctree: dataset
:nosignatures:
:template: classtemplate_inherited.rst
mindspore.dataset.LJSpeechDataset
mindspore.dataset.SpeechCommandsDataset
mindspore.dataset.TedliumDataset
mindspore.dataset.YesNoDataset
标准格式
--------
.. mscnautosummary::
:toctree: dataset
:nosignatures:
:template: classtemplate_inherited.rst
mindspore.dataset.CSVDataset
mindspore.dataset.MindDataset
mindspore.dataset.OBSMindDataset
mindspore.dataset.TFRecordDataset
用户自定义
----------
.. mscnautosummary::
:toctree: dataset
:nosignatures:
:template: classtemplate_inherited.rst
mindspore.dataset.GeneratorDataset
mindspore.dataset.NumpySlicesDataset
mindspore.dataset.PaddedDataset
mindspore.dataset.RandomDataset
---
.. mscnautosummary::
:toctree: dataset
mindspore.dataset.GraphData
采样器
-------
.. mscnautosummary::
:toctree: dataset
mindspore.dataset.DistributedSampler
mindspore.dataset.PKSampler
mindspore.dataset.RandomSampler
mindspore.dataset.SequentialSampler
mindspore.dataset.SubsetRandomSampler
mindspore.dataset.SubsetSampler
mindspore.dataset.WeightedRandomSampler
其他
-----
.. mscnautosummary::
:toctree: dataset
:nosignatures:
:template: classtemplate_inherited.rst
mindspore.dataset.BatchInfo
mindspore.dataset.DatasetCache
mindspore.dataset.DSCallback
mindspore.dataset.SamplingStrategy
mindspore.dataset.Schema
mindspore.dataset.Shuffle
mindspore.dataset.WaitedDSCallback
mindspore.dataset.OutputFormat
mindspore.dataset.compare
mindspore.dataset.deserialize
mindspore.dataset.serialize
mindspore.dataset.show
mindspore.dataset.sync_wait_for_dataset
mindspore.dataset.utils.imshow_det_bbox
mindspore.dataset.zip