burn/burn-dataset
Dilshod Tadjibaev 483f9acca5
Enhance SqliteDatasetWriter to remove tmp on SIGTERM (#375)
Temporary files were dangling when a process is terminated by SIGINT (ctrl/cmd+c) or SIGTERM. This change used gix-tempfile to wrap a temporary file with a special signal handle to will remove the tmp file when writer is dropped or when program is terminated early.
2023-06-02 09:49:48 -04:00
..
examples Add SqliteDatasetWriter to store items in sqlite db (#364) 2023-06-01 09:54:50 -04:00
src Enhance SqliteDatasetWriter to remove tmp on SIGTERM (#375) 2023-06-02 09:49:48 -04:00
tests/data Dataset Improvements: Add Sqlite storage backend and HF importer improvements (#353) 2023-05-20 14:24:55 -04:00
Cargo.toml Enhance SqliteDatasetWriter to remove tmp on SIGTERM (#375) 2023-06-02 09:49:48 -04:00
LICENSE-APACHE Update projects (#29) 2022-09-04 14:22:56 -04:00
LICENSE-MIT Update projects (#29) 2022-09-04 14:22:56 -04:00
README.md Dataset audio - Speech Commands (#361) 2023-05-22 14:24:02 -04:00

README.md

Burn Dataset

Burn dataset library

Current Crates.io Version license

The Burn Dataset library is designed to streamline your machine learning (ML) data pipeline creation process. It offers a variety of dataset implementations, transformation functions, and data sources.

Feature Flags

  • audio - enables audio dataset (SpeechCommandsDataset). Run the following example to try it out:

    cargo run --example speech_commands --features audio