* Add QuantizationBackend, QTensorOps and QTensor
* Refactor QTensorOps as part of Backend trait
* Add tensor dequantize, QFloat dtype and default affine/symmetric quant
* Add ndarray default quantization implementation
* Fix clippy
* Add rayon parallel iter
* Add quantization operations to book
* Add q_shape and q_device ops to avoid converting the tensor just to get attributes
* Implement autodiff grad ops
* Mark autodiff todo for QAT
* Remove note
* Add q_inner and q_from_inner
* Implement 3D and transposed 3D convolutions
* Merge changes from onnx-ir PR #1921
---------
Co-authored-by: Dilshod Tadjibaev <939125+antimora@users.noreply.github.com>
* Move distribution to module
* Add new TensorData with serialization support
* Implement Display and From for TensorData
* Add missing Cargo.lock
* Add missing bytemuck feature
* Add zeros, ones, full and random TensorData methods
* Refactor Data -> TensorData usage
* Fix tests
Since TensorData is no longer generic over the element type, the compiler can no longer infer it. We must explicitly convert the expected results to the backend's element type.
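For example, a fixed test could look like this minimal sketch (the helper is illustrative; `convert` and `assert_approx_eq` are assumed burn-tensor methods and may differ from the exact calls used in the touched tests):
```rust
use burn_tensor::{backend::Backend, Tensor, TensorData};

// Illustrative only: with `TensorData` erasing the element type, the
// expected values must be converted explicitly to the backend's float
// element type before comparison.
fn assert_close<B: Backend>(output: Tensor<B, 1>, expected: [f32; 3]) {
    let expected = TensorData::from(expected).convert::<B::FloatElem>();
    output.into_data().assert_approx_eq(&expected, 3);
}
```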
* Remove commented line
* Fix import
* Add record-backward-compat
* Remove dim const generic from TensorData
* Support NestedValue de/serialization with TensorData
* Fix burn-jit tests
* Remove eprintln
* Refactor onnx import to use TensorData
* Fix tch from_data
* Fix nested value serialization for u8
* Fix missing import
* Fix reduce min onnx test
* Fix deprecated attribute
* Remove shape getter
* Remove strict assert in tests
* Add tensor data as_bytes
* Add tensor check for rank mismatch
* Fix typo (dimensions plural)
* Fix error message
* Update book examples with from_data and fix Display impl for TensorData
* Add deprecation note
* Make backend names in JSON reports match burnbench CLI
- add `config_name` to `Backend` trait
- add `backend_config_name` to `Benchmark` trait
- fix documentation for JSON reports to use correct unit of time
* Revert "Make backend names in JSON reports match burnbench CLI"
This reverts commit a09edb6389.
* [backend-comparison] Serialize the feature name passed to burnbench
---------
Co-authored-by: syl20bnr <sylvain.benner@gmail.com>
Uploading is enabled with the already implemented --share argument
of the burnbench command-line tool.
The burnbench binary passes the server URL and the auth token to
the cargo bench process using the additional arguments
--sharing-url and --sharing-token respectively.
The persistence module then uploads the results when a
--sharing-url is provided.
The URL is hardcoded for now: the endpoint is the production
server when compiling in release mode and localhost otherwise.
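A minimal sketch of how the binary could forward these arguments (only the --sharing-url and --sharing-token names come from the description above; the function and its structure are hypothetical):
```rust
use std::process::Command;

// Hypothetical sketch: spawn `cargo bench` and forward the sharing
// parameters as extra arguments consumed by the persistence module.
fn run_bench(bench: &str, sharing: Option<(&str, &str)>) -> std::io::Result<()> {
    let mut cmd = Command::new("cargo");
    cmd.args(["bench", "--bench", bench, "--"]);
    if let Some((url, token)) = sharing {
        cmd.args(["--sharing-url", url, "--sharing-token", token]);
    }
    cmd.status().map(|_| ())
}
```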
* Refactor serialization of benchmarks
* flatten benchmarks data to make it easier to save documents to a database and
query them
* split some information into their own fields like backend and device
* add new serialized info (see the sketch after this list):
- computed values (mean, median, variance, min, max)
- number of samples
- operation name
- tensor shapes if any
* serialize to separate files, one file per benchmark run
* simplify persistence module to only a save method
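A rough sketch of the flattened document and the single save method, assuming serde/serde_json; field and type names are illustrative, not the exact backend-comparison schema:
```rust
use serde::Serialize;
use std::{fs::File, path::Path};

// Illustrative flattened record: backend and device split into their own
// fields, computed values, number of samples, operation name and shapes.
#[derive(Serialize)]
struct BenchmarkRecord {
    backend: String,
    device: String,
    operation: String,
    shapes: Vec<Vec<usize>>,
    num_samples: usize,
    mean: f64,
    median: f64,
    variance: f64,
    min: f64,
    max: f64,
}

// One file per benchmark run, written by a single save method.
fn save(record: &BenchmarkRecord, path: &Path) -> std::io::Result<()> {
    serde_json::to_writer_pretty(File::create(path)?, record)?;
    Ok(())
}
```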
* Update bench save file format to use name and uuid
* Compute serialized fields count automatically via a macro
* Rework naming of benchmarks and shapes, and add an options field
Remove the operations field
Correctly create one file per benchmark run
* Serialize benchmark num_repeats
* Fix expect message to follow the 'should' convention
* Cargo fmt :-)
* Make Clippy happy
* Save files in the burn subdirectory
* Change name of custom_gelu bench to just gelu
* Remove num_repeats from backend-comparison benchmarks
* Fix wrong variable name used to compute the median
* Remove false positive possibility in test_mean_duration