dnrops/burn - burn - Trustie: Git with trustie

Commit Graph

Author	SHA1	Message	Date
Guillaume Lagrange	c30ffcf6ac	Enable optimized handling of bytes (#2003 ) * Enable optimized handling of bytes * Implement byte buffer de/serialization * Use serde_bytes w/ alloc (no_std compatible)	2024-07-11 07:48:43 -04:00
Guillaume Lagrange	6f158af4b1	Fix warnings when using `record-backward-compat` (#1977 )	2024-07-08 07:58:50 -04:00
nathaniel	882a27c52c	Revert "Revert "Implement 3D and transposed 3D convolutions. (#1945 )"" This reverts commit `b8b47ea6e6`.	2024-07-05 18:57:01 -04:00
nathaniel	b8b47ea6e6	Revert "Implement 3D and transposed 3D convolutions. (#1945 )" This reverts commit `d696d74e3d`.	2024-07-05 09:40:32 -04:00
Guillaume Charifi	d696d74e3d	Implement 3D and transposed 3D convolutions. (#1945 ) * Implement 3D and transposed 3D convolutions. * Merge changes from onnx-ir #1921 pr --------- Co-authored-by: Dilshod Tadjibaev <939125+antimora@users.noreply.github.com>	2024-07-02 17:54:35 -05:00
Dilshod Tadjibaev	2bb76283ff	Improve pickle (CandleTensor) conversions to NestedValue (#1944 ) * Manually serialize tensor - fixes #1773 * Rename `value` to `bytes`	2024-07-02 08:34:19 -04:00
Arthur Brussee	849c8f453b	Consistent sync/async handling, allow more functions to be async for wasm. (#1936 )	2024-07-02 08:25:28 -04:00
Dilshod Tadjibaev	98a58c867d	Print module - implement module display for remaining modules (part2) (#1933 )	2024-06-28 08:37:40 -04:00
Guillaume Lagrange	cdd1fa1672	Refactor tensor data (#1916 ) * Move distribution to module * Add new TensorData with serialization support * Implement display and from for TensorData * Add missing Cargo.lock * Add missing bytemuck feature * Add zeros, ones, full and random TensorData methods * Refactor Data -> TensorData usage * Fix tests Since TensorData is not generic over the element type anymore no type inference can be done by the compiler. We must explicitly cast the expected results to the expected backend type. * Remove commented line * Fix import * Add record-backward-compat * Remove dim const generic from TensorData * Support NestedValue de/serialization with TensorData * Fix burn-jit tests * Remove eprinln * Refactor onnx import to use TensorData * Fix tch from_data * Fix nested value serialization for u8 * Fix missing import * Fix reduce min onnx test * Fix deprecated attribute * Remove shape getter * Remove strict assert in tests * Add tensor data as_bytes * Add tensor check for rank mismatch * Fix typo (dimensions plural) * Fix error message * Update book examples with from_data and fix Display impl for TensorData * Add deprecation note	2024-06-26 20:22:19 -04:00
Dilshod Tadjibaev	2c51615471	Print model structure like with PyTorch - Part 1 (#1912 )	2024-06-25 09:23:10 -04:00
Nathaniel Simard	560d77d154	Doc: Improve module to_device/fork docs (#1901 )	2024-06-18 16:45:38 -04:00
Nathaniel Simard	e758fd43db	Fix: constant record loading (#1902 )	2024-06-18 16:45:21 -04:00
Justin Restivo	263add23a0	Tanh nn wrapper (#1903 )	2024-06-18 16:45:04 -04:00
Jonathan Richard	5de1517232	Add documentation to burn core nn (#1746 ) * Updated documentation for unfold4d Added links between the struct and the config. Added a link to the related burn_tensor function in the documentation for the forward function. * Changing nn relu module documentation to functional api Removing the formula for relu from the module API to the functional API, citing a paper relevant to relu and mentionning the functional API in the module API * Linking gelu module API documentation to functional API documentation * Linear module : adding documentation Adding documentation to the Linear module mentionning that LinearConfig struct should be used when creating a Linear Layer Also adding links to the documentation that points people toward the right path * Updated documentation for dropout Added links between the struct and the config. Added a link to the struct in the forward function for more info. * embedding + swiglu * RotaryEncodying : adding documentation Adding documentation stating the RotaryEncoding should be created using a RotaryEncodingConfig * prelu: adding documentation Adding documentation to the prelu module: - Linking forward function documentation to the functional API - Citing the first paper to mention prelu - Adding documentation saying that prelu layer should be created using PReluConfig * pos_encoding: adding documentation * Updated documentation for mha Added links for more info. Added shape info at some places. * docs: Add documentation for Gru module Provide documentation for the Gru module, including its configuration and usage. Include a link to the paper that introduced the Gated Recurrent Unit (GRU) and specify that the module should be created using GruConfig. Also, mention that the forward function returns a state tensor with specific dimensions. * burn-core-nn-transformers: adding documentation Adding documentation: - Says to use config to create the layers - Add mathematical formula to the pwff forward pass - Add citation in the pwff to the "Attention is all you need" paper * Updated documentation: ConvTranspose1d and ConvTranspose2d * docs: Add documentation for Lstm and BiLstm modules Provide documentation for the Lstm and BiLstm modules, including their configurations and usage. Include links to the papers that introduced Long Short-Term Memory (LSTM) and Bidirectional LSTM. Specify that the modules should be created using LstmConfig and BiLstmConfig respectively. * docs: Update documentation for ConvTranspose1d and ConvTranspose2d modules * loss: Adding documenntation to the loss layers Adding documentation stating to use the config to create the layer * chore: Refactor Conv1d module imports and update documentation * docs: Add documentation for AdaptiveAvgPool1d and AdaptiveAvgPool2d modules Added references to the burn_tensor associated functions. Added links between the struct and the config. * Refactor Conv1d module imports and update documentation * chore: Refactor Conv2d module imports and update documentation * Add documentation for AvgPool1d and AvgPool2d modules Added references to the burn_tensor associated functions. Added links between the struct and the config. * Add documentation for MaxPool1d and MaxPool2d modules Added references to the burn_tensor associated functions. Added links between the struct and the config. * Add documentation for leaky_relu and removed Config generic Added references to the burn_tensor associated functions. Added links between the struct and the config. Removed the backend generic from the config since it's not needed (might be a breaking change). * refactor: Update BatchNormConfig initialization and add documentation. * Added link to config in embedding struct documentation * refactor: Update GroupNormConfig initialization and add documentation * refactor: Update InstanceNormConfig initialization and add documentation * feat: Update LayerNormConfig initialization and add documentation * refactor: Update RmsNormConfig initialization and add documentation * fixed: removed #derive accidentally * Added missing backticks in pools' shapes * Format nn doc * Make config fields public in nn modules * Update import statements in nn modules Changed burn_tensor imports to crate::tensor * Update import statements in nn modules' tests Changed burn_tensor imports to crate::tensor * breaking change refactor: Update GroupNormConfig and InstanceNormConfig initialization * Make SwiGlu fields public * grammar * slashes * input tensors grouping * copy-pasta mistake * a not an >:I * Capitalization * better desc * math 'n ticks * group_norm functional implementation * removed the ... struct * decoder typo * fmt * referring to private fn in docs --------- Co-authored-by: Thierry Cantin-Demers <piertcd@gmail.com> Co-authored-by: mepatrick73 <pameu17@ulaval.ca>	2024-06-13 12:50:21 -04:00
Arthur Brussee	675f6b3280	Make Param.id public (#1859 ) * Make Param.id public * Remove extra comment.	2024-06-06 11:03:14 -04:00
Guillaume Lagrange	e4836241e1	Fix `DataSerialize` conversion for elements of the same type (#1832 )	2024-05-28 18:12:44 -04:00
Guillaume Lagrange	b466fd7606	Add seq start position when applying RoPE encoding (#1796 )	2024-05-22 13:18:31 -04:00
Guillaume Lagrange	550086a5c1	Fix record nested value de/serialization (#1751 )	2024-05-22 09:15:32 -04:00
getumen	e823338750	Add Clone trait to the `OptimizerAdaptor` and Clone implementations to the optimizers (#1770 )	2024-05-15 09:18:09 -04:00
Ben Barber	d3cd6c4928	Replace opaque return types in optim (#1767 ) * update ARCHITECTURE.md links to project architecture section in contributor book * replace opaque return type in optim	2024-05-13 22:21:20 -04:00
Ahmed Yarub Hani Al Nuaimi	10737527d8	#1747 Upgrade Rust dependencies (#1748 ) * #1747 Upgrade Rust dependencies * Revert upgrade for tch The update of tch on windows gives an error: INTEL MKL ERROR: The specified module could not be found. mkl_vml_avx2.1.dll. Intel MKL FATAL ERROR: cannot load mkl_vml_avx2.1.dll or mkl_vml_def.1.dll. * Keep only .cargo/config.toml file which works with rust > 1.75 --------- Co-authored-by: Sylvain Benner <sylvain@benner.online>	2024-05-10 16:25:19 -04:00
Thierry Cantin-Demers	b09d8431df	Fix Cargo.toml repository links (#1749 ) * Fix wgpu github link * Fix burn-train repo link * Fix burn-tensor github repo * Fix burn-tensor repo link * Fix remaining repo links in crates Cargo.toml --------- Co-authored-by: Jonathan Richard <47578360+jwric@users.noreply.github.com>	2024-05-09 15:40:05 -04:00
Arjun31415	5bbc5ea944	Added ONNX AvgPool1d (#1744 )	2024-05-07 16:10:18 -05:00
Arjun31415	7f94f4c219	Add MaxPool1d ONNX Op(#1725 )	2024-05-06 10:51:00 -05:00
Anton Blomström	f8994e044c	Fix unstable tests when run concurrently (#1724 )	2024-05-05 15:27:42 -05:00
Nathaniel Simard	5d959e2884	[Fusion] Support multi-precision fusion (#1718 )	2024-05-02 18:22:56 -04:00
Nathaniel Simard	587b8f80b3	First draft CUDA runtime (#1685 ) Initial cuda runtime crate with a WIP compiler.	2024-04-30 09:46:29 -04:00
WU Chen	b387829731	Implement bidirectional LSTM (#1035 ) * resolve conflict * move `gate_product` to `GateController` * BiLstm needs to use its own initializer when init * resolve conflicts * add some comments * improve doc * correct the description of GateController * fix fmt * add `LstmState` * add test for state * set batch 2 in bilstm test * resolve conflict * fix * fix doc * change the batch size back to 1 * change the batch size back to 1 * modify docstring; delete dead comment	2024-04-26 13:28:36 -05:00
Nathaniel Simard	2f294c5092	Fix lstm batch size bug (#1695 )	2024-04-26 08:54:12 -04:00
Dilshod Tadjibaev	67ec06d5d8	ONNX support for scalar unsqueeze (#1690 ) * Revert `1c639c8393` `1c639c8393`?diff=unified&w=0 * Refactor by @laggui * Refactor unsqueeze * Add support for scalar unsqueeze * Removed dead comment	2024-04-25 16:05:28 -05:00
Nathaniel Simard	29fa2ee76c	Support linear 1d (#1682 )	2024-04-22 18:39:09 -04:00
Sylvain Benner	e303e31c8b	Bump next version of Burn to 0.14.0 (#1618 )	2024-04-12 17:14:45 -04:00
Guillaume Lagrange	9980db440d	Remove unused assets (#1616 )	2024-04-12 15:48:16 -04:00
Guillaume Lagrange	264c167c11	Update licenses symlinks (#1613 )	2024-04-12 14:43:58 -04:00
Aasheesh Singh	fb1da53a38	support for rotary positional encoding to transformer modules. (#1604 ) * add rotary positional encoding to transformer modules. * fix f64 error * use num_traits * add panic condition	2024-04-12 11:45:49 -04:00
Dilshod Tadjibaev	2f885480ed	Use num-traits for float ops (#1584 )	2024-04-08 10:16:20 -05:00
Louis Fortier-Dubois	f5159b6d22	Refactor: split JitKernel and SourceKernel (#1569 ) * refactor execute_dynamic into Execution * minor change * extension cfg * jitkernel and sourcekernel * add todo statement * cleanup and docs * update book * fix server dependancy on compiler * refactor into shader information * refactor to compile shader once * clippy * clippy * clippy * fix doc * fix doc * fmt * rename feature flag * refactor * All broked * compile at the right time * todo done * all dynamic * all dynamic in template too * fmt * fix ci --------- Co-authored-by: nathaniel <nathaniel.simard.42@gmail.com>	2024-04-05 12:58:10 -04:00
Nathaniel Simard	1239d9bfa3	[Breaking] Make Tensor, Module, Optimizer !Sync + Refactor Autodiff (#1575 )	2024-04-04 16:01:17 -04:00
Guillaume Lagrange	0978c8a586	Support multilabel binary cross entropy (#1571 ) * Support multilabel binary cross entropy * Add missing alloc Vec	2024-04-03 08:03:07 -04:00
Nathaniel Simard	b0c5986d16	Feat/lazy init (#1539 )	2024-04-02 10:13:35 -04:00
Karsten Becker	c21d5a3207	Add LeakyReLu implementation (#1208 ) * Implement LeakyReLu * Cargo fmt * Apply suggestions * cargo fmt * Use float_mul_scalar * Should be grad * Add to books module * Move test files * Update leaky relu to use activation function * Update tensor.md * Fix failing test due to approx * Add back the function comment * Fix comment per PR feedback --------- Co-authored-by: Dilshod Tadjibaev <939125+antimora@users.noreply.github.com>	2024-03-27 13:57:51 -05:00
jcmullwh	626457e1c6	Provide Tensor Padding Helpers #960 (#1097 ) * Initial padding approach Create padding implementation for the last two dimensions of Float and Int Tensors. Create PadMode Enum, allowing Constant padding. Create Padding Struct with Uniform, Asymmetric, height, and width implementations. Create tests for the padding implementation. * Update padding.rs remove unneeded import * Update from Merge Use crate Element Swap from old from_data() to new from_data_devauto() * Formatting Changes Formatting changes from cargo fmt --all * Additional Format Change One more format change that cargo fmt didn't get the first time. * Changes to Example Modify Example to ensure it works. * modify naming better names for impl / input variables. * Modify API - Change Padding to PadSize. - integrate padding value into PadMode. - update tests and examples. * Comments and print Improve comments+naming and remove println * Pad Fixes Moved pad to numeric Simplified PadMode Element updated tensor creations fixed doc example * Fix test location * Simplified pad API * Fix for failed unit tests * Remove bool_full * Rename `pads` to `padding` --------- Co-authored-by: Dilshod Tadjibaev <939125+antimora@users.noreply.github.com>	2024-03-27 12:46:55 -05:00
Aasheesh Singh	a77979e0b6	add rms norm layer (#1527 )	2024-03-25 18:59:11 -04:00
Aasheesh Singh	613e698007	Feat/swiglu (#1507 )	2024-03-25 15:55:27 -04:00
Rubén J.R	69f1877754	New learning rate schedulers (#1481 )	2024-03-19 08:28:42 -05:00
Dilshod Tadjibaev	8a8300c1fb	Add tril_mask, triu_mask and diag_mask ops (#1479 )	2024-03-18 10:15:40 -05:00
Arjun31415	d3af29c5b4	Missing `Debug` derive for Group Norm Config (#1482 )	2024-03-17 13:12:50 -04:00
Arjun31415	4de1272344	Feat: Add Leaky Relu Model (#1467 )	2024-03-14 10:53:40 -05:00
WorldSEnder	53eb3ecfa9	Implement Huber loss (#1444 ) * Implement Huber loss Instead of using a sign or abs function, uses clamping to compute it outside the bounds. This is better for the autodiff backend. * mention Huber loss in the book * unify naming of residuals in comments	2024-03-13 12:55:46 -05:00
carrotflakes	80aac1dde4	Add Rank0 variant to AdaptorRecordV1 and AdaptorRecordItemV1 (#1442 )	2024-03-12 13:08:20 -04:00

1 2

66 Commits