<div align="center">

<img src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/logo-burn-neutral.webp" width="350px"/>

[![Discord](https://img.shields.io/discord/1038839012602941528.svg?color=7289da&&logo=discord)](https://discord.gg/uPEBbYYDB6)
[![Current Crates.io Version](https://img.shields.io/crates/v/burn.svg)](https://crates.io/crates/burn)
[![Documentation](https://img.shields.io/badge/docs-latest-blue)](https://burn.dev/docs/burn)
[![Test Status](https://github.com/tracel-ai/burn/actions/workflows/test.yml/badge.svg)](https://github.com/tracel-ai/burn/actions/workflows/test.yml)
[![CodeCov](https://codecov.io/gh/tracel-ai/burn/branch/main/graph/badge.svg)](https://codecov.io/gh/tracel-ai/burn)
[![Rust Version](https://img.shields.io/badge/Rust-1.75.0+-blue)](https://releases.rs/docs/1.75.0)
![license](https://shields.io/badge/license-MIT%2FApache--2.0-blue)

---

**Burn is a new comprehensive dynamic Deep Learning Framework built using Rust <br/> with extreme
flexibility, compute efficiency and portability as its primary goals.**

<br/>
</div>

<div align="left">

## Performance

<div align="left">

<img align="right" src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/ember-blazingly-fast.png" height="96px"/>

Because we believe the goal of a deep learning framework is to convert computation into useful
intelligence, we have made performance a core pillar of Burn. We strive to achieve top efficiency by
leveraging multiple optimization techniques described below.

**Click on each section for more details** 👇

</div>

<br/>

<details>
<summary>
Automatic kernel fusion 💥
</summary>
<br/>

Using Burn means having your models optimized on any backend. When possible, we provide a way to
automatically and dynamically create custom kernels that minimize data relocation between different
memory spaces, which is extremely useful when moving memory is the bottleneck.

As an example, you could write your own GELU activation function with the high-level tensor API
(see the Rust code snippet below).

```rust
use burn::tensor::backend::Backend;
use burn::tensor::Tensor;
use core::f64::consts::SQRT_2;

fn gelu_custom<B: Backend, const D: usize>(x: Tensor<B, D>) -> Tensor<B, D> {
    let x = x.clone() * ((x / SQRT_2).erf() + 1);
    x / 2
}
```

Then, at runtime, a custom low-level kernel will be automatically created for your specific
implementation and will rival a handcrafted GPU implementation. The kernel consists of about 60
lines of [WGSL](https://www.w3.org/TR/WGSL/) (WebGPU Shading Language), an extremely verbose
lower-level shader language you probably don't want to program your deep learning models in!

> As of now, our fusion strategy is only implemented for our own WGPU backend and supports only a
> subset of operations. We plan to add more operations very soon and extend this technique to other
> future in-house backends.

</details>
<details>
<summary>
Asynchronous execution ❤️🔥
</summary>
<br/>

For [backends developed from scratch by the Burn team](#backends), an asynchronous execution style
is used, which makes it possible to perform various optimizations, such as the previously mentioned
automatic kernel fusion.

Asynchronous execution also ensures that the normal execution of the framework does not block the
model computations, which implies that the framework overhead won't impact the speed of execution
significantly. Conversely, the intense computations in the model do not interfere with the
responsiveness of the framework. For more information about our asynchronous backends, see
[this blog post](https://burn.dev/blog/creating-high-performance-asynchronous-backends-with-burn-compute).
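
As a minimal sketch (not taken from the Burn docs, and assuming the Wgpu backend), tensor operations
only enqueue work, and execution is forced when you actually read the result, for instance by
printing it:

```rust
use burn::backend::Wgpu;
use burn::tensor::{Distribution, Tensor};

fn main() {
    let x: Tensor<Wgpu, 2> = Tensor::random([2048, 2048], Distribution::Default);

    // These calls only register work on the backend's queue and return quickly.
    let y = x.clone().matmul(x).exp();

    // Reading the values back (printing here) is the synchronization point:
    // execution blocks until the queued kernels have finished.
    println!("{y}");
}
```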

</details>
<details>
<summary>
Thread-safe building blocks 🦞
</summary>
<br/>

Burn emphasizes thread safety by leveraging the
[ownership system of Rust](https://doc.rust-lang.org/book/ch04-00-understanding-ownership.html).
With Burn, each module is the owner of its weights. It is therefore possible to send a module to
another thread for computing the gradients, then send the gradients to the main thread that can
aggregate them, and _voilà_, you get multi-device training.
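
As a rough sketch of the idea (using a bare weight tensor instead of a full module for brevity, and
the same tensor API as the examples below):

```rust
use std::thread;

use burn::backend::{Autodiff, Wgpu};
use burn::tensor::{Distribution, Tensor};

type Backend = Autodiff<Wgpu>;

fn main() {
    let weights: Tensor<Backend, 2> =
        Tensor::random([32, 32], Distribution::Default).require_grad();
    let weights_worker = weights.clone();

    // The worker thread takes ownership of its copy of the weights and computes the gradients.
    let worker = thread::spawn(move || {
        let input: Tensor<Backend, 2> = Tensor::random([8, 32], Distribution::Default);
        input.matmul(weights_worker).sum().backward()
    });

    // The main thread receives the gradients and could aggregate several of them.
    let grads = worker.join().unwrap();
    let weights_grad = weights.grad(&grads).unwrap();
    println!("{weights_grad}");
}
```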

This is a very different approach from what PyTorch does, where backpropagation actually mutates
the _grad_ attribute of each tensor parameter. This is not a thread-safe operation and therefore
requires lower-level synchronization primitives; see
[distributed training](https://pytorch.org/docs/stable/distributed.html) for reference. Note that
this approach is still very fast, but it is not compatible across different backends and quite hard
to implement.

</details>
<details>
<summary>
Intelligent memory management 🦀
</summary>
<br/>

One of the main roles of a deep learning framework is to reduce the amount of memory necessary to
run models. The naive way of handling memory is that each tensor has its own memory space, which is
allocated when the tensor is created and then deallocated when the tensor goes out of scope. However,
allocating and deallocating data is very costly, so a memory pool is often required to achieve good
throughput. Burn offers an infrastructure that allows for easily creating and selecting memory
management strategies for backends. For more details on memory management in Burn, see
[this blog post](https://burn.dev/blog/creating-high-performance-asynchronous-backends-with-burn-compute).

Another very important memory optimization of Burn is that we keep track of when a tensor can be
mutated in-place just by using the ownership system well. Even though it is a rather small memory
optimization on its own, it adds up considerably when training or running inference with larger
models and helps reduce memory usage even further. For more information, see
[this blog post about tensor handling](https://burn.dev/blog/burn-rusty-approach-to-tensor-handling).
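
As a small sketch of what this means in practice: when a tensor is moved into an operation, the
backend knows that no other handle exists and may reuse its buffer in place, whereas a clone keeps
the original alive and requires a new allocation.

```rust
use burn::backend::Wgpu;
use burn::tensor::{Distribution, Tensor};

fn main() {
    let x: Tensor<Wgpu, 2> = Tensor::random([512, 512], Distribution::Default);

    // `x` is moved into the operation: its buffer has a single owner and can be reused in place.
    let y = x + 1.0;

    // `y` is cloned, so it stays alive and the result needs its own allocation.
    let z = y.clone() * 2.0;

    println!("{y}\n{z}");
}
```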

</details>
<details>
<summary>
Automatic kernel selection 🎯
</summary>
<br/>

A good deep learning framework should ensure that models run smoothly on all hardware. However, not
all hardware shares the same behavior in terms of execution speed. For instance, a matrix
multiplication kernel can be launched with many different parameters, which are highly sensitive to
the size of the matrices and the hardware. Using the wrong configuration could reduce the speed of
execution by a large factor (10 times or even more in extreme cases), so choosing the right kernels
becomes a priority.

With our home-made backends, we run benchmarks automatically and choose the best configuration for
the current hardware and matrix sizes with a reasonable caching strategy.

This adds a small overhead by increasing the warmup execution time, but stabilizes quickly after a
few forward and backward passes, saving lots of time in the long run. Note that this feature isn't
mandatory, and can be disabled when cold starts are a priority over optimized throughput.
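
Conceptually, the strategy amounts to benchmarking every candidate kernel once per input shape and
caching the winner. The sketch below is an illustration only, not Burn's internal API:

```rust
use std::collections::HashMap;
use std::time::Instant;

/// Illustration only (not Burn's internal API): time every candidate kernel once
/// for a given input shape, cache the fastest one, and reuse that choice afterwards.
fn fastest_kernel(
    cache: &mut HashMap<(usize, usize), usize>,
    shape: (usize, usize),
    candidates: &[fn((usize, usize))],
) -> usize {
    *cache.entry(shape).or_insert_with(|| {
        (0..candidates.len())
            .min_by_key(|&index| {
                let start = Instant::now();
                candidates[index](shape); // run the candidate once as a benchmark
                start.elapsed()
            })
            .expect("at least one candidate kernel")
    })
}
```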

</details>
<details>
<summary>
Hardware specific features 🔥
</summary>
<br/>

It is no secret that deep learning relies mostly on matrix multiplication as its core operation,
since this is how fully-connected neural networks are modeled.

More and more, hardware manufacturers optimize their chips specifically for matrix multiplication
workloads. For instance, Nvidia has its _Tensor Cores_ and today most cellphones have AI-specialized
chips. As of this moment, we support Tensor Cores with our LibTorch and Candle backends, but not
other accelerators yet. We hope [this issue](https://github.com/gpuweb/gpuweb/issues/4195) gets
resolved at some point to bring support to our WGPU backend.

</details>
<details>
<summary>
Custom Backend Extension 🎒
</summary>
<br/>

Burn aims to be the most flexible deep learning framework. While it's crucial to maintain
compatibility with a wide variety of backends, Burn also provides the ability to extend the
functionalities of a backend implementation to suit your personal modeling requirements.

This versatility is advantageous in numerous ways, such as supporting custom operations like flash
attention or manually writing your own kernel for a specific backend to enhance performance. See
[this section](https://burn.dev/book/advanced/backend-extension/index.html) in the Burn Book 🔥 for
more details.
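
As a simplified sketch of the pattern (the trait and method names here are hypothetical; the Burn
Book shows the full version, which works at the backend-primitive level so a backend can plug in a
handwritten kernel):

```rust
use burn::tensor::activation::softmax;
use burn::tensor::{backend::Backend, Tensor};

/// Hypothetical extension trait: backends with a hand-tuned kernel can override
/// the default implementation, which falls back to plain tensor operations.
pub trait AttentionBackend: Backend {
    fn attention<const D: usize>(
        query: Tensor<Self, D>,
        key: Tensor<Self, D>,
        value: Tensor<Self, D>,
    ) -> Tensor<Self, D>
    where
        Self: Sized,
    {
        // Unscaled attention, kept minimal on purpose.
        softmax(query.matmul(key.transpose()), D - 1).matmul(value)
    }
}
```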

</details>

<br/>

## Training & Inference

<div align="left">

<img align="right" src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/ember-wall.png" height="96px"/>

The whole deep learning workflow is made easy with Burn, as you can monitor your training progress
with an ergonomic dashboard, and run inference everywhere from embedded devices to large GPU
clusters.

Burn was built from the ground up with training and inference in mind. It's also worth noting how
Burn, in comparison to frameworks like PyTorch, simplifies the transition from training to
deployment, eliminating the need for code changes.

</div>

<div align="center">
<br/>

<a href="https://www.youtube.com/watch?v=N9RM5CQbNQc" target="_blank">
<img src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/burn-train-tui.png" alt="Burn Train TUI" width="75%">
</a>
</div>

<br/>

**Click on the following sections to expand 👇**

<details>
<summary>
Training Dashboard 📈
</summary>
<br/>

As you can see in the previous video (click on the picture!), a new terminal UI dashboard based on
the [Ratatui](https://github.com/ratatui-org/ratatui) crate allows users to follow their training
with ease without having to connect to any external application.

You can visualize your training and validation metrics updating in real-time and analyze the
lifelong progression or recent history of any registered metrics using only the arrow keys. You can
also break from the training loop without crashing, allowing potential checkpoints to be fully
written and important pieces of code to complete without interruption 🛡

</details>
<details>
<summary>
ONNX Support 🐫
</summary>
<br/>

ONNX (Open Neural Network Exchange) is an open-standard format that exports both the architecture
and the weights of a deep learning model.

Burn supports importing models that follow the ONNX standard, so you can easily port a model you
have written in another framework like TensorFlow or PyTorch to Burn and benefit from all the
advantages our framework offers.

2024-01-30 06:55:20 +08:00
Our ONNX support is further described in
[this section of the Burn Book 🔥 ](https://burn.dev/book/import/onnx-model.html ).
2023-11-18 02:04:41 +08:00
> **Note**: This crate is in active development and currently supports a
> [limited set of ONNX operators](./burn-import/SUPPORTED-ONNX-OPS.md).
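
In practice, the import happens at build time. A typical `build.rs` looks roughly like the sketch
below (the `mnist.onnx` path is a placeholder, and `burn-import` must be added as a build
dependency; see the Burn Book for the full setup):

```rust
// build.rs
use burn_import::onnx::ModelGen;

fn main() {
    // Generates Rust source code for the model into OUT_DIR at build time.
    ModelGen::new()
        .input("src/model/mnist.onnx")
        .out_dir("model/")
        .run_from_script();
}
```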

</details>

<details>
<summary>
Importing PyTorch Models 🚚
</summary>
<br/>

Burn supports loading PyTorch model weights into Burn's native model architecture, ensuring
seamless integration. See the
[Burn Book 🔥 section on importing PyTorch](https://burn.dev/book/import/pytorch-model.html) for
more details.

</details>

<details>
<summary>
Inference in the Browser 🌐
</summary>
<br/>

Several of our backends can compile to Web Assembly: Candle and NdArray for CPU, and WGPU for GPU.
This means that you can run inference directly within a browser. We provide several examples of
this:

- [MNIST](./examples/mnist-inference-web) where you can draw digits and a small convnet tries to
  find which one it is! 2️⃣ 7️⃣ 😰
- [Image Classification](./examples/image-classification-web) where you can upload images and
  classify them! 🌄

</details>
<details>
<summary>
Embedded: <i>no_std</i> support ⚙️
</summary>
<br/>

Burn's core components support [no_std](https://docs.rust-embedded.org/book/intro/no-std.html). This
means they can run in bare metal environments such as embedded devices without an operating system.

> As of now, only the NdArray backend can be used in a _no_std_ environment.
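
For example, here is a sketch of a function that could live in a `no_std` firmware crate, assuming
`burn` is compiled with `default-features = false` and the `ndarray` feature enabled:

```rust
#![no_std]

use burn::backend::NdArray;
use burn::tensor::Tensor;

/// The tensor API stays the same as on std targets.
pub fn tiny_forward(input: Tensor<NdArray, 2>) -> Tensor<NdArray, 2> {
    (input / 2).exp()
}
```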

</details>

<br/>

## Backends

<div align="left">

<img align="right" src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/backend-chip.png" height="96px"/>

Burn strives to be as fast as possible on as much hardware as possible, with robust implementations.
We believe this flexibility is crucial for modern needs where you may train your models in the cloud,
then deploy on customer hardware, which varies from user to user.

</div>

<br/>

Compared to other frameworks, Burn has a very different approach to supporting many backends. By
design, most code is generic over the Backend trait, which allows us to build Burn with swappable
backends. This makes composing backends possible, augmenting them with additional functionalities
such as autodifferentiation and automatic kernel fusion.

**We already have many backends implemented, all listed below 👇**

<details>
<summary>
WGPU (WebGPU): Cross-Platform GPU Backend 🌐
</summary>
<br/>

**The go-to backend for running on any GPU.**

Based on the most popular and well-supported Rust graphics library, [WGPU](https://wgpu.rs), this
backend automatically targets Vulkan, OpenGL, Metal, DirectX 11/12, and WebGPU, by using the WebGPU
shading language [WGSL](https://www.w3.org/TR/WGSL/). It can also be compiled to Web Assembly to
run in the browser while leveraging the GPU, see
[this demo](https://antimora.github.io/image-classification/). For more information on the benefits
of this backend, see [this blog](https://burn.dev/blog/cross-platform-gpu-backend).

2024-01-30 06:55:20 +08:00
The WGPU backend is our first "in-house backend", which means we have complete control over its
implementation details. It is fully optimized with the
[performance characteristics mentioned earlier ](#performance ), as it serves as our research
playground for a variety of optimizations.
2023-11-18 02:04:41 +08:00
See the [WGPU Backend README ](./burn-wgpu/README.md ) for more details.
< / details >
< details >
< summary >
Candle: Backend using the Candle bindings 🕯
< / summary >
< br / >
2024-01-30 06:55:20 +08:00
Based on [Candle by Hugging Face ](https://github.com/huggingface/candle ), a minimalist ML framework
for Rust with a focus on performance and ease of use, this backend can run on CPU with support for
Web Assembly or on Nvidia GPUs using CUDA.
2023-11-18 02:04:41 +08:00
See the [Candle Backend README ](./burn-candle/README.md ) for more details.
2024-01-30 06:55:20 +08:00
> _Disclaimer:_ This backend is not fully completed yet, but can work in some contexts like
> inference.

</details>
<details>
<summary>
LibTorch: Backend using the LibTorch bindings 🎆
</summary>
<br/>

PyTorch doesn't need an introduction in the realm of deep learning. This backend leverages
[PyTorch Rust bindings](https://github.com/LaurentMazare/tch-rs), enabling you to use LibTorch C++
kernels on CPU, CUDA and Metal.

See the [LibTorch Backend README](./burn-tch/README.md) for more details.

</details>
<details>
<summary>
NdArray: Backend using the NdArray primitive as data structure 🦐
</summary>
<br/>

This CPU backend is admittedly not our fastest backend, but offers extreme portability. It is our
only backend supporting _no_std_.

See the [NdArray Backend README](./burn-ndarray/README.md) for more details.

</details>
<details>
<summary>
Autodiff: Backend decorator that brings backpropagation to any backend 🔄
</summary>
<br/>

Contrary to the aforementioned backends, Autodiff is actually a backend _decorator_ . This means that
it cannot exist by itself; it must encapsulate another backend.

The simple act of wrapping a base backend with Autodiff transparently equips it with
autodifferentiation support, making it possible to call backward on your model.

```rust
use burn::backend::{Autodiff, Wgpu};
use burn::tensor::{Distribution, Tensor};

fn main() {
    type Backend = Autodiff<Wgpu>;

    let x: Tensor<Backend, 2> = Tensor::random([32, 32], Distribution::Default);
    let y: Tensor<Backend, 2> = Tensor::random([32, 32], Distribution::Default).require_grad();

    let tmp = x.clone() + y.clone();
    let tmp = tmp.matmul(x);
    let tmp = tmp.exp();

    let grads = tmp.backward();
    let y_grad = y.grad(&grads).unwrap();
    println!("{y_grad}");
}
```

Of note, it is impossible to make the mistake of calling backward on a model that runs on a backend
that does not support autodiff (for inference), as this method is only offered by an Autodiff
backend.

See the [Autodiff Backend README](./burn-autodiff/README.md) for more details.

</details>
<details>
<summary>
Fusion: Backend decorator that brings kernel fusion to backends that support it 💥
</summary>
<br/>

This backend decorator enhances a backend with kernel fusion, provided that the inner backend
supports it. Note that you can compose this backend with other backend decorators such as Autodiff.

For now, only the WGPU backend has support for fused kernels.

```rust
use burn::backend::{Autodiff, Fusion, Wgpu};
use burn::tensor::{Distribution, Tensor};

fn main() {
    type Backend = Autodiff<Fusion<Wgpu>>;

    let x: Tensor<Backend, 2> = Tensor::random([32, 32], Distribution::Default);
    let y: Tensor<Backend, 2> = Tensor::random([32, 32], Distribution::Default).require_grad();

    let tmp = x.clone() + y.clone();
    let tmp = tmp.matmul(x);
    let tmp = tmp.exp();

    let grads = tmp.backward();
    let y_grad = y.grad(&grads).unwrap();
    println!("{y_grad}");
}
```

Of note, we plan to implement automatic gradient checkpointing based on compute bound and memory
bound operations, which will work gracefully with the fusion backend to make your code run even
faster during training, see [this issue](https://github.com/tracel-ai/burn/issues/936).

See the [Fusion Backend README](./burn-fusion/README.md) for more details.

</details>

<br/>

## Getting Started

<div align="left">

<img align="right" src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/ember-walking.png" height="96px"/>

Just heard of Burn? You are in the right place! Keep reading this section and we hope you can get
on board quickly.

</div>

<details>
<summary>
The Burn Book 🔥
</summary>
<br/>

To begin working effectively with Burn, it is crucial to understand its key components and
philosophy. This is why we highly recommend that new users read the first sections of
[The Burn Book 🔥](https://burn.dev/book/). It provides detailed examples and explanations covering
every facet of the framework, including building blocks like tensors, modules, and optimizers, all
the way to advanced usage, like coding your own GPU kernels.

> The project is constantly evolving, and we try as much as possible to keep the book up to date
> with new additions. However, we might miss some details sometimes, so if you see something weird,
> let us know! We also gladly accept Pull Requests 😄

</details>
<details>
<summary>
Examples 🙏
</summary>
<br/>

Let's start with a code snippet that shows how intuitive the framework is to use! In the following,
we declare a neural network module with some parameters along with its forward pass.

```rust
use burn::nn;
use burn::module::Module;
use burn::tensor::backend::Backend;
use burn::tensor::Tensor;

#[derive(Module, Debug)]
pub struct PositionWiseFeedForward<B: Backend> {
    linear_inner: nn::Linear<B>,
    linear_outer: nn::Linear<B>,
    dropout: nn::Dropout,
    gelu: nn::Gelu,
}

impl<B: Backend> PositionWiseFeedForward<B> {
    pub fn forward<const D: usize>(&self, input: Tensor<B, D>) -> Tensor<B, D> {
        let x = self.linear_inner.forward(input);
        let x = self.gelu.forward(x);
        let x = self.dropout.forward(x);

        self.linear_outer.forward(x)
    }
}
```
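
Modules are usually paired with a configuration struct. As a sketch (the field names and the
default value are ours), the `Config` derive generates constructors with defaults as well as
save/load support; an `init` method on such a config would then build the `Linear` and `Dropout`
layers from `nn::LinearConfig` and `nn::DropoutConfig`.

```rust
use burn::config::Config;

#[derive(Config, Debug)]
pub struct PositionWiseFeedForwardConfig {
    pub d_model: usize,
    pub d_ff: usize,
    #[config(default = 0.1)]
    pub dropout: f64,
}
```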

We have quite a large number of [examples](./examples) in the repository that show how to use the
framework in different scenarios. For more practical insights, you can clone the repository and run
any of them directly on your computer!

</details>

<details>
<summary>
Pre-trained Models 🤖
</summary>
<br/>

We keep an updated and curated list of models and examples built with Burn, see the
[tracel-ai/models repository](https://github.com/tracel-ai/models) for more details.

Don't see the model you want? Don't hesitate to open an issue, and we may prioritize it. Built a
model using Burn and want to share it? You can also open a Pull Request and add your model under the
community section!

</details>

<details>
<summary>
Why use Rust for Deep Learning? 🦀
</summary>
<br/>

Deep Learning is a special form of software where you need very high-level abstractions as well as
extremely fast execution time. Rust is the perfect candidate for that use case since it provides
zero-cost abstractions to easily create neural network modules, and fine-grained control over memory
to optimize every detail.

It's important that a framework be easy to use at a high level so that its users can focus on
innovating in the AI field. However, since running models relies so heavily on computations,
performance can't be neglected.

To this day, the mainstream solution to this problem has been to offer APIs in Python, but rely on
bindings to low-level languages such as C/C++. This reduces portability, increases complexity and
creates friction between researchers and engineers. We feel like Rust's approach to abstractions
makes it versatile enough to tackle this two-language dichotomy.

Rust also comes with the Cargo package manager, which makes it incredibly easy to build, test, and
deploy from any environment, which is usually a pain in Python.

Although Rust has the reputation of being a difficult language at first, we strongly believe it
leads to more reliable, bug-free solutions built faster (after some practice 😅)!

</details>

<br/>

## Community

<div align="left">

<img align="right" src="https://raw.githubusercontent.com/tracel-ai/burn/main/assets/ember-community.png" height="96px"/>

If you are excited about the project, don't hesitate to join our
[Discord](https://discord.gg/uPEBbYYDB6)! We try to be as welcoming as possible to everybody from
any background. You can ask your questions and share what you built with the community!

</div>

<br/>

**Contributing**

Before contributing, please take a moment to review our
[code of conduct](https://github.com/tracel-ai/burn/tree/main/CODE-OF-CONDUCT.md). It's also highly
recommended to read our
[architecture document](https://github.com/tracel-ai/burn/tree/main/ARCHITECTURE.md), which explains
some of our architectural decisions. Refer to our [contributing guide](/CONTRIBUTING.md) for more
details.

## Status

Burn is currently in active development, and there will be breaking changes. While any resulting
issues are likely to be easy to fix, there are no guarantees at this stage.

## License

Burn is distributed under the terms of both the MIT license and the Apache License (Version 2.0).
See [LICENSE-APACHE](./LICENSE-APACHE) and [LICENSE-MIT](./LICENSE-MIT) for details. Opening a pull
request is assumed to signal agreement with these licensing terms.

</div>