burn/crates/burn-candle
Guillaume Lagrange c0211e2f94
Add static tensor quantization (#1963)
* Add QuantizationBackend, QTensorOps and QTensor

* Refactor QTensorOps as part of Backend trait

* Add tensor dequantize, QFloat dtype and default affine/symmetric quant

* Add ndarray default quantization implementation

* Fix clippy

* Add rayon parallel iter

* Add quantization operations to book

* Add q_shape and q_device ops to avoid converting the tensor just to get attributes

* Implement autodiff grad ops

* Mark autodiff todo for QAT

* Remove note

* Add q_inner and q_from_inner
2024-07-08 10:16:58 -04:00
..
src Add static tensor quantization (#1963) 2024-07-08 10:16:58 -04:00
Cargo.toml Fix `DataSerialize` conversion for elements of the same type (#1832) 2024-05-28 18:12:44 -04:00
LICENSE-APACHE Update licenses symlinks (#1613) 2024-04-12 14:43:58 -04:00
LICENSE-MIT Update licenses symlinks (#1613) 2024-04-12 14:43:58 -04:00
README.md [refactor] Move burn crates to their own crates directory (#1336) 2024-02-20 13:57:55 -05:00

README.md

Burn Candle Backend

This crate provides a backend for Burn based on the Candle framework.

It is still in alpha stage, not all operations are supported. It is usable for some use cases, like for inference.

It can be used with CPU or CUDA. On macOS computations can be accelerated by using the Accelerate framework.

Feature Flags

The following features are supported:

  • cuda - Cuda GPU device (NVIDIA only)
  • accelerate - Accelerate framework (macOS only)