* chore(candle): Allow enabling accelerate
* Temporarily disable test for accelerate feature
* Allow enabling accelerate from upstream
* Update the README
* Have xtask also test using accelerate
* Renable failing test
* Fix matmul on candle when using accelerate
* Add additional comment to xtask method
* Update kernel mod.rs
* Wgpu crate implementations and add shader files
* Direct backends to the correct implementation
* Use mask method for candle
* Add index out of bounds protection
* Use a macro to avoid duplication
* Use unary_scalar templates
* New shaders for clamp and clamp_inplace
* Remove unneccessary clamp shaders
* Clamp implementation and test
* Use new clamp implementation for float and int ops
* Better variable names for clamp_min/max
* Revert changes to tensor/ops/tensor.rs
* Fix clamp.wgsl
* Fix shader types
* Use native candle clamp
* Use candle ops for clamp_min/max and revert tensor.rs
* Maximum/minimum were reversed