# Changelog This documents the main changes to the `candle` crate. ## v0.3.1 - Unreleased ### Added ### Modified ## v0.3.0 - 2023-10-01 ### Added - Added the Mistral 7b v0.1 model [983](https://github.com/huggingface/candle/pull/983). - Quantized version of the Mistral model [1009](https://github.com/huggingface/candle/pull/1009). - Add the gelu-erf op and activation function [969](https://github.com/huggingface/candle/pull/969). - Add the mixformer/phi-v1.5 model [930](https://github.com/huggingface/candle/pull/930). - Add the sclice-scatter op [927](https://github.com/huggingface/candle/pull/927). - Add the Wuerstchen diffusion model [911](https://github.com/huggingface/candle/pull/911). ### Modified - Support for simd128 intrinsics in some quantized vecdots [982](https://github.com/huggingface/candle/pull/982). - Optimize the index-select cuda kernel [976](https://github.com/huggingface/candle/pull/976). - Self-contained safetensor wrappers [946](https://github.com/huggingface/candle/pull/946). ## v0.2.2 - 2023-09-18 ### Added - Support for `top_p` sampling [819](https://github.com/huggingface/candle/pull/819). - T5 model including decoding [864](https://github.com/huggingface/candle/pull/864). - 1-d upsampling [839](https://github.com/huggingface/candle/pull/839). ### Modified - Bugfix for conv2d [820](https://github.com/huggingface/candle/pull/820). - Support tensor based indexing using `.i` [842](https://github.com/huggingface/candle/pull/842). ## v0.2.1 - 2023-09-11 ### Added - Add some RNNs (GRU and LSTM) in `candle-nn` [674](https://github.com/huggingface/candle/pull/674), [688](https://github.com/huggingface/candle/pull/688). - gguf v2 support [725](https://github.com/huggingface/candle/pull/725). - Quantized llama example in Python using the pyo3 api [716](https://github.com/huggingface/candle/pull/716). - `candle-nn` layer for conv2d-transposed [760](https://github.com/huggingface/candle/pull/760). - Add the Segment-Anything Model (SAM) as an example [773](https://github.com/huggingface/candle/pull/773). - TinyViT backbone for the segment anything example [787](https://github.com/huggingface/candle/pull/787). - Shape with holes support [770](https://github.com/huggingface/candle/pull/770). ### Modified - Dilations are now supported in conv-transpose2d. [671](https://github.com/huggingface/candle/pull/671). - Interactive mode for the quantized model [690](https://github.com/huggingface/candle/pull/690). - Faster softmax operation [747](https://github.com/huggingface/candle/pull/747). - Faster convolution operations on CPU and CUDA via im2col [802](https://github.com/huggingface/candle/pull/802). - Moving some models to a more central location [796](https://github.com/huggingface/candle/pull/796). ## v0.2.0 - 2023-08-30 ### Added - Add the powf op [664](https://github.com/huggingface/candle/pull/664). - Stable Diffusion XL support [647](https://github.com/huggingface/candle/pull/647). - Add the conv-transpose2d op [635](https://github.com/huggingface/candle/pull/635). - Refactor the VarBuilder api [627](https://github.com/huggingface/candle/pull/627). - Add some quantization command [625](https://github.com/huggingface/candle/pull/625). - Support more quantized types, e.g. Q2K, Q4K, Q5K... [586](https://github.com/huggingface/candle/pull/586). - Add pose estimation to the yolo example [589](https://github.com/huggingface/candle/pull/589). - Api to write GGUF files [585](https://github.com/huggingface/candle/pull/585). - Support more quantization types [580](https://github.com/huggingface/candle/pull/580). - Add EfficientNet as an example Computer Vision model [572](https://github.com/huggingface/candle/pull/572). - Add a group parameter to convolutions [566](https://github.com/huggingface/candle/pull/566). - New dtype: int64 [563](https://github.com/huggingface/candle/pull/563). - Handling of the GGUF file format. [559](https://github.com/huggingface/candle/pull/559). ## v0.1.2 - 2023-08-21