candle/candle-transformers
Laurent Mazare d01207dbf3
Add a RotatingKVCache. (#2493)
* Add a RotatingKVCache.

* Add some KvCache tests.

* Test the reset too.

* More kv-cache testing.

* More tests for the rotating kv-cache.

* Improve the api for the rotating cache so that the whole src tensor gets returned when it's overlarge.

* Handle contiguity + bugfix + use in mimi.

* Add a way to test the mimi streaming mode.

* Mimi streaming fixes.

* More rotating kv-cache.

* Fix the attn mask generation.

* Handle the abs case.

* Add some tests for the generated mask.
2024-09-23 13:14:32 +02:00
..
src Add a RotatingKVCache. (#2493) 2024-09-23 13:14:32 +02:00
tests Soft Non-Maximum Suppression (#2400) 2024-08-10 07:57:52 +02:00
Cargo.toml Metavoice - first cut (#1717) 2024-03-02 18:50:01 +01:00
README.md Add some missing readme files. (#304) 2023-08-02 10:57:12 +01:00

README.md

candle-transformers