candle/candle-flash-attn/tests
Laurent Mazare 0ace420e66
Flash attention without padding (varlen). (#281)
* Expose the seqlen variable for flash-attn without padding.

* Fix the batched call.

* Adapt for the varlen variant.

* No need to set the batch strides when in varlen mode.

* Add a test (disabled at the moment).

* Get the test to work properly.
2023-07-31 09:45:39 +01:00
flash_attn_tests.rs Flash attention without padding (varlen). (#281) 2023-07-31 09:45:39 +01:00
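
The commit above exposes a sequence-length input for the unpadded (varlen) variant. As a rough illustration of what such an input usually looks like, the sketch below builds cumulative sequence offsets for a batch whose sequences are packed back to back. This assumes the varlen call follows the upstream flash-attn convention of cumulative offsets. The helper names `cumulative_seqlens` and `max_seqlen` are hypothetical and are not part of candle.

```rust
/// Hypothetical helper (not a candle API): turns per-sequence lengths into
/// cumulative offsets for a packed, unpadded batch.
/// Sequence `i` occupies rows `offsets[i]..offsets[i + 1]` of the packed tensor.
fn cumulative_seqlens(lengths: &[u32]) -> Vec<u32> {
    let mut offsets = Vec::with_capacity(lengths.len() + 1);
    let mut total = 0u32;
    // The first offset is always zero.
    offsets.push(total);
    for &len in lengths {
        total += len;
        offsets.push(total);
    }
    offsets
}

/// Hypothetical helper: the longest sequence in the batch, or 0 if the batch
/// is empty. Assumption: a varlen kernel needs this bound alongside the offsets.
fn max_seqlen(lengths: &[u32]) -> u32 {
    lengths.iter().copied().max().unwrap_or(0)
}
```

For lengths 3, 5 and 2, the packed tensor has 10 rows in total and the offsets are 0, 3, 8, 10. No padding rows are stored, which fits the commit's note that batch strides are not needed in varlen mode.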