0ace420e66
* Expose the seqlen variable for flash-attn without padding.
* Fix the batched call.
* Adapt for the varlen variant.
* No need to set the batch strides when in varlen mode.
* Add a test (disabled at the moment).
* Get the test to work properly.

flash_attn_tests.rs