candle/candle-flash-attn
Laurent Mazare e7fc1daa21
Bump the crate versions to 0.4.2. (#1821)
2024-03-08 22:01:51 +01:00
..
cutlass@c4f6b8c6bc Add flash attention (#241) 2023-07-26 07:48:10 +01:00
kernels chore: update flash attention kernels (#1518) 2024-01-05 18:28:55 +01:00
src chore: update flash attention kernels (#1518) 2024-01-05 18:28:55 +01:00
tests Flash attention without padding (varlen). (#281) 2023-07-31 09:45:39 +01:00
Cargo.toml Bump the crate versions to 0.4.2. (#1821) 2024-03-08 22:01:51 +01:00
README.md Add some missing readme files. (#304) 2023-08-02 10:57:12 +01:00
build.rs Moving to a proper build crate `bindgen_cuda`. (#1531) 2024-01-07 12:29:24 +01:00

README.md

candle-flash-attn