Instructions to use kernels-community/flash-attn3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Kernels
How to use kernels-community/flash-attn3 with Kernels:
# !pip install kernels from kernels import get_kernel kernel = get_kernel("kernels-community/flash-attn3") - Notebooks
- Google Colab
- Kaggle
| from kernels.benchmarks import ( | |
| FlashAttentionBenchmark, | |
| FlashAttentionCausalBenchmark, | |
| FlashAttentionVarlenBenchmark, | |
| ) | |
| class FlashAttn(FlashAttentionBenchmark): | |
| pass | |
| class FlashAttnCausal(FlashAttentionCausalBenchmark): | |
| pass | |
| class FlashAttnVarlen(FlashAttentionVarlenBenchmark): | |
| pass | |