SageAttention2++ needs CUDA 12.8

#3
by jt-zhang - opened

Hi,
Please note that SageAttention2++ needs CUDA version >= 12.8.

jt-zhang pinned discussion

Hello !
despite that :
** Python version: 3.11.6 (tags/v3.11.6:8b6ee5b, Oct 2 2023, 14:57:12) [MSC v.1935 64 bit (AMD64)]
Device: cuda:0 NVIDIA GeForce RTX 4090 : cudaMallocAsync
pytorch version: 2.7.1+cu128

i am getting an error
0%| | 0/28 [00:00<?, ?it/s]Error running sage attention: SM89 kernel is not available. Make sure you GPUs with compute capability 8.9., using pytorch attention instead.
Error running sage attention: SM89 kernel is not available. Make sure you GPUs with compute capability 8.9., using pytorch attention instead.

Sign up or log in to comment