Instructions to use yetter-ai/sage-attention with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Kernels
How to use yetter-ai/sage-attention with Kernels:
# !pip install kernels from kernels import get_kernel kernel = get_kernel("yetter-ai/sage-attention") - Notebooks
- Google Colab
- Kaggle
Add Python 3.13 binaries for torch 2.11 SageAttention
Browse files- .gitattributes +0 -1
- build/torch211-cxx11-cu130-x86_64-linux/fp4attn_cuda.cpython-313-x86_64-linux-gnu.so +3 -0
- build/torch211-cxx11-cu130-x86_64-linux/fp4quant_cuda.cpython-313-x86_64-linux-gnu.so +3 -0
- build/torch211-cxx11-cu130-x86_64-linux/sageattention/_fused.cpython-313-x86_64-linux-gnu.so +3 -0
- build/torch211-cxx11-cu130-x86_64-linux/sageattention/_qattn_sm80.cpython-313-x86_64-linux-gnu.so +3 -0
- build/torch211-cxx11-cu130-x86_64-linux/sageattention/_qattn_sm89.cpython-313-x86_64-linux-gnu.so +3 -0
- build/torch211-cxx11-cu130-x86_64-linux/sageattention/_qattn_sm90.cpython-313-x86_64-linux-gnu.so +3 -0
.gitattributes
CHANGED
|
@@ -33,5 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
-
build/torch211-cxx11-cu130-x86_64-linux/_sage_attention_cuda_d7d1339_dirty.abi3.so filter=lfs diff=lfs merge=lfs -text
|
| 37 |
build/**/*.so filter=lfs diff=lfs merge=lfs -text
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
| 36 |
build/**/*.so filter=lfs diff=lfs merge=lfs -text
|
build/torch211-cxx11-cu130-x86_64-linux/fp4attn_cuda.cpython-313-x86_64-linux-gnu.so
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ec82b4d3ac1a8ab68ee32ff085dae61418b63adeefebe9b8674816081bef9e03
|
| 3 |
+
size 14269296
|
build/torch211-cxx11-cu130-x86_64-linux/fp4quant_cuda.cpython-313-x86_64-linux-gnu.so
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2049382ae89163c4a103efff266cd3d90a0484497d875d835204be6cfaa64c08
|
| 3 |
+
size 778928
|
build/torch211-cxx11-cu130-x86_64-linux/sageattention/_fused.cpython-313-x86_64-linux-gnu.so
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2d3d512441183a710d98a7660f2c04550c5dddf5a8f7a1247b4042cd21ebd166
|
| 3 |
+
size 12137072
|
build/torch211-cxx11-cu130-x86_64-linux/sageattention/_qattn_sm80.cpython-313-x86_64-linux-gnu.so
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ef9358cc0113d4a10627c0a8553735029908eea735201900e71b1fc9b42e781d
|
| 3 |
+
size 17898728
|
build/torch211-cxx11-cu130-x86_64-linux/sageattention/_qattn_sm89.cpython-313-x86_64-linux-gnu.so
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:49d78c3d31c500e500f5b6a31d3c8932d1318e0a929df8fea9f52d6f2a1b94cb
|
| 3 |
+
size 65407064
|
build/torch211-cxx11-cu130-x86_64-linux/sageattention/_qattn_sm90.cpython-313-x86_64-linux-gnu.so
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6e576f52b46c472ca89076cbb5acd72650bd0cf253689fe53edc61120c329614
|
| 3 |
+
size 12627048
|