sdpa

Here are 2 public repositories matching this topic...

📚FFPA: Yet another Faster Flash Prefill Attention with O(1)⚡️SRAM complexity for headdim > 256, 1.8x~3x↑🎉 faster than SDPA EA.

cuda attention sdpa mla mlsys tensor-cores flash-attention deepseek deepseek-v3 deepseek-r1 fused-mla

An open-source interface to use the multiple-precision solver SDPA-GMP with YALMIP

optimization semidefinite-programming yalmip sdpa-gmp sdpa
