
About FlashAttention: "FlashAttention only supports Ampere GPUs or newer" #47

Closed
xiatiandefeng666 opened this issue Feb 21, 2025 · 1 comment

Comments

@xiatiandefeng666

Thank you very much for your work. When my server is training the model, it reports: "FlashAttention only supports Ampere GPUs or newer". Can I train the model without changing the server, for example by not using FlashAttention? Thank you very much.

@RunsenXu
Collaborator

Hi,

You could try using vanilla attention. Just comment out this line:

replace_llama_attn_with_flash_attn()
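
For reference, instead of deleting the call outright, the patch could be guarded by a GPU capability check, so the same script runs on both Ampere and older cards. Below is a minimal sketch of that idea; the `llama_flash_attn_monkey_patch` module path is an assumption based on common LLaMA training repos, so substitute the actual import used in this codebase. Ampere corresponds to CUDA compute capability 8.0, which `torch.cuda.get_device_capability()` reports.

```python
# Minimal sketch (module path is an assumption; use the repo's actual import):
# apply the FlashAttention monkey patch only when the GPU supports it.
import torch

def maybe_enable_flash_attn():
    # FlashAttention requires compute capability >= 8.0 (Ampere or newer).
    if torch.cuda.is_available() and torch.cuda.get_device_capability()[0] >= 8:
        # Hypothetical import path; adjust to where the patch lives in this repo.
        from llama_flash_attn_monkey_patch import replace_llama_attn_with_flash_attn
        replace_llama_attn_with_flash_attn()
    else:
        # Fall back to the vanilla attention shipped with the model.
        print("FlashAttention unavailable on this GPU; using vanilla attention.")

maybe_enable_flash_attn()
```

With a guard like this, pre-Ampere GPUs (e.g., V100, T4) fall back to the model's stock attention; training still works, just with higher memory use and lower throughput.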
