I am trying to finetune LLaVA on a custom dataset, but flash attention is not supported by my GPU. Do I need flash attention to finetune?

Replies: 1 comment
No, you do not need flash attention to finetune. In the finetuning script, use train.py instead of train_mem.py. It worked for me and I didn't see much difference in the results.
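A minimal sketch of what that change looks like, assuming a LLaVA-style launch script that calls the trainer through deepspeed; the model path, data paths, and output directory below are illustrative placeholders, and any other flags should be copied from your own finetuning script:

```bash
# Sketch only: the point is the entrypoint swap (train.py instead of train_mem.py),
# which avoids the flash-attention code path. Paths and the model name are placeholders;
# keep the remaining flags (vision tower, conversation version, hyperparameters, etc.)
# exactly as they appear in your existing finetuning script.
deepspeed llava/train/train.py \
    --deepspeed ./scripts/zero3.json \
    --model_name_or_path liuhaotian/llava-v1.5-7b \
    --data_path ./playground/data/my_custom_dataset.json \
    --image_folder ./playground/data/images \
    --output_dir ./checkpoints/llava-custom-finetune
```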