Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support #4285
lint-and-deploy.yaml
on: pull_request
lint-and-deploy
7m 33s
Annotations
1 warning
lint-and-deploy
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636