[Sync] Readme #6

blefaudeux · 2021-10-18T17:38:21Z

What does this PR do?

Fixes # (issue).

Before submitting

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

+ Added `pyre-check` to `requirements-test.txt`.
+ Added a `.watchmanconfig` as that is needed for fast incremental results.
+ Added empty stubs for missing imports to `stubs/`. This is the equivalent of mypy's `--ignore-missing-imports`.

This PR has nothing to do with shape types.

Remaining work: There are currently 85 Pyre errors. I will address the non-Tensor ones in the next commit. The rest will be addressed when we add stubs for Tensor shape types.


* Update README.md
* adding a Transformer illustration, minor improvements on the repo map
* adding a `key features` section
* wip populating the foldable links
* Update README.md
* Update README.md
* revert accidental removal of references
* rebase onto main, add benchmark plots
* small additions and markdown linting
* go through some old FIXMEs
* adding the benchmark requirements
* adding more installation instructions
* more examples, restructuring a little
* Expanding on the HOWTO
* adding a note about Triton cache
* add a reference to the license in the README, as requested
* update figure size
* update badges
* adding a runnable example for sparse attention
* clearing up the message on FusedLinear kernel
* adding another small example + some explanations
* missing saved file, getting there
* fixing the xformers citation
* updating the plots, adding Layer norm

Co-authored-by: Benjamin Lefaudeux <[email protected]>

It now takes an argument: https://torchmetrics.readthedocs.io/en/stable/classification/accuracy.html

Change in pytorch lightning: Lightning-AI/torchmetrics@20eab43

Somehow this is failing with a SEGFAULT on my A100 (in a triton kernel):

```
#0  0x00007fffc0f62e10 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#1  0x00007fffc0f9303c in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#2  0x00007fffc0f2ea13 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#3  0x00007fffc0f94603 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#4  0x00007fffc119e4a0 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#5  0x00007fffc0f3728f in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#6  0x00007fffc0f3999f in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#7  0x00007fffc0fdb1c2 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#8  0x00007fff502234c0 in _launch () from /data/home/XXXXX/.triton/cache/704a3e6949e60326bc68d18a620bee50/layer_norm_fw.so
#9  0x00007fff3c0eea25 in launch () from /data/home/XXXXX/.triton/cache/2cebb5590a024a2e06fe9de08c6b7079/k_dropout_bw.so
#10 0x0000555555698422 in cfunction_call (func=0x7fff3c6e5760, args=<optimized out>, kwargs=<optimized out>) at /usr/local/src/conda/python-3.10.6/Objects/methodobject.c:552
```

[ghstack-poisoned]


blefaudeux and others added 18 commits October 13, 2021 20:46

[hotfix] doc build requires numpy (#316)

0f8f5b8

[fix] Code of conduct (#318)

d0a5742

* adding the missing coc * opportunist doc fix

[chore] Fix other Pyre errors.

11e2aa0

This fixes all of the valid errors. The remaining 56 errors are stub issues that will be fixed when we move to the Tensor shape type stubs.

Merge pull request #310 from pradeep90/main

cf55c70

[chore] Set up basic Pyre configuration.

[minor] adding an issue template (#319)

51a2bca

take in key padding mask in blocksparse attn

8102435

flag specifying whether masks must be passed in separately

2f1a42e

[fix] Adding missing license headers (#320)

5b495a1

* adding missing licences, missing are sputnik but will set an exception

fix linting error, not related to previous changes

2869445

fix

904daee

fix default to False

f3de7d0

Merge pull request #321 from fairinternal/key_padding_blocksparse

e763b07

Take in key padding mask in blocksparse attention
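For readers unfamiliar with the term, here is a minimal sketch of what taking in a key padding mask generally means: a per-sequence mask over key positions is folded into the attention mask so that padded keys are never attended to. This is a plain-Python illustration, not the blocksparse implementation from this PR. The function name `merge_key_padding_mask` and the convention that `True` marks a real (non-padded) token are assumptions made for the example.

```python
from typing import List

NEG_INF = float("-inf")


def merge_key_padding_mask(
    attn_mask: List[List[float]],
    key_padding_mask: List[List[bool]],
) -> List[List[List[float]]]:
    """Fold a key padding mask into an additive attention mask.

    attn_mask:        [seq_q][seq_k], 0.0 keeps a position, -inf masks it.
    key_padding_mask: [batch][seq_k], True = real token, False = padding
                      (assumed convention for this sketch).
    Returns a per-sample additive mask of shape [batch][seq_q][seq_k].
    """
    seq_k = len(attn_mask[0])
    merged = []
    for keep_row in key_padding_mask:
        if len(keep_row) != seq_k:
            raise ValueError("key_padding_mask and attn_mask disagree on seq_k")
        # A padded key is masked for every query of this sample.
        merged.append(
            [
                [value if keep else NEG_INF for value, keep in zip(query_row, keep_row)]
                for query_row in attn_mask
            ]
        )
    return merged


# Causal mask over 3 positions; the first sample has one padded key at the end.
causal = [
    [0.0, NEG_INF, NEG_INF],
    [0.0, 0.0, NEG_INF],
    [0.0, 0.0, 0.0],
]
padding = [[True, True, False], [True, True, True]]
per_sample_mask = merge_key_padding_mask(causal, padding)
```

The merged mask is added to the attention scores before the softmax, so masked positions receive zero attention weight.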

[feat] Bump triton to 1.1.1 in requirements-benchmark (#322)

70c7999

[feat][LN] Gracefully handle non contiguous tensors + unit test (#323)

92b3ad5

[minor] Triton layernorm integration in the encoder/decoder blocks (#317)

18a6b1d

Merge branch 'origin_main' into sync_readme

da7995e

facebook-github-bot added the CLA Signed label Oct 18, 2021

blefaudeux merged commit 133595c into main Oct 18, 2021

blefaudeux deleted the sync_readme branch October 18, 2021 17:38

qianfengz added a commit to qianfengz/xformers that referenced this pull request Feb 7, 2024

Merge pull request facebookresearch#6 from sgrigory/add-triton-fa2

43e7797

Add Triton Flash Attention 2 forward op
