[Benchmarks] index #14996

Open
stas00 opened this issue Dec 31, 2021 · 0 comments

stas00 commented Dec 31, 2021

This issue is to document the important transformers benchmarks in one place, so that they are easy to find.

To add a new benchmark entry, post it in an Issue (either as a separate issue or as a comment in an existing one) and then link to it from here. If you have edit rights, please add the link directly to this post; otherwise, please leave a note in the comments and I will update this post.

Please do not post actual benchmarks in the comments of this Issue. This is only an index.

Thank you!

Fastest speed combinations

Precision: fp16 vs bf16 vs tf32 vs fp32

Batch size / gradient accumulation steps

Gradient checkpointing

Optimizers:

  • Adam torch vs. apex vs HF vs adafactor: RTX-3090, A100
  • Re-run of the above a year later with the same list of optimizers, plus BNB's 8-bit optimizer and fused torch AdamW: PCIe 80GB A100

Network / Interconnects:

stas00 self-assigned this Dec 31, 2021
stas00 added the Benchmarks and WIP labels Dec 31, 2021