[TEST] Add microbenchmark for FC + add fusion #20780
+ utils scripts to run it
Hey @anko-intel , Thanks for submitting the PR
CI supported jobs: [centos-gpu, website, miscellaneous, edge, clang, windows-cpu, unix-gpu, sanity, unix-cpu, windows-gpu, centos-cpu]
@mxnet-bot run ci [centos-gpu, windows-cpu, windows-cpu ]
Jenkins CI successfully triggered : [windows-cpu, centos-gpu]
@mxnet-bot run ci [windows-gpu ]
Jenkins CI successfully triggered : [windows-gpu]
@mxnet-bot run ci [centos-gpu]
Jenkins CI successfully triggered : [centos-gpu]
LGTM
Description
Add a microbenchmark for Fully Connected with add fusion,
plus utility scripts that run it with the proper OMP parameters set and check performance across different numbers of threads.
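As a rough illustration of running a benchmark with OMP parameters set for several thread counts, here is a minimal Python sketch. It is not the PR's actual script: the helper names `omp_env` and `run_for_thread_counts`, the `KMP_AFFINITY` value, and the stand-in `ECHO_COMMAND` are all assumptions made for the example. Only `OMP_NUM_THREADS` is a standard OpenMP variable; the real benchmark invocation would replace the stand-in command.

```python
import os
import subprocess
import sys


def omp_env(num_threads, base_env=None):
    """Return a copy of the environment with OpenMP settings for one run."""
    env = dict(os.environ if base_env is None else base_env)
    env["OMP_NUM_THREADS"] = str(num_threads)
    # Assumption: thread pinning for the Intel OpenMP runtime.
    # The scripts in the PR may use different settings.
    env["KMP_AFFINITY"] = "granularity=fine,compact,1,0"
    return env


def run_for_thread_counts(command, thread_counts):
    """Run `command` once per thread count and collect its stdout."""
    results = {}
    for n in thread_counts:
        completed = subprocess.run(
            command,
            env=omp_env(n),
            capture_output=True,
            text=True,
            check=True,
        )
        results[n] = completed.stdout.strip()
    return results


# Hypothetical stand-in: it only echoes the variable it was given.
# Replace it with the real benchmark command line.
ECHO_COMMAND = [
    sys.executable,
    "-c",
    "import os; print(os.environ['OMP_NUM_THREADS'])",
]

if __name__ == "__main__":
    for n, out in run_for_thread_counts(ECHO_COMMAND, [48, 56, 64]).items():
        print(f"NUM_THREADS = {n}: child saw OMP_NUM_THREADS={out}")
```

Setting the variables per child process, rather than in the parent shell, keeps each run isolated, so one thread count cannot leak into the next measurement.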
Checklist
Essentials
Comments
Examples of test output:
elemwise_add, float
npi_add, float
elemwise_add, mode = smart, granularity = tensor-wise
NUM_THREADS = 48 56 64
elemwise_add, float