Why is the performance of the mkl_conv in the torch test worse than that of the benchdnn test? #2672
Comments
@Serenagirl Thanks for reaching out.
Here are some best practices for Configuring oneDNN for Benchmarking.
@Serenagirl Can you please share the details?
Sorry for the late reply. I wanted to test single-core performance. My platform is aarch64, and the theoretical peak of the CPU is 92.8 GFLOP/s on a single core, so the 8.95025 below looks too low.
@oneapi-src/onednn-cpu-aarch64 It looks like a reference implementation is being used instead of an optimized one on the aarch64 platform for the given shape. Can you please look into this performance issue?
I tested aten::mkldnn_convolution in PyTorch:
and got a total time of 144 ms, but then I tested the same convolution in benchdnn:
OMP_NUM_THREADS=1 taskset -c 0 ./benchdnn --conv --mode=P --dt=f32 --dir=FWD_B --mb=1 ic64ih170iw256oc64oh168ow254kh3kw3sh1sw1dh1dw1ph1pw1
Why is there a difference?
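To put the two measurements on the same scale, it can help to convert a measured time into GFLOP/s for this exact shape and compare it with the 92.8 GFLOP/s single-core peak. The sketch below is an illustration, not part of either tool. The helper names are hypothetical. It assumes the `dh1dw1` in the benchdnn descriptor follows oneDNN's zero-based dilation convention (0 means no dilation), which is consistent with `oh168ow254` in the command. It also assumes, purely for illustration, that the 144 ms figure covers a single forward call; if the profiler total spans several iterations, divide accordingly.

```python
def dilated_kernel(k, d):
    """Effective kernel extent with zero-based dilation (d=0 means a dense kernel)."""
    return (k - 1) * (d + 1) + 1


def out_size(i, k, s, d, p):
    """Output extent for input i, kernel k, stride s, zero-based dilation d, symmetric padding p."""
    return (i + 2 * p - dilated_kernel(k, d)) // s + 1


def conv_flops(mb, ic, oc, oh, ow, kh, kw):
    """FLOPs of a dense (groups=1) forward convolution, counting one multiply-add as 2 FLOPs."""
    return 2 * mb * oc * oh * ow * ic * kh * kw


def gflops_per_s(flops, seconds):
    """Achieved throughput in GFLOP/s for a given run time in seconds."""
    return flops / seconds / 1e9


# Shape taken from the benchdnn descriptor in the command above.
mb, ic, oc = 1, 64, 64
ih, iw, kh, kw = 170, 256, 3, 3
sh = sw = 1
dh = dw = 1  # zero-based: effective 5x5 footprint for a 3x3 kernel
ph = pw = 1

oh = out_size(ih, kh, sh, dh, ph)
ow = out_size(iw, kw, sw, dw, pw)
flops = conv_flops(mb, ic, oc, oh, ow, kh, kw)

peak = 92.8  # GFLOP/s, single-core peak quoted in the thread
achieved = gflops_per_s(flops, 0.144)  # ASSUMPTION: 144 ms is one forward call

print(f"output: {oh}x{ow}, work: {flops / 1e9:.3f} GFLOP")
print(f"achieved: {achieved:.2f} GFLOP/s ({achieved / peak:.1%} of peak)")
```

One thing worth checking with this in hand: PyTorch's `dilation` argument is one-based (`dilation=1` means no dilation), so a PyTorch call with `dilation=1` and a benchdnn run with `dh1dw1` would not describe the same convolution.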