polyval: match ideal assembly #44

tarcieri · 2019-12-21T19:11:53Z

The previous implementation used separate `#[target_feature(...)]` blocks for each `core::arch` intrinsic. This thwarts the inliner, so these all translated to `call` instructions.

This change inlines the intrinsic calls into larger `#[target_feature(...)]`-gated functions.
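
To illustrate the pattern, here is a minimal sketch and not the code from this PR. It groups several `core::arch` intrinsics inside one `#[target_feature(...)]`-gated function so the compiler can emit them inline instead of as `call` instructions. The names `clmul128`, `clmul128_pclmul`, and `clmul128_soft` are hypothetical, and the schoolbook 128x128-bit carry-less multiply is only a stand-in for the crate's real field arithmetic.

```rust
#[cfg(target_arch = "x86_64")]
use core::arch::x86_64::*;

/// Hypothetical helper: all intrinsics live in ONE feature-gated function,
/// so they share the same target features and can be inlined together.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "pclmulqdq")]
unsafe fn clmul128_pclmul(a: u128, b: u128) -> (u128, u128) {
    // x86_64 is little-endian, so the low 64 bits land in the low lane.
    let a: __m128i = core::mem::transmute(a);
    let b: __m128i = core::mem::transmute(b);

    // Four 64x64 carry-less partial products (schoolbook method).
    let lo = _mm_clmulepi64_si128::<0x00>(a, b); // a.lo * b.lo
    let hi = _mm_clmulepi64_si128::<0x11>(a, b); // a.hi * b.hi
    let m0 = _mm_clmulepi64_si128::<0x10>(a, b); // a.lo * b.hi
    let m1 = _mm_clmulepi64_si128::<0x01>(a, b); // a.hi * b.lo
    let mid = _mm_xor_si128(m0, m1);

    // Fold the middle term into both halves (byte shifts: 8 bytes = 64 bits).
    let lo = _mm_xor_si128(lo, _mm_slli_si128::<8>(mid));
    let hi = _mm_xor_si128(hi, _mm_srli_si128::<8>(mid));

    (core::mem::transmute(lo), core::mem::transmute(hi))
}

/// Portable bit-by-bit fallback, also handy as a reference in tests.
fn clmul128_soft(a: u128, b: u128) -> (u128, u128) {
    let (mut lo, mut hi) = (0u128, 0u128);
    for i in 0..128u32 {
        if (b >> i) & 1 == 1 {
            lo ^= a << i;
            if i > 0 {
                hi ^= a >> (128 - i);
            }
        }
    }
    (lo, hi)
}

/// Safe entry point: returns the 256-bit product as (low, high) halves.
pub fn clmul128(a: u128, b: u128) -> (u128, u128) {
    #[cfg(target_arch = "x86_64")]
    {
        if std::is_x86_feature_detected!("pclmulqdq") {
            // SAFETY: the required CPU feature was just detected at runtime.
            return unsafe { clmul128_pclmul(a, b) };
        }
    }
    clmul128_soft(a, b)
}
```

Calling `clmul128(3, 3)` multiplies `(x + 1)` by itself over GF(2) and should yield `x^2 + 1`, i.e. a low half of `5` and a high half of `0`. Whether the intrinsics actually inline is best checked by inspecting the generated assembly for the gated function.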

When compiling with `-C target-cpu=skylake`, the generated assembly matches the idealized version (at least for the Montgomery fast reduction) described in this QuarksLab blog post:

https://blog.quarkslab.com/reversing-a-finite-field-multiplication-optimization.html

Their version:

Godbolt: https://godbolt.org/z/Zjuvwu

tarcieri merged commit a191d71 into master Dec 21, 2019

tarcieri deleted the polyval/match-ideal-assembly branch December 21, 2019 19:27

tarcieri mentioned this pull request Dec 21, 2019

polyval v0.3.3 #45

Merged

tarcieri mentioned this pull request Feb 27, 2020

Review AES/GCM and ChaCha20+Poly1305 Audit by NCCGroup RustCrypto/AEADs#87

Closed

This was referenced Jul 26, 2023

VAES support RustCrypto/block-ciphers#372

Open

polyval: detect VPCLMULQDQ at runtime #184

Open
