argon2: Add parallel lane processing #149

aumetra · 2021-03-26T17:38:02Z

Adds parallel processing for lanes
Closes #103

The parallelism is gated behind the feature parallel

When the feature is activated, the #![forbid(unsafe_code)] gets downgraded to #![deny(unsafe_code)] due to unsafe usage (here)
The #![no_std] flag is being disabled as well

tarcieri · 2021-03-26T17:40:37Z

Thanks for implementing this.

I think it might make sense to use rayon to manage the thread pool, similar to what we have in the pbkdf2 crate:

password-hashes/pbkdf2/src/lib.rs

Lines 110 to 123 in 48e3c53

    
           /// Generic implementation of PBKDF2 algorithm. 
        
           #[cfg(feature = "parallel")] 
        
           #[inline] 
        
           pub fn pbkdf2<F>(password: &[u8], salt: &[u8], rounds: u32, res: &mut [u8]) 
        
           where 
        
               F: Mac + NewMac + Clone + Sync, 
        
           { 
        
               let n = F::OutputSize::to_usize(); 
        
               let prf = F::new_varkey(password).expect("HMAC accepts all key sizes"); 
        
               res.par_chunks_mut(n).enumerate().for_each(|(i, chunk)| { 
        
                   pbkdf2_body(i as u32, chunk, &prf, salt, rounds); 
        
               }); 
        
           }

argon2/src/instance.rs

argon2/src/lib.rs

argon2/src/instance.rs

argon2/src/lib.rs

tarcieri

Cool, looks good to me.

I'll give @newpavlov a few days to comment in case he can think of a better solution to the mutable aliasing problem.

nikomatsakis

I got intrigued by the twitter thread, as @Bascule hoped, but I don't quite understand what this code is trying to do. :( More pointers would be helpful! I do see some problems with it as written, though.

argon2/src/instance.rs

tarcieri · 2021-03-28T15:16:55Z

I can try to write a short synopsis of how the Argon2 paper describes parallel operation of the algorithm (mostly summarizing sections 3.2 and 3.3, see also section 6.2 Implementing parallelism):

https://www.password-hashing.net/argon2-specs.pdf

The algorithm operates over a matrix of "blocks" consisting of:

𝒑 rows ("lanes"), where 𝒑 is the desired number of worker threads
𝒒 columns

Each lane is further subdivided into 𝑺 = 4 "slices" (in Argon2 terminology, and referred to as SYNC_POINTS in the code), and the intersection of a slice and a lane forms a segment of length 𝒒 / 𝑺.

Segments of the same slice are computed in parallel and therefore cannot reference each other. So from a memory model perspective, what we'd really like is to mutably borrow the values of a particular slice, partition them into segments, and give each worker thread access to a particular segment.

However, we also need to allow all of the worker threads to simultaneously borrow all of the other blocks which do not belong to the "slice" being operated on to reference as inputs.

This is the tricky part: the fill_segment operation can reference blocks from the current lane, or other lanes, but will not reference blocks from the same "slice" being operated on.

Having just written all of that down (thanks for rubber ducking if nothing else), I think I have a better idea of how to model this problem safely in Rust: "slices" (in the Argon2 sense) should be the core level of granularity in which the working "memory" is organized.

The main loop of the algorithm iterates over the slices. Provided I'm actually understanding this correctly, we can borrow one slice mutably at the time and the others immutably. The mutably borrowed slice can then be subdivided into a segment for each lane, given to the worker threads along with immutable references to all of the other slices.

I think a big part of what's making this so tricky right now is the memory consists of a contiguous Vec<Block>. I think that might still be fine for the backing storage, but perhaps we could mediate access to segments through another type that splits the borrows by Argon2 "slice", allowing one "slice" to be acted on mutably and the others referenced as inputs.

tarcieri · 2021-03-28T15:37:34Z

I think a next step which might be helpful in general is to extract a struct Memory which borrows from a backing [Block] slice, pass that to Instance::new instead of the raw &'a mut [Block], and provide a method on Memory for accessing blocks by their Position.

From there we can look at borrow splitting the backing buffer so Memory only holds 3 of the 4 "slices" at a time, and can be used to look up the "reference" blocks being used to fill a segment, but it would not have access to the slice being operated over (which would be borrowed mutably and split up into segments among the worker threads).

tarcieri

Also based on @nikomatsakis's comments I'm going to unapprove this for now

…nter in rayon closure

aumetra · 2021-04-18T18:23:36Z

This should fix the UB as every thread now dereferences the pointer itself

tarcieri · 2021-04-18T18:31:11Z

@smallglitch nice! I think that's a start.

Do you want to mark the PR as ready for review?

aumetra · 2021-04-18T18:33:12Z

Sure, I wasn't sure if you'd be ok with an unsafe implementation
If an unsafe implementation is ok, I think this should be fine

tarcieri

Using unsafe is fine for now. I can circle back on a safe implementation.

Thanks for extracting a Memory type.

tarcieri · 2021-04-18T19:03:18Z

I opened #154 to track making the implementation safe

…k-dev-macro digest: fix benchmark dev macro

add parallel lane processing

f06f51d

replace own threading solution with rayon

03ac904

tarcieri reviewed Mar 26, 2021

View reviewed changes

argon2/src/instance.rs Outdated Show resolved Hide resolved

tarcieri reviewed Mar 27, 2021

View reviewed changes

argon2/src/lib.rs Outdated Show resolved Hide resolved

remove cfg_attr from no_std

15db90f

tarcieri reviewed Mar 27, 2021

View reviewed changes

argon2/src/instance.rs Show resolved Hide resolved

tarcieri reviewed Mar 27, 2021

View reviewed changes

argon2/src/lib.rs Outdated Show resolved Hide resolved

replace conditional deny/forbid with just deny

87b0dd7

tarcieri approved these changes Mar 27, 2021

View reviewed changes

tarcieri requested a review from newpavlov March 27, 2021 19:07

nikomatsakis suggested changes Mar 28, 2021

View reviewed changes

argon2/src/instance.rs Outdated Show resolved Hide resolved

argon2/src/instance.rs Show resolved Hide resolved

tarcieri requested changes Mar 28, 2021

View reviewed changes

aumetra marked this pull request as draft March 29, 2021 13:38

create memory struct and move blocks slice to it, dereference raw poi…

c250bfc

…nter in rayon closure

aumetra marked this pull request as ready for review April 18, 2021 18:31

tarcieri approved these changes Apr 18, 2021

View reviewed changes

tarcieri merged commit dd8d13b into RustCrypto:master Apr 18, 2021

tarcieri mentioned this pull request Apr 18, 2021

argon2: make parallel implementation safe #154

Closed

tarcieri mentioned this pull request Apr 18, 2021

argon2 v0.1.5 #155

Merged

tarcieri mentioned this pull request Oct 2, 2021

Argon2 refactor #247

Merged

dns2utf8 pushed a commit to dns2utf8/password-hashes that referenced this pull request Jan 24, 2023

Merge pull request RustCrypto#149 from RustCrypto/digest/fix-benchmar…

d8c0b4d

…k-dev-macro digest: fix benchmark dev macro

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

argon2: Add parallel lane processing #149

argon2: Add parallel lane processing #149

aumetra commented Mar 26, 2021

tarcieri commented Mar 26, 2021

tarcieri left a comment

nikomatsakis left a comment

tarcieri commented Mar 28, 2021 •

edited

Loading

tarcieri commented Mar 28, 2021

tarcieri left a comment

aumetra commented Apr 18, 2021

tarcieri commented Apr 18, 2021

aumetra commented Apr 18, 2021

tarcieri left a comment

tarcieri commented Apr 18, 2021

argon2: Add parallel lane processing #149

argon2: Add parallel lane processing #149

Conversation

aumetra commented Mar 26, 2021

tarcieri commented Mar 26, 2021

tarcieri left a comment

Choose a reason for hiding this comment

nikomatsakis left a comment

Choose a reason for hiding this comment

tarcieri commented Mar 28, 2021 • edited Loading

tarcieri commented Mar 28, 2021

tarcieri left a comment

Choose a reason for hiding this comment

aumetra commented Apr 18, 2021

tarcieri commented Apr 18, 2021

aumetra commented Apr 18, 2021

tarcieri left a comment

Choose a reason for hiding this comment

tarcieri commented Apr 18, 2021

tarcieri commented Mar 28, 2021 •

edited

Loading