Improve brillig execution speed by unrolling small loops #6470

TomAFrench · 2024-11-07T11:19:04Z

unconstrained fn __validate_gt_remainder(a_u60: [u64; 6]) -> [u64; 6] {
    let mut result_u60: [u64; 6] = [0; 6];

    for i in 0..6 {
        result_u60[i] = a_u60[i] + 1;
    }

    result_u60
}

Here we've got a for-loop with a small and simple body, this function compiles to:

brillig(inline) fn __validate_gt_remainder f2 {
  b0(v0: [u64; 6]):
    inc_rc v0
    inc_rc [u64 0, u64 0, u64 0, u64 0, u64 0, u64 0]
    v4 = allocate
    store [u64 0, u64 0, u64 0, u64 0, u64 0, u64 0] at v4
    jmp b1(u32 0)                                                 // loop boilerplate
  b1(v1: u32):
    v7 = lt v1, u32 6                                             // loop boilerplate
    jmpif v7 then: b3, else: b2                                   // loop boilerplate
  b3():
    v9 = load v4                                                  // loop boilerplate
    v10 = array_get v0, index v1                                  // logic
    v12 = add v10, u64 1                                          // logic
    v13 = array_set v9, index v1, value v12                       // logic
    v15 = add v1, u32 1                                           // loop boilerplate
    store v13 at v4                                               // loop boilerplate
    v16 = add v1, u32 1                                           // duplicate instruction
    jmp b1(v16)                                                   // loop boilerplate
  b2():
    v8 = load v4                                                  // loop boilerplate
    dec_rc v0
    return v8
}

We've then got 3 instructions of actual logic and 8 of boilerplate and when it comes to execution we have 3*num_iterations instructions of actual logic and 2+6*num_iterations of boilerplate. This gives us a tradeoff between bytecode size and execution speed.

We should aim to unroll any loops where loop_instructions * num_iterations <= loop_instructions + 8 => loop_instructions * (num_iterations - 1) <= 8 as this will reduce bytecode sizes. There's a decent chance that we'd get benefits for unrolling larger loops as well due to constant folding.

The text was updated successfully, but these errors were encountered:

github-project-automation bot added this to Noir Nov 7, 2024

github-project-automation bot moved this to 📋 Backlog in Noir Nov 7, 2024

TomAFrench assigned aakoshh Nov 7, 2024

aakoshh mentioned this issue Nov 12, 2024

feat(ssa): Unroll small loops in brillig #6505

Merged

16 tasks

aakoshh closed this as completed in #6505 Nov 19, 2024

github-project-automation bot moved this from 📋 Backlog to ✅ Done in Noir Nov 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve brillig execution speed by unrolling small loops #6470

Improve brillig execution speed by unrolling small loops #6470

TomAFrench commented Nov 7, 2024 •

edited by aakoshh

Loading

Improve brillig execution speed by unrolling small loops #6470

Improve brillig execution speed by unrolling small loops #6470

Comments

TomAFrench commented Nov 7, 2024 • edited by aakoshh Loading

TomAFrench commented Nov 7, 2024 •

edited by aakoshh

Loading