defer implementation: instead of duplicating the code at every exit path, make a basic block that exit paths point to #283

andrewrk · 2017-03-26T07:54:15Z

shawnl · 2019-04-06T01:33:42Z

That's what __attribute__((cleanup)) does in C code.

Sircular · 2020-05-23T06:10:57Z

I'm not sure how this would work inside loops, but I'm guessing that outside of loops this would require a jump to a point near the end of the function. Is goto implemented in Zig IR? I haven't seen it, and that would be the most straightforward way to implement this, but I might have just missed it.

andrewrk · 2020-05-23T06:46:04Z

~~Yes goto is available in zig ir~~

andrewrk · 2020-07-30T23:01:30Z

I no longer think this is important to do in stage1. Here's a proposal for how to make this work in stage2:

zig code

fn entry(cond: bool) i32 {
    defer foo();
    {
        defer bar();
        if (cond) {
            return 10;
        }
        baz();
    }
    quux();
    return ret();
}

ZIR

fn(@fnty, {
    %ret_flag = alloc(bool)
    %18 = store(%ret_flag, false)
    %0 = block(defer_foo, {
        %14 = block(user, {
            %4 = block(defer_bar, {
                %6 = block(if, {
                    %7 = condbr(%arg0, {
                        %16 = int(10)
                        %17 = set_ret_val(%16)
                        %19 = store(%ret_flag, true)
                        %8 = break_void(defer_bar)
                    }, {
                        %11 = break_void(if)
                    })
                })
                %12 = call(baz, [])
                %10 = break_void(defer_bar)
            })
            %5 = call(bar, [])
            %21 = load(%ret_flag)
            %9 = cond_break_void(%21, defer_foo, user)
        }
        %13 = call(quux, [])
        %2 = call(ret, [])
        %3 = set_ret_val(%2)
        %20 = store(%ret_flag, true)
        %14 = break_void(defer_foo)
    })
    %1 = call(foo, [])
})

So the idea here is to create a block for every defer expression. The defer expression goes after the block ends, which makes it run unconditionally upon exiting the block. Here I've renamed brvoid to break_void for clarity, and introduced cond_break_void which is the same thing except has a boolean condition that decides which block to break from.

At the very end, the "return" from the function is implied.

You can probably spot some easy optimizations to do here. Sometimes blocks could be elided, and defer expressions that occur in a row could be put into the same block.

I should come up with an example with errdefer but it does work nicely with this. The %ret_flag turns into an enum instead of a bool, and the cond_break_void turns into a switch_break_void with the 3 possibilities being enum { not_returning, return_with_error, return_with_payload }, and depending on the tag, control flow will proceed to, respectively, the next block in scope, the next defer (including errdefers), or the next non-err-defer.

andrewrk · 2021-04-21T00:07:35Z

I spent some time on this. Here are some points:

It is not obvious that one way or the other will be better in terms of performance. Empirically, LLVM has no problem optimizing status quo (duplicated expressions). The alternative presented here has not been tested.
The duplicated expressions strategy (status quo) is simpler to implement.

Therefore I will stick with the duplicated expressions strategy, and mark this issue as an optimization, that needs to be investigated before it is accepted as the plan for the future.

LemonBoy · 2021-04-21T09:10:13Z

The duplicated-expression approach can be seen as an optimized version of the multiple-block one where you're inlining every single block.
IMO it makes sense to implement the block approach and allow LLVM but also the Zig frontend to eventually inline them if needed, eg. depending on the optimization flags.
If you consider a function with a decent number of exit points the block-approach produces much smaller code and so it's better suited for ReleaseSize, while for ReleaseFast we may get better performance by inlining the sequence at every callsite provided it doesn't expand to too many ZIR instructions.

lerno · 2022-03-31T08:23:21Z

What about static variables in the duplicated expression strategy? Does Zig currently correctly handle that?

andrewrk added enhancement Solving this issue will likely involve adding new logic or components to the codebase. optimization labels Mar 26, 2017

andrewrk modified the milestone: 0.3.0 May 20, 2017

andrewrk modified the milestones: 0.3.0, 0.4.0 Feb 28, 2018

andrewrk mentioned this issue May 17, 2018

support code coverage when testing #352

Open

andrewrk modified the milestones: 0.4.0, 0.5.0 Feb 22, 2019

andrewrk mentioned this issue Apr 5, 2019

unreachable code error prevents idiomatic usage of defer #2198

Closed

andrewrk modified the milestones: 0.5.0, 0.6.0 Aug 22, 2019

andrewrk added the stage1 The process of building from source via WebAssembly and the C backend. label Feb 10, 2020

andrewrk modified the milestones: 0.6.0, 0.7.0 Feb 10, 2020

LemonBoy mentioned this issue May 13, 2020

errdefer in json.parseInternal causes lots of code generation #5327

Closed

Vexu mentioned this issue Jul 29, 2020

stage2: remove operand from return instruction #5951

Closed

andrewrk added frontend Tokenization, parsing, AstGen, Sema, and Liveness. and removed stage1 The process of building from source via WebAssembly and the C backend. labels Jul 30, 2020

andrewrk removed the optimization label Aug 19, 2020

andrewrk modified the milestones: 0.7.0, 0.8.0 Oct 9, 2020

andrewrk added optimization proposal This issue suggests modifications. If it also has the "accepted" label then it is planned. labels Apr 21, 2021

andrewrk modified the milestones: 0.8.0, 1.0.0 Apr 21, 2021

This was referenced Jun 1, 2022

stage2-generated binaries are slower and more bloated than stage1-generated binaries #11498

Closed

introduce a "try" ZIR and AIR instruction #11772

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

defer implementation: instead of duplicating the code at every exit path, make a basic block that exit paths point to #283

defer implementation: instead of duplicating the code at every exit path, make a basic block that exit paths point to #283

andrewrk commented Mar 26, 2017

shawnl commented Apr 6, 2019

Sircular commented May 23, 2020

andrewrk commented May 23, 2020 •

edited

Loading

andrewrk commented Jul 30, 2020

andrewrk commented Apr 21, 2021

LemonBoy commented Apr 21, 2021

lerno commented Mar 31, 2022

defer implementation: instead of duplicating the code at every exit path, make a basic block that exit paths point to #283

defer implementation: instead of duplicating the code at every exit path, make a basic block that exit paths point to #283

Comments

andrewrk commented Mar 26, 2017

shawnl commented Apr 6, 2019

Sircular commented May 23, 2020

andrewrk commented May 23, 2020 • edited Loading

andrewrk commented Jul 30, 2020

zig code

ZIR

andrewrk commented Apr 21, 2021

LemonBoy commented Apr 21, 2021

lerno commented Mar 31, 2022

andrewrk commented May 23, 2020 •

edited

Loading