Rust: Some closures are not inlined in release mode

Created on 17 Aug 2019 · 8Comments · Source: rust-lang/rust

Consider the following code (playground):

fn main() {
    let err = Err(());
    let _: usize = err.unwrap_or_else(|err| err_exit(err));
    unreachable!();
}

fn err_exit(_: ()) -> ! {
    std::process::exit(1);
}

When compiled with rustc 1.36, it gives the following assembly:

core::result::Result<T,E>::unwrap_or_else:
    pushq   %rax
    callq   playground::main::{{closure}}
    ud2

playground::main:
    pushq   %rax
    callq   core::result::Result<T,E>::unwrap_or_else
    ud2

playground::main::{{closure}}:
    pushq   %rax
    callq   playground::err_exit
    ud2

playground::err_exit:
    pushq   %rax
    movl    $1, %edi
    callq   *std::process::exit@GOTPCREL(%rip)
    ud2

Note how the closure is not inlined, even though it would be trivial to do so (replace callq playground::main::{{closure}} with callq playground::err_exit).

A-codegen C-bug I-slow T-compiler

Source

jyn514

Most helpful comment

LLVM intentionally discourages inlining for such call sites, see https://github.com/llvm/llvm-project/blob/master/llvm/lib/Analysis/InlineCost.cpp#L782

Though arguably in this case, the bonus for completely eliminating the called function should still be applied, but isn't because it's handled only at the end of that method, thus the early return skips it. Moving the check for allowSizeGrowth to the end gives the expected result, main directly calling process_exit, but that seems suboptimal. I'll try to prepare a testcase and bug report (or patch) for LLVM.

dotdash on 6 Sep 2019

👍4

All 8 comments

Your playground link is wrong, use the share button on playground to get a permalink

RustyYato on 17 Aug 2019

👍1

Does
````rust

[inline(always)]

fn err_exit(_: ()) -> ! {
std::process::exit(1);
}
````
do the trick? Perhaps llvm is missing something here...

matthiaskrgr on 17 Aug 2019

That fixes it, yeah. #[inline] does not, which seems odd.

I don't want to use #[inline(always)] in my actual code because err_exit is a pretty big function and I'd like it to only be inlined if it doesn't trash the instruction cache. The closure seems like an ideal place to inline because the calling function is really small.

jyn514 on 17 Aug 2019

I noticed some time ago that if your code unconditionally ends with a function that returns ! then some or all of preceding functions may not get inlined unless marked #[inline(always)], which is pretty awful.

Interestingly, in this example print is not inlined on Stable 1.37 and Beta 1.38, but is inlined on Nightly.

MSxDOS on 21 Aug 2019

beta 1.38 version 2019-08-13 e450539c2a8d7f791268 is showing print as inlined on playground, what version of the compiler did you use?

jyn514 on 21 Aug 2019

The one on playground, but I didn't check its version. Either it was updated since then, or I made a mistake, or something else.

MSxDOS on 21 Aug 2019

LLVM intentionally discourages inlining for such call sites, see https://github.com/llvm/llvm-project/blob/master/llvm/lib/Analysis/InlineCost.cpp#L782

dotdash on 6 Sep 2019

👍4

There was already a bug report at https://bugs.llvm.org/show_bug.cgi?id=26495

I commented there with what I found out so far.

dotdash on 9 Sep 2019

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Lifetime elision is too greedy without explicit type declaration

dnsl48 · 3Comments

Use #[repr(C)] HList's to infer type-erased fmt fn pointers in format_args!'s static data.

eddyb · 3Comments

Internal compiler error: cannot relate bound region (likely caused by `conservative_impl_trait`)

jmegaffin · 3Comments

Callback parameter names are missing from rustdoc

dtolnay · 3Comments

Comments for macro expansions

Robbepop · 3Comments