Rust: Replace our fragile safety scheme around erroneous constants

Created on 10 Dec 2019 · 46Comments · Source: rust-lang/rust

Discussion: https://github.com/rust-lang/rust/pull/67134#discussion_r355899976

The situation is as follows:

Sometimes code relies on a failing constant for safety (see https://github.com/rust-lang/rust/issues/67083)
Those failing constants may have a ZST type (e.g. ())
A MIR optimization may see the constant and optimize it out (this happened: https://github.com/rust-lang/rust/pull/66216)
The constant not showing up in MIR can cause it to never get evaluated, thus never reporting its error
The runtime code relying on safety by having a hard error during const eval is now being executed even though it's unsound

The proposed solution is to gather all unevaluated constants in a MIR right after mir building and put them in a Vec<(DefId, SubstsRef<'tcx>, Promoted)> (or maybe a HashSet?). This should be placed on the mir::Body and inlining should also carry it down to the function that its being inlined into (and adjust substitutions on the SubstsRef). In the end, instead of having the collector go through the MIR to find constants to evaluate, we go through the vector and evaluate all of them.

This will require more memory and some additional evaluation time, but it would be significantly less fragile than the current setup which relies on optimizations not removing even guaranteed dead code containing constants.

cc @RalfJung @wesleywiser

A-mir C-cleanup C-enhancement P-medium T-compiler

Source

oli-obk

Most helpful comment

I think it would be mighty confusing if whether if foo() { ... } compiled depended on whether foo is a const fn.

RalfJung on 2 Jan 2020

👍7

All 46 comments

Also Cc @Centril -- I am curious about your position on "compilation failure" being a side-effect of CTFE that we want to guarantee is being preserved. On the one hand this seems useful for some things, on the other hand it makes CTFE somewhat impure.

Relevant discussions:

RalfJung on 10 Dec 2019

@RalfJung I think I'm lacking some context; could you elaborate re. the various positions one could take and their trade-offs?

Centril on 10 Dec 2019

Essentially the question boils down to "is it ok for the following program to never emit a hard error, but at best emit a dead code warning or a const_err lint"

const FOO: usize = 0 - 1;
fn main() {
    if false {
        println!("{}", FOO);
    }
}

Everything else is just a more complex instance of the shown example. Although some of these situations may in fact be relied upon by users for safety:

const FOO: ! = panic!();
fn main() {
    let _ = FOO;
    unsafe {
        *(5 as *mut i32) = 49;
    }
}

In the above code it can be expected that the unsound code is in fact dead code (as the compiler tells you: https://play.rust-lang.org/?version=nightly&mode=debug&edition=2018&gist=a2e0b258329500d44609bd82afd8ed07) and it even is unreachable, but only because the let _ = FOO; essentially translates to the equivalent of std::intrinsics::unreachable(). So the program exhibits UB because the code compiled successfully, when it should not.

oli-obk on 10 Dec 2019

Hm, that uninhabited type example is interesting, I was actually thinking of inhabited ZSTs so far... there we don't even have much of a choice. We have to abort compilation. (Well we could generate a [safe] trap but that would be very surprising.)

Things are less clear in situations like this:

const FOO: ! = panic!();
fn main() {
    if false {
        let _ = FOO;
    }
}

One could argue that we shouldn't error about bad constants in dead code. Is there a way people could rely on compilation passing when their CTFE error is only used in dead code?

RalfJung on 10 Dec 2019

What's the reason behind having bad constants inside dead code? The only situation I can imagine is some platform specific constant evaluation that would succeed in one platform and fail in another. But even then you could be better using conditional compilation instead.

christianpoveda on 10 Dec 2019

@oli-obk @RalfJung So I think I understand the question now, but some elaboration re. "impure" and what drawbacks checking constants in dead code would be good. It sounds like the drawback would be potentially regressing perf and memory usage (but the evaluation of the constants used in live code should be memoized (?) so we would mostly only pay for the ones in dead code?

My position for now is that:

Optimizations like const-prop et al. should not affect what code compiles under #![allow(const_err)] to maintain a basic "as-if" rule for defining the language. Thus the notion of "dead code" should be the pre-optimization notion. E.g. if according to NLL if false $block has $block live, then it is also live according to language rules. However, if { return; } $block would have $block dead according to NLL and so that legitimizes the idea that we do not error on bad constants in $block.
Using #[deny(const_err)] for soundness of library code is dubious as it is a lint.

Centril on 11 Dec 2019

👍1

(Sorry if I'm misunderstanding the question here)

From a user's point of view I think control-flow based analyses are confusing. I think in this code

const FOO: ! = panic!();
fn main() {
    let _ = FOO;
    unsafe {
        *(5 as *mut i32) = 49;
    }
}

We should abort compilation. Same in this example

const FOO: ! = panic!();
fn main() {
    if false {
        let _ = FOO;
    }
}

If you ever worked with a langauge with control-flow based static checking (e.g. RPython) it's really confusing for a user for several reasons:

You get different compile-time feedback from the compiler based on how good its control-flow analysis is, e.g. a compiler may be able to figure out if false { ... } will never be taken and allow compile-time errors inside the branch, but you change it to if f() { ... } where fn f() -> bool { false } and suddenly your code doesn't compile. It's even worse if the analysis is done post-optimization as that makes your program compile or not depending on the optimizer as well.
During development sometimes you start implementing a function but do not call it yet. Until you actually call that function you don't get any feedback from the compiler for that function which is really annoying. E.g. in the example above if main() were actually f() and were not called by main() just yet you wouldn't get an abort in compile time until you call it from main. That's really bad IMHO.

I'm very new at rustc development but I think compile-time evaluation is done in MIR, which can be optimized. If this is done post-optimizations then we can also add an item here about getting different compile-time feedback depending on optimizations, which is also bad (GHC also has such problems, though they're really minor stuff and most users are probably not even aware of these).

osa1 on 12 Dec 2019

T-compiler triage: leaving unprioritized until we see feedback from T-lang on what we should do here.

pnkfelix on 12 Dec 2019

We discussed this briefly in the language team meeting this Thursday. In this meeting, only @Centril, @nikomatsakis, and @ecstatic-morse attended.

Our basic conclusions were that having one consistent standard for "dead" / "live" as a matter of language definition (unaffected by compiler optimizations and lints like const_err) would be good. As NLL already has one standard, using that one would be sensible. In other words, if false { BAD_CONST } would still be a hard error whereas if {return} { BAD_CONST } would not be (as NLL would consider BAD_CONST dead). We also felt that relying on const_err for safety was not a good idea.

We briefly touched upon implementation strategy and wondered whether e.g. collecting things during the NLL analysis would be a good idea. This is of course left open for the compiler team to decide. :)

Centril on 13 Dec 2019

Some extra information: it appears that #67164 caused a significant performance regression. (It's not 100% certain, because it was part of a rollup and I am having trouble reproducing the regression locally.) According to this comment, this PR would allow #67164 to be reverted, thus undoing the performance regression. So that increases the urgency of this.

nnethercote on 16 Dec 2019

Will this safety scheme also apply the negative bounds like !Sync for structs that are referenced as const? Because currently, that is also an issue and it is stale for a while.

I assume this kind of bound problem will be triggered before actual control flow checking. Are we going to touch parts that disables this kind of unsafe impl boundary problems?

vertexclique on 17 Dec 2019

triage: P-high to resolve desired semantics and implementation strategy here. (I don't know what priority to assign to the actual implementation work yet though...)

pnkfelix on 19 Dec 2019

Will this safety scheme also apply the negative bounds like !Sync for structs that are referenced as const? Because currently, that is also an issue and it is stale for a while.

No, this PR is not about any kind of changes to the way constants by themselves behave. This is solely about the question whether any used constant should cause an error, even if that use is in dead code. Though there's also the question as to what kind of dead code we look at. E.g. is

if false {
    CONSTANT;
}

a use? If yes, is

if super_hard_to_const_eval_to_false() {
    CONSTANT;
}

a use?

panic!();
CONSTANT;

I assume this kind of bound problem will be triggered before actual control flow checking. Are we going to touch parts that disables this kind of unsafe impl boundary problems?

I don't understand the questoin. Can you give an example?

oli-obk on 19 Dec 2019

(According to https://github.com/rust-lang/rust/issues/67191#issuecomment-565270786 it would be yes, yes, and no.)

Centril on 19 Dec 2019

👍1

We also felt that relying on const_err for safety was not a good idea.

Note that there were examples where it wasn't const_err that was being relied on, but that a const FOO: ! = panic!(); being used in live code will abort compilation, even if an optimization decides that code is irrelevant because using a ZST constant is essentially a NOP

oli-obk on 19 Dec 2019

@oli-obk

I don't understand the question. Can you give an example?

This answers my question actually:

Note that there were examples where it wasn't const_err that was being relied on, but that a const FOO: ! = panic!(); being used in live code will abort compilation

I wanted to know that const FOO: ! = panic!(); the condition will cause abort or not? because that is control flow checking and in type manner, this is purely correct. And tbh someone might want to deref this in their code? Is this unsound?

iirc: derefing this on embedded systems causes abort instr. to execute. That might be useful I assume.

vertexclique on 20 Dec 2019

So, I definitely don't expect a full optimization-based liveness analysis; it seems fine to me if (say) if non_const_function() { const FOO: ! = panic!(); } aborts compilation. But on the other hand, I'd like to have the following not abort compilation:

if false { const FOO: ! = panic!(); }

const CONST_FALSE: bool = false;
if CONST_FALSE { const FOO: ! = panic!(); }

const fn const_false_fn() -> bool { false }
if const_false_fn() { const FOO: ! = panic!(); }

const fn const_false_fn() -> bool { false }
if const_false_fn() { return AnError; }
const FOO: ! = panic!();

Rationale: this makes it easy to use patterns similar to those in the Linux kernel, where a module uses compile-time configuration to either provide certain functionality or stub out that functionality, and the stub functionality could use const and const fn and expect to get eliminated with dead-code elimination.

joshtriplett on 2 Jan 2020

I think it would be mighty confusing if whether if foo() { ... } compiled depended on whether foo is a const fn.

RalfJung on 2 Jan 2020

👍7

@joshtriplett I feel that Linux's configuration system is an outgrowth of the lack of a good macro system and build tool in C. In rust, I feel that cfg should be used instead...

mark-i-m on 2 Jan 2020

@mark-i-m I'm not talking about kconfig. I'm talking about code modules.

Suppose you have an API with half a dozen functions and a thousand callers. You can use cfg to define the functions when configured in, or to define stub versions of the functions when not configured in, and either way, you don't need to change the thousand callers and litter myriad source files with additional cfg directives.

The stub versions of the functions can be const functions, and any types involved can become ZSTs, so a feature that has been configured out has zero overhead.

Depending on the semantics of the feature you compile out, sometimes you want it to return a constant (such as true or false or NULL or ()), and sometimes you want it to error (panic) if ever actually invoked at runtime (and not eliminated by dead code elimination).

joshtriplett on 3 Jan 2020

Hmm... I see what your getting at, but isn't that already possible today? Do you need erroneous constants for that?

mark-i-m on 3 Jan 2020

I'm just coming back online, but I'm surprised that static_assertions has not been brought up at all. In my mind, this question is already settled: we need to guarantee that const X will evaluate the initializer of X even if X is crate-private and never used. The discussions about control-flow are immaterial. Every consumer of static_assertions as well as rustc itself relies on this behavior. In fact, this seems so clear to me that I'm starting to wonder whether I've misunderstood @joshtriplett's position.

This does mean that we can no longer optimize by skipping evaluation of unused constants, but this particular optimization is not very important IMO. We should be discussing the extension of the aforementioned rule to associated constants and constants in a generic context, e.g. impl<T: HasAssocConst> X<T> { const C: usize = T::ASSOC_CONST - 1; }.

ecstatic-morse on 3 Jan 2020

👍2

we need to guarantee that const X will evaluate the initializer of X even if X is crate-private and never used.
[...]
We should be discussing the extension of the aforementioned rule to associated constants and constants in a generic context, e.g. impl X { const C: usize = T::ASSOC_CONST - 1; }.

For generic/associated consts, that rule is impossible to implement as the data for the computation is not all available.

Also note that for unused consts, some initializer errors just trigger an err-by-default lint, but not a hard error. This is for backwards compatibility reasons. I do not know if there is any chance of ever cleaning that up.

RalfJung on 3 Jan 2020

For generic/associated consts, that rule is impossible to implement as the data for the computation is not all available.

Well... they already happen post monomorphization. The idea I posted with this issue means that we don't lose them along the way during optimizations.

oli-obk on 3 Jan 2020

...? I was talking about unused constants, based on what I am quoting: "The rule" put up by @ecstatic-morse is (emphasis mine)

we need to guarantee that const X will evaluate the initializer of X even if X is crate-private and never used.

Unused constants don't have anything happen post-monomorphization.

RalfJung on 3 Jan 2020

The simple extension would be: "All monomorphized types have the initializers for any associated constants run, regardless of whether those constants are used". In other words, instantiating TypeWithDependentAssocConst<i32> anywhere would run the initializer for the associated const, even if it is unused. This is not ideal, but might be acceptable? Perhaps there is a way to relax this restriction, but I don't know enough to formulate it.

Anyways, my point is that we need to guarantee that the initializers are run in the non-polymorphic examples above lest we subtly break static_assertions. It's the polymorphic case that requires design work.

ecstatic-morse on 3 Jan 2020

👍3

@mark-i-m That's a good question.

Thinking about it further, I think it'd be possible to work around errors-in-constants, given that such patterns would mostly tend to either be no-op success stubs or no-op error stubs, not panics. And if they did include panics, they still wouldn't need to be called from constants in dead-code if branches; I don't think that case is going to come up for the use cases I was concerned about.

So, I'm withdrawing any concern with the current scheme as proposed.

joshtriplett on 9 Jan 2020

👍1

Based on the consensus to use NLLs notion of liveness in erroring on live bad constants, I'm moving this over to the compiler team for the implementation.

Centril on 10 Jan 2020

Discussed in T-compiler meeting.

This is awaiting an implementor. @oli-obk might be able to provide mentoring notes (their availability was not 100% clear from meeting discussion, though).

Removing nomination, keeping P-high.

pnkfelix on 12 Mar 2020

So once https://github.com/rust-lang/rust/pull/70820 lands, can we revert the extra checking https://github.com/rust-lang/rust/pull/67134 added, and the const_prop checking it mentions, because this is now handled properly by tracking all consts in required_consts?

RalfJung on 22 Apr 2020

70820 reverts the changes #67134 added:

https://github.com/rust-lang/rust/pull/70820/files#diff-91f12a8869ef01be4159cdf9088b7ddeL405-L407

wesleywiser on 22 Apr 2020

Ah, nice!

https://github.com/rust-lang/rust/pull/67134 also does some changes in librustc_codegen_ssa/mir/constant.rs though, what about those? And @oli-obk said const-prop is also doing something special to raise errors from consts.

RalfJung on 23 Apr 2020

Reopening since there are still some leftovers as noted in https://github.com/rust-lang/rust/issues/67191#issuecomment-618218786

oli-obk on 24 Apr 2020

❤2

Also see the Zulip discussion for this (which is still ongoing)

RalfJung on 27 Apr 2020

Changed the priority, this doesn't seem to be P-high anymore.

spastorino on 28 Apr 2020

👍1

After reading around some more, I think I have the following two action items / open questions for this issue:

what about the FIXME, is that still up-to-date (it was added by the PR that got otherwise partially reverted because we now have a better system)?
can we remove all this now? It makes no sense that const_prop would do all sorts of error handling for consts that other optimizations wouldn't do.

RalfJung on 30 Apr 2020

https://github.com/rust-lang/rust/pull/71747 takes care of item 1 (the FIXME).

RalfJung on 1 May 2020

I don't think we should remove item 2. As mentioned on zulip, we do want to trigger these lints on generic code and not just during codegen/monomorphization.

oli-obk on 1 May 2020

I agree we want to get these lints e.g. also for check builds. But why is const_prop out of all transformations the right one to do that? Wouldn't it be better to have some dedicated pass somewhere that just iterates over all required_const, evaluates them, and emits errors when needed? (Codegen already has such a loop. Miri will also need one.)

RalfJung on 1 May 2020

Also it seems like we have quite a bit of code duplication here: both codegen and const_prop contain logic that turns const errors into hard errors. That code should be shared.

Moreover, the diagnostics we give with allow(const_err) are actually pretty bad because we just say "that constant has an error" without saying anything about what the error is. Isn't there a chance that we can make const_err a hard error at some point to clean up this awful mess a bit and avoid duplicate errors?

RalfJung on 1 May 2020

But why is const_prop out of all transformations the right one to do that? Wouldn't it be better to have some dedicated pass somewhere that just iterates over all required_const, evaluates them, and emits errors when needed?

Const prop evaluates only those that it needs for propping, though with the current scheme I think we can actually stop doing that as we know they are either unevaluable (because they are in the required_const set) or they are already evaluated (because they are in the MIR). Unfortunately those from the required_const set often are in the MIR, too, unless they have been optimized away.

There's no point in trying to evaluate those in required_const, as we already know they can't be evaluated without more concrete generics.

Isn't there a chance that we can make const_err a hard error at some point to clean up this awful mess a bit and avoid duplicate errors?

Remember const_err is twofold:

linting that code will panic at runtime: I tried (https://github.com/rust-lang/rfcs/pull/1229) and the decision was to stay with a lint, because then we can arbitrarily add more to it.
linting about broken constants. We're linting them only because that's needed for back-compat, and we could turn that into a future incompat lint, which is a lot more aggressive (and can be turned into a hard error in future editions).

oli-obk on 1 May 2020

linting that code will panic at runtime

That's not const_err any more, that's unconditional_panic or whatever we called it -- right?

linting about broken constants. We're linting them only because that's needed for back-compat, and we could turn that into a future incompat lint, which is a lot more aggressive (and can be turned into a hard error in future editions).

That's what I meant. There even is precedent for making future-incompat lints hard errors on earlier editions. But anyway, if you, too, are in favor of this path -- should I open an issue to track that?

Const prop evaluates only those that it needs for propping, though with the current scheme I think we can actually stop doing that

So now you are saying we can indeed remove this or am I misunderstanding?

RalfJung on 1 May 2020

linting that code will panic at runtime

That's not const_err any more, that's unconditional_panic or whatever we called it -- right?

oops yea

linting about broken constants. We're linting them only because that's needed for back-compat, and we could turn that into a future incompat lint, which is a lot more aggressive (and can be turned into a hard error in future editions).

That's what I meant. There even is precedent for making future-incompat lints hard errors on earlier editions. But anyway, if you, too, are in favor of this path -- should I open an issue to track that?

yes, I really want that.

Const prop evaluates only those that it needs for propping, though with the current scheme I think we can actually stop doing that

So now you are saying we can indeed remove this or am I misunderstanding?

That highlight is pointing at a weird subset of code, I presume you mean

https://github.com/rust-lang/rust/blob/dae90c195989b09475b6c0225a3018cbd7afa587/src/librustc_mir/transform/const_prop.rs#L418-L427

which we can't remove (because we also use it for
https://github.com/rust-lang/rust/blob/dae90c195989b09475b6c0225a3018cbd7afa587/src/librustc_mir/transform/const_prop.rs#L412
), but can seperately do a future incompat lint for just the generic part. We want to lint/error here, because while it's guaranteed that we error during monomorphization, monomorphization may happen only in a crate depending on the currently compiling crate.

But we can probably move it to https://github.com/rust-lang/rust/blob/0b958790b336738540d027d645718713849638d7/src/librustc_mir/transform/required_consts.rs#L19 if we make that a MutVisitor that evaluates all constants that it can. This way we never have to evaluate another constant in the entire MIR optimization pipeline, because we know none can be evaluated.

EDIT: I thought we were already evaluating them during collection, but that's not the case, thus my statement

with the current scheme we can actually stop doing that

is wrong right now, but easily amended to be right.

oli-obk on 2 May 2020

yes, I really want that.

Let's get the ball rolling then: https://github.com/rust-lang/rust/issues/71800

     // Out of backwards compatibility we cannot report hard errors in unused 
     // generic functions using associated constants of the generic parameters.

So this is a bit like the "unused broken consts" lint situation?
But if the assoc constants are of the generic parameters then how can we know them here, since this is running pre-monomorphization?

 // Promoteds must lint and not error as the user didn't ask for them

We already have such logic in const_eval.rs, why does not need to be duplicated here? That seems wrong.

This way we never have to evaluate another constant in the entire MIR optimization pipeline, because we know none can be evaluated.

Yes I was imagining something like that, I think.

I thought we were already evaluating them during collection

Looks like we got another concrete action item then?

RalfJung on 2 May 2020

But if the assoc constants are of the generic parameters then how can we know them here, since this is running pre-monomorphization?

Good question, I don't remember the test that this touched. Maybe I should have referenced it in the comment :/

We already have such logic in const_eval.rs, why does not need to be duplicated here? That seems wrong.

Another good point. We may be able to just not report anything for promoteds here, maybe that's one of our dupliation sites?

Looks like we got another concrete action item then?

Yes: https://github.com/rust-lang/rust/issues/71802

oli-obk on 2 May 2020

Okay, looks like all the const-eval error reporting cleanup that we might be able to do is now tracked at https://github.com/rust-lang/rust/issues/71802. Closing this one.

RalfJung on 2 May 2020

Was this page helpful?

0 / 5 - 0 ratings