When code is compiled using the X87 FPU, comparing to infinity can lead to unexpected behavior.
I tried this code:
```rust
#[inline(never)]
fn get_num() -> f64 {
    let num: f64 = 1.0e300;
    // volatile is to avoid optimizations
    unsafe { std::ptr::read_volatile(&num) }
}

fn main() {
    let x = get_num();
    let y = get_num();
    let z = x * y;
    if z != f64::INFINITY && z != f64::NEG_INFINITY && !z.is_nan() {
        let exp = (z.to_bits() >> 52) & 0x7FF;
        assert!(exp != 0x7FF);
        println!("is finite, exp = {}", exp);
    } else {
        println!("is not finite");
    }
}
```
I expected to see this happen:
This should print either "is not finite" or "is finite, exp = ...". The assert! should never fail: the exponent bits are 0x7FF if and only if the number is infinity or NaN, and in that case the if condition would have been false, so the assert would never have been reached.
Instead, this happened:
When compiled for x86 without SSE, the assert fails. This could also cause safety issues if unsafe code relies on this invariant.
The following command can be used to compile without SSE:
```shell
RUSTFLAGS="-C target-cpu=pentium" cargo run --target i686-unknown-linux-gnu
```
rustc --version --verbose:
rustc 1.43.1 (8d69840ab 2020-05-04)
binary: rustc
commit-hash: 8d69840ab92ea7f4d323420088dd8c9775f180cd
commit-date: 2020-05-04
host: x86_64-unknown-linux-gnu
release: 1.43.1
LLVM version: 9.0
and
rustc 1.45.0-nightly (a74d1862d 2020-05-14)
binary: rustc
commit-hash: a74d1862d4d87a56244958416fd05976c58ca1a8
commit-date: 2020-05-14
host: x86_64-unknown-linux-gnu
release: 1.45.0-nightly
LLVM version: 9.0
For reference, can you post the machine code this results in? I'm particularly curious if there's any calls to functions already built in the distributed libstd, since those would potentially suffer from ABI mismatches (see https://github.com/rust-lang/rust/issues/63597) that could explain the weird behavior.
Sure, the assembly of the main function is (after removing the println!s to simplify it):
```asm
_ZN10float_test4main17h3d6b0b215bf83903E:
	.cfi_startproc
	pushl %ebx
	.cfi_def_cfa_offset 8
	subl $24, %esp
	.cfi_def_cfa_offset 32
	.cfi_offset %ebx, -8
	calll .L9$pb
	.cfi_adjust_cfa_offset 4
.L9$pb:
	popl %ebx
	.cfi_adjust_cfa_offset -4
.Ltmp6:
	addl $_GLOBAL_OFFSET_TABLE_+(.Ltmp6-.L9$pb), %ebx
	calll _ZN10float_test7get_num17hedf1e4650ddf3052E
	fstpl 16(%esp)
	calll _ZN10float_test7get_num17hedf1e4650ddf3052E
	fldl 16(%esp)
	fmulp %st, %st(1)
	fucom %st(0)
	fnstsw %ax
	sahf
	jp .LBB9_5
	flds .LCPI9_0@GOTOFF(%ebx)
	fucomp %st(1)
	fnstsw %ax
	sahf
	jae .LBB9_5
	flds .LCPI9_1@GOTOFF(%ebx)
	fxch %st(1)
	fucom %st(1)
	fstp %st(1)
	fnstsw %ax
	sahf
	jae .LBB9_5
	fstpl 8(%esp)
	movl 12(%esp), %eax
	notl %eax
	testl $2146435072, %eax
	fldz
	je .LBB9_4
.LBB9_5:
	fstp %st(0)
	addl $24, %esp
	.cfi_def_cfa_offset 8
	popl %ebx
	.cfi_def_cfa_offset 4
	retl
.LBB9_4:
	.cfi_def_cfa_offset 32
	fstp %st(0)
	calll _ZN3std9panicking11begin_panic17h261cc8b487132e56E
	ud2
...
.LCPI9_0:
	.long 4286578688
.LCPI9_1:
	.long 2139095040
```
Generated with:
```shell
RUSTFLAGS="-C target-cpu=pentium" cargo rustc --release --target i686-unknown-linux-gnu -- --emit asm
```
I think I forgot to mention in the OP that this happens in both debug and release mode.
Uh-oh... should this be labelled "unsound"? Any time codegen'd behavior does not match the spec, that can be considered a soundness issue as well.
I just found out that this can be reproduced without RUSTFLAGS="-C target-cpu=pentium" using --target i586-unknown-linux-gnu .
I was searching for open floating point issues related to #73328 and stumbled upon this one, which seems to have slipped through the cracks. Marking as "unsound" and "needs prioritization". I'm still not sure exactly what causes this.
cc @rust-lang/wg-prioritization
I believe this happens because the x87 FPU internally works on 80-bit numbers.
That format can represent magnitudes up to about 10^4932, which means the result of the multiplication is not infinity while it lives on the FP stack, so the comparisons against infinity fail.
The code that extracts the exponent, on the other hand, spills the value to a 64-bit f64 in memory, at which point it is rounded to positive infinity.
In a way, this is what is happening behind the scenes:
```rust
#[inline(never)]
fn get_num() -> f64 {
    let num: f64 = 1.0e30;
    // volatile is to avoid optimizations
    unsafe { std::ptr::read_volatile(&num) }
}

fn main() {
    let x = get_num();
    let y = get_num();
    let z = x * y;
    if z != f64::INFINITY && z != f64::NEG_INFINITY && !z.is_nan() {
        // f32 has an 8-bit exponent field, so the all-ones value is 0xFF
        let exp = ((z as f32).to_bits() >> 23) & 0xFF;
        assert!(exp != 0xFF);
        println!("is finite, exp = {}", exp);
    } else {
        println!("is not finite");
    }
}
```
Note that in this case the downcast as f32 is explicit.
Indeed. The underlying cause is clear. I wonder what we should do here, though? Does Rust currently guarantee that extended precision is not used for operations on f64? If so, this is technically a miscompilation. However, I don't know whether it's worth fixing. Maybe we should just document the status quo and move on?
ecstatic-morse added I-prioritize I-unsound boom labels
Shouldn't we rather tag https://github.com/rust-lang/rust/issues/73328 instead of tagging the 4 issues that are all instances of the same problem?
This is wholly distinct from #73328. There's no NaN anywhere in this program.
Oh, good point. I had missed that.
So this is truly an i686-only artifact caused by using the x87 instructions, and has nothing to do with LLVM's sloppiness around NaNs?
Could we argue that this is a bug in LLVM, or is there some statement in LLVM's LangRef that actually makes this a correct compilation of the LLVM IR rustc produces?
Not sure. The LLVM reference says:
The binary format of half, float, double, and fp128 correspond to the IEEE-754-2008 specifications for binary16, binary32, binary64, and binary128 respectively.
I don't have a copy of the IEEE spec, so I don't know if sharing a "binary format" implies that they must not use a higher precision for intermediate results. I think that the IEEE-754 may explicitly allow for this, however? In any case, it's unlikely that LLVM will guarantee the same precision everywhere as long as x87 instructions are supported. C has had the same issue for 20 years now. The GCC wiki discusses what would be required to make floating point math for double predictably 64-bit on x87. Presumably this kind of approach was also considered by LLVM and rejected.
As long as 32-bit x86 is a tier 1 target, I believe we won't be able to guarantee at the language level that floating point math is always performed at 64-bit precision for f64 (same goes for f32). Instead, we will have to say something like "the precision of intermediate floating point calculations is platform-defined but rustc guarantees that all tier 1 platforms besides i686 use the same precision as the floating-point type for all computations".
We could require SSE. (No idea if that is even remotely realistic, but it felt worth mentioning.)
The original SSE only supported single-precision floating point arithmetic. You need at least SSE2 for the semantics we want. I believe that all i686 processors have SSE but not all have SSE2. Not sure what we do by default here?
But yes, I think that would be fine.
FWIW, the target triples i686-* are a misnomer, they include SSE2 already.
See also: https://github.com/rust-lang/rfcs/pull/2686 (“Allow floating-point operations to provide extra precision than specified, as an optimization”)
@hanna-kruppe Ah, that's why the OP had to do RUSTFLAGS="-C target-cpu=pentium".
Can we declare non-SSE2-i686 "unsupported" to "fix" the problem here? I am not sure what that would actually mean in practice, maybe error or at least warn when SSE2 is disabled, or so. Without proper hardware support, it seems hard to do anything better, and if it's a warning people can still proceed with caution.
@comex there were many concerns in that RFC about enabling such optimizations by default (https://github.com/rust-lang/rfcs/pull/2686#discussion_r276418997, https://github.com/rust-lang/rfcs/pull/2686#discussion_r276421551). Also, would that RFC really lead to problems such as this? f64 is still a 64-bit type, so the assertion here should still hold, right? The issue arises because f64 is actually represented as 80 bits on x87, and thus there are more possible bit patterns than there should be?
Is this related to or a duplicate of https://github.com/rust-lang/rust/issues/73288? Both are about x87 floating point weirdness, from what I can tell.