There's a bug that appeared with GCC and x87: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=323
If an intermediate result is not stored from the 80-bit FPU register into a 32-bit variable, the result can change, and it depends on the optimization options.
If we use SSE, the problem does not occur.

I'm compiling my program for the i686-pc-windows-gnu target.
Does the compiler use SSE instructions by default?
Can I expect the same formula to give the same results with the same inputs in different contexts?

That bug "appeared" a long time ago, in the year 2000, and was closed as "not a bug" as far as I can tell.

You should not expect precision in your results greater than what the IEEE 754 standard specifies. Some Intel floating-point units have 80-bit precision internally, so expect some small differences.

Also, that test checks two floating-point numbers for equality, which is generally a bad idea.
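To illustrate (a minimal sketch, not taken from the bug report itself): two expressions that are equal in real arithmetic can compare unequal after rounding, which is why exact equality tests are fragile:

```rust
fn main() {
    // 0.1 and 0.2 are not exactly representable in binary, so the sum
    // picks up rounding error and is not bit-identical to 0.3:
    let a = 0.1_f64 + 0.2;
    let b = 0.3_f64;
    assert_ne!(a, b);

    // Comparing against a tolerance is usually the safer pattern:
    assert!((a - b).abs() < 1e-12);
}
```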

I'm not talking about comparing numbers calculated by different formulas. 2.0.sqrt() * 2.0.sqrt() is not equal to 2.0 in floating-point arithmetic; I know that.
I'm talking about determinism. Is 2.0.sqrt() * 2.0.sqrt() equal to 2.0.sqrt() * 2.0.sqrt() or not?
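For what it's worth, the basic operations and sqrt are required by IEEE 754 to be correctly rounded, so on a target that evaluates f64 in actual 64-bit precision (e.g. x86-64, where SSE2 is the default) the same expression produces the same bits every time. A small sketch of that, assuming no x87 excess precision:

```rust
fn main() {
    // sqrt and * are correctly rounded, so this value is fully
    // determined by IEEE 754 on targets that evaluate f64 as 64-bit:
    let r = 2.0_f64.sqrt() * 2.0_f64.sqrt();
    assert_ne!(r, 2.0); // off by a rounding error from 2.0...
    assert_eq!(r, 2.0_f64.sqrt() * 2.0_f64.sqrt()); // ...but reproducibly so
}
```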

Even with SSE enabled, the x86 C calling convention states that floats have to be returned in an x87 register, not an SSE one. And libm functions like sin and sqrt will likely still use x87 internally.

fma(x, y, z) has to return the same thing no matter what: the closest number to x*y + z. ("Closest" depends on your rounding mode which as far as I know you can't even set in Rust.) But x86 has seemingly redundant fma instructions that differ in which NaN propagates to the result (i.e., if all three inputs are NaN, which input should the output copy?).

The three “redundant” instructions are not redundant at all: as with many RISC designs, you pick which of the three input operands the output overwrites.

And the trouble with fma is not that it's defined differently on different CPUs (it isn't), but that when you replace x * y + z with a single fma instruction you may get a different result.

It's, in some sense, “better” (there are numerous papers on how important fma is for many algorithms), but for the context of this thread, the important thing is that it's different.
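A quick way to see that difference (a sketch using Rust's f64::mul_add, which performs a single correctly rounded fused multiply-add whether or not the CPU has a dedicated fma instruction):

```rust
fn main() {
    // x is sqrt(2) rounded to f64; its exact square differs slightly from 2.
    let x = 2.0_f64.sqrt();

    // Two roundings: x * x is rounded to the nearest f64 first,
    // then the subtraction happens.
    let unfused = x * x - 2.0;

    // One rounding: the exact value of x*x - 2 is rounded once.
    let fused = x.mul_add(x, -2.0);

    // Same formula, different results:
    assert_ne!(fused, unfused);
}
```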

The only way to guarantee that floating point is 100% stable is to encode everything manually in assembler, I'm afraid.

And even then you have to remember which instructions produce different results on different CPUs.

That's why I wrote "seemingly." The difference between overwriting x or y in fma(x, y, z) has to do with NaN.

Is it legal for rustc to optimize the expression x*y + z to an fma? This seems bad for code that needs to recover the low bits of the product with fma(x, y, -x*y).

I would be pretty upset if fma(x, y, -x*y) got optimized to 0. But I would be happy with a multiplication operator that couldn't be fused with an addition.
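For reference, the idiom in question (sometimes called TwoProduct) recovers the rounding error of a product, and it only works if the plain multiplication is *not* fused with the surrounding fma. A sketch with made-up inputs:

```rust
fn main() {
    // p is the rounded product; err is the part that rounding discarded.
    // In exact arithmetic, x*y == p + err.
    let x = 2.0_f64.sqrt();
    let y = 3.0_f64.sqrt();
    let p = x * y;
    let err = x.mul_add(y, -p); // exact x*y - p, representable in one f64

    // The recovered error is bounded by the rounding of p. If the
    // compiler were allowed to fuse the x * y above, err would
    // silently collapse to 0, which is exactly the concern here.
    assert!(err.abs() <= p.abs() * f64::EPSILON);
}
```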