Questions tagged [ieee-754]

IEEE 754 is the most common & widely used floating-point standard, notably the single-precision binary32 aka float and double-precision binary64 aka double formats.

IEEE 754 is the Institute of Electrical and Electronics Engineers standard for floating-point computation, and is the most common & widely used implementation thereof.

Wikipedia on IEEE 754 (2008)
ieee.org documentation
https://en.wikipedia.org/wiki/Single-precision_floating-point_format aka binary32, usually called float or real4. Nice diagrams of the bit-pattern, and range over which it can represent every integer exactly, and so on.
https://en.wikipedia.org/wiki/Double-precision_floating-point_format usually called double or real8
Algorithm to convert an IEEE 754 double to a string? including the recent Ryū: fast float-to-string conversion

As well as formats, IEEE754 also defines the basic operations, + - * / and sqrt, as producing correctly-rounded results (error <= 0.5ulp). Other functions like pow and sin are not required to be as accurate; that's an implementation choice between precision and performance.

This is why many CPU instruction sets only include the basic operations (including sqrt).

1447 questions

-5

votes

1 answer

Different pattern using the same float constant values leads to different results

in the following snippet of go code I struggle to understand why the results are different: func main() { a := -0.2; b := -0.1; fmt.Println(a+b) //Outputs expected float value with rounding error : -0.30000000000000004 c :=…

go floating-point ieee-754

asked Aug 13 '19 at 09:54

Lou-adrien

-5

votes

2 answers

float to embedded c float over UART

I am trying to send python float's over UART to an Embedded c processor, the MKE14 from NXP. In python I am using the Struct library to make a 32 bit float and send this over UART. I checked both float impelementations and there both "IEEE-754". I…

python c struct floating-point ieee-754

asked Jul 04 '18 at 13:50

R Coppens

-5

votes

2 answers

I knew that 0.1D+0.2D==0.30000000000000004D - why 0.1D+0.1D==0.2D?

Neither 0.1d, 0.2d or 0.3d can be represented exactly in binary. Why is 0.1D + 0.1D == 0.2D true in Java? the 64-bit binary number is how to become 0.1,0.2,0.3?????Where is the code?

java floating-point ieee-754

asked Mar 22 '16 at 09:39

pppp

-6

votes

2 answers

a is a double, printf("%d", a); works differently in IA32 and IA32-64

Why does The following code work totally differently on IA-32 and x86-64? #include int main() { double a = 10; printf("a = %d\n", a); return 0; } On IA-32, the result is always 0. However, on x86-64 the result…

c memory printf ieee-754 stdio

asked Jun 13 '16 at 02:04

Martin Gao

-6

votes

2 answers

Problematic understanding of IEEE 754

First of all i woild like to point out that i am not native speaker and i really need some terms used more commonly. And the second thing i would like to mention is that i am not a math genious. I am really trying to understand everything about…

c floating-point ieee-754

asked Oct 18 '14 at 15:54

Genis

-7

votes

2 answers

Blatant floating point error in C++ program

I am assigning a double literal to a double variable. The variable's value gets truncated, otherwise I cannot understand why, for example the difference diff is 0.0. Sorry for the code duplication at setprecision but I am really pissed off. #include…

c++ floating-point ieee-754

asked May 30 '19 at 09:16

Emil Mocan

-11

votes

1 answer

What is IEEE-754?

MDN puts the isNaN() function in a nutshell: "is this value, when coerced to a numeric value, an IEEE-754 'Not A Number' value?" What is IEEE-754 ? P.S. I have read and researched quite a bit about isNaN , and have seen THIS, thread too. I just…

javascript ieee-754

asked Oct 09 '15 at 14:04

Alexander Solonik

9,838
18
76
174

Prev 1 2 3

…