Double Precision float Question

November 21, 20178 yr

Explain this in detail I am a little hazy on the finer details of a double float.

Untitled.png.b391f9a78c0707598fcfe2ecdb93d9b7.png

How and why does it define these as the infinities.

Edited November 21, 20178 yr by Vmedvil

November 21, 20178 yr

15 minutes ago, Vmedvil said:

How and why does it define these as the infinities.

Arbitrary decision of IEEE float/double creators.

For example, you (programmer) can by yourself decide that -128 means -infinity and +127 means +infinity and numbers between them -127....+126 are normal numbers (when working with 8 bit signed integer). Then overload operators +, -, *, /, comparison to support custom made infinities.

Edited November 21, 20178 yr by Sensei

November 21, 20178 yr

11 minutes ago, Vmedvil said:

How does it define these as the infinities

Not sure what you are asking. The IEEE standard wanted a way to represent infinity and chose a couple of encodings for that purpose. As with the NaN values, they would represent valid numbers if they hadn't been reserved for these uses.

November 21, 20178 yr

Author

2 minutes ago, Strange said:

Not sure what you are asking. The IEEE standard wanted a way to represent infinity and chose a couple of encodings for that purpose. As with the NaN values, they would represent valid numbers if they hadn't been reserved for these uses.

3 minutes ago, Sensei said:

Arbitrary decision of IEEE float/double creators.

For example, you (programmer) can by yourself decide that -128 means -infinity and +127 means +infinity and numbers between them -127....+126 are normal numbers (when working with 8 bit signed integer). Then overload operators +, -, *, /, comparison to support custom made infinities.

Is there a triple float?

November 21, 20178 yr

The standard defines a quadruple precision, but I don't know if anyone implements it. There is also extended precision, which use 80 bits, that is quite widely supported.

November 21, 20178 yr

Author

25 minutes ago, Strange said:

The standard defines a quadruple precision, but I don't know if anyone implements it. There is also extended precision, which use 80 bits, that is quite widely supported.

Well, whatever I will let CS advance until then double float says infinity so accurate for an electron radius which is about half as close as needed being 10^-16 vs 10^-35 until Quad floats are used because it should be a number and not infinity which that number should be around, Volume =(4/3)(1/(t_pC)^2)^3 , which is 4.4704601196572883072076801920048 * 10^208

Edited November 21, 20178 yr by Vmedvil

November 21, 20178 yr

There are arbitrary precision libraries as well: https://en.wikipedia.org/wiki/List_of_arbitrary-precision_arithmetic_software

November 21, 20178 yr

You can make any bits length floating point number as you wish/need. It's often needed when making scientific application. Operations on regular floats/doubles introduce errors. Every operation, they accumulate together. So after a while error can be quite significant. Therefor scientists-programmers make their own floating point C++ classes.

typedef double SciFloat;
// work with SciFloat (instead of double directly) as long as you need in project...

// then when there is error caused by low precision:
class SciFloat
{
 // custom made float implementation..
 // or use 3rd party library made by somebody else already (could contain unknown errors, as usual)
 // put them in overloaded operators..
};

When storing on disk, or transferring through Internet etc. etc. you will have to write such custom object as string, and parse string during loading back, as it's not binary compatible with regular IEEE float/double.

Edited November 21, 20178 yr by Sensei

Sign In

Double Precision float Question

Featured Replies

Archived

Important Information

Account

Navigation

Search

Configure browser push notifications

Chrome (Android)

Chrome (Desktop)

Safari (iOS 16.4+)

Safari (macOS)

Edge (Android)

Edge (Desktop)

Firefox (Android)

Firefox (Desktop)