256-bit_floating-point_format

Octuple-precision floating-point format

256-bit computer number format

In computing, octuple precision is a binary floating-point-based computer number format that occupies 32 bytes (256 bits) in computer memory. This 256-bit octuple precision is for applications requiring results in higher than quadruple precision. This format is rarely (if ever) used and very few environments support it.

IEEE 754 octuple-precision binary floating-point format: binary256

In its 2008 revision, the IEEE 754 standard specifies a binary256 format among the interchange formats (it is not a basic format), as having:

Sign bit: 1 bit
Exponent width: 19 bits
Significand precision: 237 bits (236 explicitly stored)

The format is written with an implicit lead bit with value 1 unless the exponent is all zeros. Thus only 236 bits of the significand appear in the memory format, but the total precision is 237 bits (approximately 71 decimal digits: log₁₀(2²³⁷) ≈ 71.344). The bits are laid out as follows:

Octuple-precision examples

These examples are given in bit representation, in hexadecimal, of the floating-point value. This includes the sign, (biased) exponent, and significand.

0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000₁₆ = +0
8000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000₁₆ = −0

7fff f000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000₁₆ = +infinity
ffff f000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000₁₆ = −infinity

0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0001₁₆
= 2^−262142 × 2⁻²³⁶ = 2^−262378
≈ 2.24800708647703657297018614776265182597360918266100276294348974547709294462 × 10⁻⁷⁸⁹⁸⁴
  (smallest positive subnormal number)

0000 0fff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff₁₆
= 2^−262142 × (1 − 2⁻²³⁶)
≈ 2.4824279514643497882993282229138717236776877060796468692709532979137875392 × 10⁻⁷⁸⁹¹³
  (largest subnormal number)

0000 1000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000₁₆
= 2^−262142
≈ 2.48242795146434978829932822291387172367768770607964686927095329791378756168 × 10⁻⁷⁸⁹¹³
  (smallest positive normal number)

7fff efff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff₁₆
= 2²⁶²¹⁴³ × (2 − 2⁻²³⁶)
≈ 1.61132571748576047361957211845200501064402387454966951747637125049607182699 × 10⁷⁸⁹¹³
  (largest normal number)

3fff efff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff₁₆
= 1 − 2⁻²³⁷
≈ 0.999999999999999999999999999999999999999999999999999999999999999999999995472
  (largest number less than one)

3fff f000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000₁₆
= 1 (one)

3fff f000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0001₁₆
= 1 + 2⁻²³⁶
≈ 1.00000000000000000000000000000000000000000000000000000000000000000000000906
  (smallest number larger than one)

By default, 1/3 rounds down like double precision, because of the odd number of bits in the significand. So the bits beyond the rounding point are 0101... which is less than 1/2 of a unit in the last place.

Share this article:

This article uses material from the Wikipedia article 256-bit_floating-point_format, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

[Crandall-Papadopoulos_2002-1] [1]
Crandall, Richard E.; Papadopoulos, Jason S. (2002-05-08). "Octuple-precision floating point on Apple G4 (archived copy on web.archive.org)" (PDF). Archived from the original on 2006-07-28.{{cite web}}: CS1 maint: unfit URL (link) (8 pages)

[1]

Exponent	Significand zero	Significand non-zero	Equation
00000₁₆	0, −0	subnormal numbers	(-1)^signbit × 2^−262142 × 0.significandbits₂
00001₁₆, ..., 7FFFE₁₆	normalized value		(-1)^signbit × 2^{exponent bits₂} × 1.significandbits₂
7FFFF₁₆	±∞	NaN (quiet, signalling)

256-bit_floating-point_format

Octuple-precision floating-point format

IEEE 754 octuple-precision binary floating-point format: binary256

Exponent encoding

Octuple-precision examples

Implementations

Hardware support

See also

References

Further reading

Share this article: