Floating Point

Floating point Notes

Uploaded by

Pawan Saini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views2 pages

Floating Point

Floating point Notes

Uploaded by

Pawan Saini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Floating Point Number Representation:

 When you have to represent very small or very large numbers, a fixed point representation will not do.
The accuracy will be lost.
 Therefore, you will have to look at floating-point representations, where the binary point is assumed to
be floating.
 When you consider a decimal number 12.34*107, this can also be treated as 0.1234*109, where 0.1234 is
the fixed-point mantissa.
 The other part represents the exponent value, and indicates that the actual position of the binary point is
9 positions to the right (left) of the indicated binary point in the fraction.
 Since the binary point can be moved to any position and the exponent value adjusted appropriately, it is
called a floating-point representation.
 By convention, you generally go in for a normalized representation, wherein the floating-point is placed
to the right of the first nonzero (significant) digit.
 The base need not be specified explicitly and the sign, the significant digits and the signed exponent
constitute the representation.
 The IEEE (Institute of Electrical and Electronics Engineers) has produced a standard for floating point
arithmetic.
 This standard specifies how single precision (32 bit) and double precision (64 bit) floating point
numbers are to be represented, as well as how arithmetic should be carried out on them.
 The IEEE single precision floating point standard representation requires a 32 bit word, which may be
represented as numbered from 0 to 31, left to right.
 The first bit is the sign bit, S, the next eight bits are the exponent bits, 'E', and the final 23 bits are the
fraction 'F'.
 Instead of the signed exponent E, the value stored is an unsigned integer E' = E + 127, called the excess-
127 format. Therefore, E' is in the range 0 < E' < 255.

S E'E'E'E'E'E'E'E' FFFFFFFFFFFFFFFFFFFFFFF
01 89 31

The value V represented by the word may be determined as follows:

• If E' = 255 and F is nonzero, then V = NaN ("Not a number")

• If E' = 255 and F is zero and S is 1, then V = -Infinity
• If E' = 255 and F is zero and S is 0, then V = Infinity
• If 0 < E< 255 then V = (-1)S*2 (E-127)
*(1.F) where "1.F" is intended to represent the binary number
created by prefixing F with an implicit leading 1 and a binary point.
 If E' 0 and F is nonzero, then V = (-1)S * 2 (-126) values.
• If E'= 0 and F is zero and S is 1, then V = -0
• If E' = 0 and F is zero and S is 0, then V = 0

For example:
0 00000000 00000000000000000000000 = 0
0 00000000 00000000000000000000000 = 0
1 00000000 00000000000000000000000 = -0
0 11111111 00000000000000000000000 = Infinity
1 11111111 00000000000000000000000 = -Infinity
0 11111111 00000100000000000000000 = NaN
1 11111111 00100010001001010101010 = NaN
0 10000000 00000000000000000000000= +1 * 2**(128-127) * 1.0 = 2
0 10000001 10100000000000000000000= +1*2**(129-127) * 1.101 = 6.5
1 10000001 10100000000000000000000= -1 * 2**(129-127) * 1.101 = -6.5
0 00000001 00000000000000000000000= +1 * *2**(1-127) * 1.0 = 2**(-126)
0 00000000 10000000000000000000000= +1*2**(-126) * 0.1 = 2**(-127)
0 00000000 00000000000000000000001 = +1*2**(-126)*
0.00000000000000000000001 = 2**(-149) (Smallest positive value)
(unnormalized values)

Double Precision Numbers:

 The IEEE double precision floating point standard representation requires a 64-bit word, which may be
represented as numbered from 0 to 63, left to right.
 The first bit is the sign bit, S, the next eleven bits are the excess-1023 exponent bits, E', and the final 52
bits are the fraction 'F':
S E'E'E'E'E'E'E'E'E'E'E' FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF
01 11 12 63

The value V represented by the word may be determined as follows:

• If E' = 2047 and F is nonzero, then V = NaN ("Not a number")
• If E'= 2047 and F is zero and S is 1, then V = -Infinity
• If E' = 2047 and F is zero and S is 0, then V = Infinity
• If 0 < E'< 2047 then V = (-1)**S* 2 ** (E-1023) * (1.F) where "1.F" is intended to represent
the binary number created by prefixing F with an implicit leading 1 and a binary point.
• If E'= 0 and F is nonzero, then V = (-1)**S* 2 ** (-1022)* (0.F) These are "unnormalized" values.
• If E' = 0 and F is zero and S is 1, then V = -0
• If E'= 0 and F is zero and S is 0, then V = 0

Maths 3
86% (7)
Maths 3
6 pages
Floating Point Arithmetic
100% (1)
Floating Point Arithmetic
30 pages
c10 Indices
100% (2)
c10 Indices
28 pages
The IEEE Standard For Floating Point Arithmetic
No ratings yet
The IEEE Standard For Floating Point Arithmetic
9 pages
Codigo 4221
0% (1)
Codigo 4221
3 pages
Math Basics for JEE Aspirants
100% (1)
Math Basics for JEE Aspirants
24 pages
Fixed and Floating Point Representation
No ratings yet
Fixed and Floating Point Representation
5 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
Data Representation
No ratings yet
Data Representation
28 pages
SMO Open 2022
No ratings yet
SMO Open 2022
6 pages
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
No ratings yet
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
51 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
Floating Point Arithmetic Guide
No ratings yet
Floating Point Arithmetic Guide
42 pages
9 Math
No ratings yet
9 Math
137 pages
COA - Unit2 Floating Point Arithmetic 3
No ratings yet
COA - Unit2 Floating Point Arithmetic 3
19 pages
LEC03 Data II
No ratings yet
LEC03 Data II
45 pages
Module 2 - PART D Floating
No ratings yet
Module 2 - PART D Floating
30 pages
Lec 06
No ratings yet
Lec 06
49 pages
Binary Number Representations
No ratings yet
Binary Number Representations
14 pages
Computer Architecture Basics
No ratings yet
Computer Architecture Basics
64 pages
Week8 Slides
No ratings yet
Week8 Slides
43 pages
AoPS ComplexNumbers
No ratings yet
AoPS ComplexNumbers
6 pages
Floating Point Numbers: CS031 September 12, 2011
No ratings yet
Floating Point Numbers: CS031 September 12, 2011
22 pages
Computer Architecture: Data Types
No ratings yet
Computer Architecture: Data Types
25 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
8 pages
CC102 - Lesson 4 Bsit - PPT Operators
No ratings yet
CC102 - Lesson 4 Bsit - PPT Operators
28 pages
Cacc
No ratings yet
Cacc
106 pages
ARML 2023 Contest
100% (1)
ARML 2023 Contest
35 pages
1 5 Floating Point Representation
No ratings yet
1 5 Floating Point Representation
9 pages
HP 50g - Excelente
100% (1)
HP 50g - Excelente
129 pages
Elipse Sheet
No ratings yet
Elipse Sheet
23 pages
Computer Arithmetic Basics
No ratings yet
Computer Arithmetic Basics
18 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
38 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
26 pages
Floating-Point Representation Guide
No ratings yet
Floating-Point Representation Guide
14 pages
Code Conversion
No ratings yet
Code Conversion
12 pages
Solu of Assignment 3
No ratings yet
Solu of Assignment 3
9 pages
Functions for Math Students
No ratings yet
Functions for Math Students
32 pages
Data Representation
No ratings yet
Data Representation
19 pages
Floating Point
No ratings yet
Floating Point
26 pages
Floating-Point Numbers and Operations Representation
No ratings yet
Floating-Point Numbers and Operations Representation
8 pages
Floating Points
No ratings yet
Floating Points
31 pages
Real Numbers and IEEE 754 Guide
No ratings yet
Real Numbers and IEEE 754 Guide
3 pages
Vedic Maths
No ratings yet
Vedic Maths
19 pages
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
No ratings yet
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
32 pages
Hello World 2015
No ratings yet
Hello World 2015
6 pages
Finite Word Length Effects
No ratings yet
Finite Word Length Effects
31 pages
Lecture 4
No ratings yet
Lecture 4
21 pages
Lecture 4
No ratings yet
Lecture 4
21 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
2 pages
What Are Floating Point Numbers?
No ratings yet
What Are Floating Point Numbers?
7 pages
Number Representation
No ratings yet
Number Representation
7 pages
RIPMWC 2019 Round 2 Junior
No ratings yet
RIPMWC 2019 Round 2 Junior
9 pages
Percentage Theory Lecture Notes
100% (1)
Percentage Theory Lecture Notes
2 pages
Fractions
No ratings yet
Fractions
4 pages
Floating Point Representation: Reading: B&O 2.4
No ratings yet
Floating Point Representation: Reading: B&O 2.4
44 pages
Architetture Dei Calcolatori 2425 079 092
No ratings yet
Architetture Dei Calcolatori 2425 079 092
14 pages
CH03 Data II
No ratings yet
CH03 Data II
31 pages
Lab 7
No ratings yet
Lab 7
11 pages
Floating Point 6up
No ratings yet
Floating Point 6up
7 pages
Floating-Point Numbers
No ratings yet
Floating-Point Numbers
23 pages
Test Bank For Graphical Approach To College Algebra 4th Edition by John Hornsby Lial Rockswold
No ratings yet
Test Bank For Graphical Approach To College Algebra 4th Edition by John Hornsby Lial Rockswold
15 pages
ENSC254 - Floating Point Computation
No ratings yet
ENSC254 - Floating Point Computation
29 pages
Chapter3 3
No ratings yet
Chapter3 3
13 pages
Floating Point & Fixed Point Representation - BCA II
No ratings yet
Floating Point & Fixed Point Representation - BCA II
24 pages
Unit 2
No ratings yet
Unit 2
16 pages
Adarsh School Math Quiz Rules
No ratings yet
Adarsh School Math Quiz Rules
43 pages
IEEE Standard 754 Floating Point Numbers
No ratings yet
IEEE Standard 754 Floating Point Numbers
7 pages
G7 Math Nat Reviewer
No ratings yet
G7 Math Nat Reviewer
6 pages
IEEE 754: Floating Point Guide
No ratings yet
IEEE 754: Floating Point Guide
10 pages
Ieee Tex
No ratings yet
Ieee Tex
4 pages
IM Progression FPD Connection
No ratings yet
IM Progression FPD Connection
6 pages
FIXED and FLOAT
No ratings yet
FIXED and FLOAT
8 pages
Identities, Equations and The Number System: Provided by Dse - Life
No ratings yet
Identities, Equations and The Number System: Provided by Dse - Life
7 pages
COA
No ratings yet
COA
14 pages
LL - MM Grade 6
No ratings yet
LL - MM Grade 6
2 pages
L2-Variables and Floating Point Number System
No ratings yet
L2-Variables and Floating Point Number System
38 pages
VG Logarithms
No ratings yet
VG Logarithms
9 pages
MCQ 4A Ch1 Number Systems
No ratings yet
MCQ 4A Ch1 Number Systems
2 pages
COA Module6 FloatingPoint
No ratings yet
COA Module6 FloatingPoint
17 pages
JPT-1 (12.01.25)
No ratings yet
JPT-1 (12.01.25)
31 pages
MTH 214 Accuracy in Numerical Calculations and Error Analysis
No ratings yet
MTH 214 Accuracy in Numerical Calculations and Error Analysis
18 pages
Double-Precision Floating-Point Format - Wikipedia
No ratings yet
Double-Precision Floating-Point Format - Wikipedia
8 pages
Introduction To 80×86 Assembly Language and Computer Architecture - Ebook PDF Version PDF Download
No ratings yet
Introduction To 80×86 Assembly Language and Computer Architecture - Ebook PDF Version PDF Download
57 pages
Module2.1 of Nothing
No ratings yet
Module2.1 of Nothing
7 pages
Ol Ahg9
No ratings yet
Ol Ahg9
7 pages
Floatinf Point
No ratings yet
Floatinf Point
11 pages
Lec 08
No ratings yet
Lec 08
36 pages
Ques. Chp.02-Logarithms, Indices
No ratings yet
Ques. Chp.02-Logarithms, Indices
4 pages

Floating Point

Uploaded by

Floating Point

Uploaded by

Floating Point Number Representation:

The value V represented by the word may be determined as follows:

• If E' = 255 and F is nonzero, then V = NaN ("Not a number")

Double Precision Numbers:

The value V represented by the word may be determined as follows:

You might also like