Number Systems and
Computer Arithmetic
Counting to four billion
two fingers at a time
CSE 141, S2'06 Jeff Brown
What do all those bits mean now?
bits (011011011100010 ....01)
data
instruction
number text chars ..............
R-format I-format ...
integer floating point
signed unsigned single precision double precision
... ... ... ...
CSE 141, S2'06 Jeff Brown
Questions About Numbers
• How do you represent
– negative numbers?
– fractions?
– really large numbers?
– really small numbers?
• How do you
– do arithmetic?
– identify errors (e.g. overflow)?
• What is an ALU and what does it look like?
– ALU=arithmetic logic unit
CSE 141, S2'06 Jeff Brown
Review: Binary Numbers
Consider a 4-bit binary number
Decimal Binary Decimal Binary
0 0000 4 0100
1 0001 5 0101
2 0010 6 0110
3 0011 7 0111
Examples of binary arithmetic:
3+2=5 3+3=6
1 1 1
0 0 1 1 0 0 1 1
+ 0 0 1 0 + 0 0 1 1
CSE 141, S2'06 Jeff Brown
Negative Numbers?
• We would like a number system that provides
– obvious representation of 0,1,2...
– uses adder for addition
– single value of 0
– equal coverage of positive and negative numbers
– easy detection of sign
– easy negation
CSE 141, S2'06 Jeff Brown
Some Alternatives
• Sign Magnitude -- MSB is sign bit, rest the same
-1 == 1001
-5 == 1101
• One’s complement -- flip all bits to negate
-1 == 1110
-5 == 1010
CSE 141, S2'06 Jeff Brown
Two’s Complement Representation
• 2’s complement representation of negative numbers
– Take the bitwise inverse and add 1
• Biggest 4-bit Binary Number: 7 Smallest 4-bit Binary Number: -8
Decimal Two’s Complement Binary
-8 1000
-7 1001
-6 1010
-5 1011
-4 1100
-3 1101
-2 1110
-1 1111
0 0000
1 0001
2 0010
3 0011
4 0100
5 0101
6 0110
7 0111
CSE 141, S2'06 Jeff Brown
Two’s Complement Arithmetic
Decimal 2’s Complement Binary Decimal 2’s Complement Binary
0 0000 -1 1111
1 0001 -2 1110
2 0010 -3 1101
3 0011 -4 1100
4 0100 -5 1011
5 0101 -6 1010
6 0110 -7 1001
7 0111 -8 1000
• Examples: 7 - 6 = 7 + (- 6) = 1 3 - 5 = 3 + (- 5) = -2
0 1 1 1 0 0 1 1
+ 1 0 1 0 + 1 0 1 1
CSE 141, S2'06 Jeff Brown
Some Things We Want To Know About Our
Number System
• negation
• sign extension
– +3 => 0011, 00000011, 0000000000000011
– -3 => 1101, 11111101, 1111111111111101
• overflow detection
0101 5
+ 0110 6
CSE 141, S2'06 Jeff Brown
Overflow Detection
0 0 1 0 1 1 0 0
0 0 1 0 2 1 1 0 0 -4
+ 0 0 1 1 3 + 1 1 1 0 -2
0 1 0 1 5 1 0 1 0 -6
0 1 1 1 1 0 1 0
0 1 1 1 7 1 1 0 0 -4
+ 0 0 1 1 3 + 1 0 1 1 -5
1 0 1 0 -6 0 1 1 1 7
So how do we detect overflow?
CSE 141, S2'06 Jeff Brown
Arithmetic -- The heart
Instruction of instruction execution
Fetch
Instruction
Decode
operation
Operand
Fetch
a
Execute 32 ALU
result
32
Result b
Store 32
Next
Instruction
CSE 141, S2'06 Jeff Brown
Designing an Arithmetic Logic Unit
ALUop
3
A
N
Zero
ALU
Result
N
Overflow
B
N
CarryOut
• ALU Control Lines (ALUop) Function
– 000 And
– 001 Or
– 010 Add
– 110 Subtract
– 111 Set-on-less-than
CSE 141, S2'06 Jeff Brown
A One Bit ALU
• This 1-bit ALU will perform AND, OR, and ADD
1 1 0 0
1 1 0 0 -4
+ 1 1 1 0 -2
1 0 1 0 -6
CSE 141, S2'06 Jeff Brown
A 32-bit ALU
1-bit ALU 32-bit ALU
CSE 141, S2'06 Jeff Brown
How About Subtraction?
• Keep in mind the following:
– (A - B) is the same as: A + (-B)
– 2’s Complement negate: Take the inverse of every bit and add 1
• Bit-wise inverse of B is !B:
– A - B = A + (-B) = A + (!B + 1) = A + !B + 1
1 0 1 0
1 1 0 0
+ 1 0 1 1
0 1 1 1
CSE 141, S2'06 Jeff Brown
Overflow Detection Logic
• Carry into MSB ! = Carry out of MSB
– For a N-bit ALU: Overflow = CarryIn[N - 1] XOR CarryOut[N - 1]
CarryIn0
A0 1-bit Result0 X Y X XOR Y
B0 ALU
CarryOut0 0 0 0
CarryIn1
A1 0 1 1
1-bit Result1
ALU 1 0 1
B1
CarryOut1 1 1 0
CarryIn2
A2 1-bit Result2
B2 ALU
CarryIn3 CarryOut2
A3 Overflow
1-bit Result3
B3 ALU
CarryOut3
CSE 141, S2'06 Jeff Brown
Zero Detection Logic
• Zero Detection Logic is just one BIG NOR gate
– Any non-zero input to the NOR gate will cause its output to be zero
CarryIn0
A0 1-bit Result0
B0 ALU
CarryIn1 CarryOut0
A1 1-bit Result1
B1 ALU
Zero
CarryIn2 CarryOut1
A2 1-bit Result2
B2 ALU
CarryIn3 CarryOut2
A3 1-bit Result3
B3 ALU
CarryOut3
CSE 141, S2'06 Jeff Brown
Set-on-less-than
• Do a subtract
• use sign bit
– route to bit 0 of result
– all other bits zero
CSE 141, S2'06 Jeff Brown
Full ALU give signals for:
neg oper
add?
sub?
and?
or?
beq?
slt?
sign bit (adder output from bit 31)
CSE 141, S2'06 Jeff Brown
The Disadvantage of Ripple Carry
• The adder we just built is called a “Ripple Carry Adder”
– The carry bit may have to propagate from LSB to MSB
– Worst case delay for an N-bit RC adder: 2N-gate delay
CarryIn0
A0 1-bit Result0
B0 ALU
CarryIn1 CarryOut0 CarryIn
A1 1-bit Result1 A
B1 ALU
CarryIn2 CarryOut1
A2 1-bit Result2
B2 ALU
CarryIn3 CarryOut2
A3 B CarryOut
1-bit Result3
B3 ALU
CarryOut3
Ripple carry adders are slow. Faster addition schemes are possible that accelerate the
movement of the carry from one end to the other.
MULTIPLY
• Paper and pencil example:
Multiplicand 1000
Multiplier x 1011
Product = ?
• m bits x n bits = m+n bit product
• Binary makes it easy:
– 0 => place 0 ( 0 x multiplicand)
– 1 => place multiplicand ( 1 x multiplicand)
• we’ll look at a couple of versions of multiplication hardware
CSE 141, S2'06 Jeff Brown
MULTIPLY HARDWARE
Version 1
• 64-bit Multiplicand reg, 64-bit ALU, 64-bit Product reg,
32-bit multiplier reg
CSE 141, S2'06 Jeff Brown
Multiply Algorithm
Version 1
Multiplier Multiplicand Product
0101 0000 0110 0000 0000
CSE 141, S2'06 Jeff Brown
Observations on Multiply
Version 1
• 1 clock per cycle => 100 clocks per multiply
– Ratio of multiply to add 100:1
• 1/2 bits in multiplicand always 0
=> 64-bit adder is wasted
• 0’s inserted in left of multiplicand as shifted
=> least significant bits of product never changed once formed
• Instead of shifting multiplicand to left, shift product to right?
• Wasted space (zeros) in product register exactly matches
meaningful bits of multiplier at all times. Combine?
CSE 141, S2'06 Jeff Brown
MULTIPLY HARDWARE
Version 2
• 32-bit Multiplicand reg, 32 -bit ALU, 64-bit Product reg,
(0-bit Multiplier reg)
Multiplicand Product
0110 0000 0101
0011 0010
0001 1001
0011 1100
0001 1110
CSE 141, S2'06 Jeff Brown
Observations on Multiply
Version 2
• 2 steps per bit because Multiplier & Product combined
• 32-bit adder
• MIPS registers Hi and Lo are left and right half of Product
• Gives us MIPS instruction MultU
• What about signed multiplication?
– easiest solution is to make both positive & remember whether to
complement product when done.
CSE 141, S2'06 Jeff Brown
Divide: Paper & Pencil
Quotient
Divisor 1000 1101010 Dividend
Remainder
• See how big a number can be subtracted, creating quotient bit on each step
– Binary => 1 * divisor or 0 * divisor
• Dividend = Quotient * Divisor + Remainder
CSE 141, S2'06 Jeff Brown
DIVIDE HARDWARE
Version 1
• 64-bit Divisor reg, 64-bit ALU, 64-bit Remainder reg,
32-bit Quotient reg
CSE 141, S2'06 Jeff Brown
Divide Algorithm Start
Version 1
• Takes n+1 steps for n-bit Quotient & Rem. 1. Subtract the Divisor register from the
Remainder register, and place the result in the
Quotient Divisor Remainder Remainder register.
0000 0011 0000 0000 0111
Remainder >= 0 Remainder < 0
Test Remainder
2b. Restore the original value by adding the
2a. Shift the Quotient register to the left Divisor register to the Remainder register, and
setting the new rightmost bit to 1. place the sum in the Remainder register. Also
shift the Quotient register to the left, setting the
new least significant bit to 0.
3. Shift the Divisor register right 1 bit.
No: < 33 repetitions
33rd repetition?
Yes: 33 repetitions
Done
CSE 141, S2'06 Jeff Brown
Divide Hardware Version 1
• Again, 64-bit adder is unnecessary.
• Quotient grows as remainder shrinks
CSE 141, S2'06 Jeff Brown
DIVIDE HARDWARE
Version 2
• 32-bit Divisor reg, 32 -bit ALU, 64-bit
Remainder reg,
(0-bit Quotient reg)
CSE 141, S2'06 Jeff Brown
Observations on Divide
Version 2
• Same Hardware as Multiply: just need ALU to add or subtract, and 63-bit
register to shift left or shift right
• Hi and Lo registers in MIPS combine to act as 64-bit register for multiply
and divide
• Signed Divides: Simplest is to remember signs, make positive, and
complement quotient and remainder if necessary
– Note: Dividend and Remainder must have same sign
– Note: Quotient negated if Divisor sign & Dividend sign disagree
CSE 141, S2'06 Jeff Brown
Key Points
• Instruction Set drives the ALU design
• ALU performance, CPU clock speed driven by adder delay
• Multiplication and division take much longer than addition,
requiring multiple addition steps.
CSE 141, S2'06 Jeff Brown
So Far
• Can do logical, add, subtract, multiply, divide, ...
• But........
– what about fractions?
– what about really large numbers?
CSE 141, S2'06 Jeff Brown
Binary Fractions
10112 = 1x23 + 0x22 + 1x21 + 1x20
so...
101.0112 = 1x22 + 0x21 + 1x20 + 0x2-1 + 1x2-2 + 1x2-3
e.g.,
.75 = 3/4 = 3/22 = 1/2 + 1/4 = .11
CSE 141, S2'06 Jeff Brown
Recall Scientific Notation
decimal point exponent
sign 23 -24
+6.02 x 10 1.673 x 10
Mantissa radix (base)
Issues:
° Arithmetic (+, -, *, / )
° Representation, Normal form
° Range and Precision
° Rounding
° Exceptions (e.g., divide by zero, overflow, underflow)
° Errors
° Properties ( negation, inversion, if A = B then A - B = 0 )
CSE 141, S2'06 Jeff Brown
Floating-Point Numbers
Representation of floating point numbers in IEEE 754 standard:
1 8 23
single precision sign S E M
exponent: mantissa:
excess 127 sign + magnitude, normalized
binary integer binary significand w/ hidden
integer bit: 1.M
(actual exponent is e = E - 127)
0 < E < 255
N = (-1)S 2E-127 (1.M)
0 = 0 00000000 0 . . . 0 -1.5 = 1 01111111 10 . . . 0
325 = 101000101 X 2 = 1.01000101 X 28
0
= 0 10000111 01000101000000000000000
.02 = .0011001101100... X 20 = 1.1001101100... X 2-3
= 0 01111100 1001101100...
•range of about 2 X 10-38 to 2 X 1038
•always normalized (so always leading 1, thus never shown)
•special representation of 0 (E = 00000000) (why?)
•can do integer compare for greater-than, sign
CSE 141, S2'06 Jeff Brown
What do you notice?
• 0 0 00000000 0000000000…
• 1.5 * 2-100 0 00011011 1000000000…
• 1.75 * 2-100 0 00011011 1100000000…
• 1.5 * 2100 0 11100011 1000000000…
• 1.75*2100 0 11100011 1100000000…
• Does this work with negative numbers, as well?
CSE 141, S2'06 Jeff Brown
Double Precision Floating Point
Representation of floating point numbers in IEEE 754 standard:
1 11 20 32
double precision sign S E M M
exponent: mantissa:
excess 1023 sign + magnitude, normalized
binary integer binary significand w/ hidden
integer bit: 1.M
actual exponent is e = E - 1023
0 < E < 2048
N = (-1)S 2E-1023 (1.M)
• 52 (+1) bit mantissa
• range of about 2 X 10-308 to 2 X 10308
CSE 141, S2'06 Jeff Brown
Floating Point Addition
• How do you add in scientific notation?
9.962 x 104 + 5.231 x 102
• Basic Algorithm
1. Align
2. Add
3. Normalize
4. Round
CSE 141, S2'06 Jeff Brown
FP Addition Hardware
CSE 141, S2'06 Jeff Brown
Floating Point Multiplication
• How do you multiply in scientific notation?
(9.9 x 104)(5.2 x 102) = 5.148 x 107
• Basic Algorithm
1. Add exponents
2. Multiply
3. Normalize
4. Round
5. Set Sign
CSE 141, S2'06 Jeff Brown
FP Accuracy
• Extremely important in scientific calculations
• Very tiny errors can accumulate over time
• IEEE 754 FP standard has four rounding modes
– always round up (toward +∞)
– always round down (toward -∞)
∞
– truncate
– round to nearest
=> in case of tie, round to nearest even
• Requires extra bits in intermediate representations
CSE 141, S2'06 Jeff Brown
Extra Bits for FP Accuracy
• Guard bits -- bits to the right of the least significant bit of
the significand computed for use in normalization (could
become significant at that point) and rounding.
• IEEE 754 has two extra bits and calls them guard and
round.
CSE 141, S2'06 Jeff Brown
When Keepin' it Real Goes Wrong
• Machine FP only approximates the real numbers
• Just calculating extra bits of precision is often enough
– but not always!
– knowing when, how to tell, and what to do about it can be subtle
– analysis involves algorithm, program implementation, and hardware
• http://www.cs.berkeley.edu/~wkahan/
– "How JAVA's Floating-Point Hurts Everyone Everywhere"
– "Roundoff Degrades an Idealized Cantilever"
– (and lots more)
CSE 141, S2'06 Jeff Brown
Key Points
• Floating Point extends the range of numbers that can be
represented, at the expense of precision (accuracy).
• FP operations are very similar to integer, but with pre- and
post-processing.
• Rounding implementation is critical to accuracy over time.
CSE 141, S2'06 Jeff Brown