Question: Consider a ( binary ) floating point system of the form ( 1 . s 1 s 2 s 3 ) 2 2 m where
Consider a binary floating point system of the form
where min Calculate the relative
error, with respect to norm if we convert the vector to
the given floating point system. When converting to floating point,
first convert to a binary number then truncate any additional bits.
Note: We are interested in the representation accuracy of floating
point. So assume that all operations performed addition
subtraction, etc. do not increase the error.
relative error
number rtol atol
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
