Question: Consider a (binary) floating point system of the form $(1.s_1 s_2 s_3)_2 \times 2^m$ where

Consider a (binary) floating point system of the form $(1.s_1 s_2 s_3)_2 \times 2^m$, where $m \in [-63, 64]$. Calculate the relative error, with respect to the 2-norm, if we convert the vector $x \in \mathbb{R}^2$ to the given floating point system. When converting to floating point, first convert to a binary number, then truncate any additional bits.

$x = [3.5000000,\ 4.5000000]$

Note: We are interested in the representation accuracy of floating point, so assume that all operations performed (addition, subtraction, etc.) do not increase the error.

relative error (x) = number (rtol = 0.01, atol = 0.001)

Step by Step Solution
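
The posted solution is not visible here, so the following is only a sketch of one way to check the question as stated, not the expert's answer. It assumes the relative error is $\|x - \mathrm{fl}(x)\|_2 / \|x\|_2$ and that $m$ is an integer exponent in $[-63, 64]$. By hand, $3.5 = (11.1)_2 = (1.110)_2 \times 2^1$ and $4.5 = (100.1)_2 = (1.001)_2 \times 2^2$, so both entries fit in three fraction bits and nothing is truncated. The helper names `truncate_to_float` and `relative_error_2norm` are hypothetical, chosen for illustration.

```python
import math


def truncate_to_float(value, frac_bits=3, m_min=-63, m_max=64):
    """Chop value to the form (1.s1 s2 s3)_2 * 2^m by discarding extra bits.

    Assumption: m is an integer with m_min <= m <= m_max.
    """
    if value == 0:
        return 0.0  # zero has no normalized form; returned unchanged here
    sign = -1.0 if value < 0 else 1.0
    mag = abs(value)
    # frexp gives mag = f * 2**e with 0.5 <= f < 1, so the normalized
    # exponent for a leading 1 before the binary point is e - 1.
    _, e = math.frexp(mag)
    m = e - 1
    if not (m_min <= m <= m_max):
        raise OverflowError("exponent outside the assumed range")
    # Shift so the last kept fraction bit is the units bit, then truncate.
    scale = 2.0 ** (frac_bits - m)
    return sign * math.floor(mag * scale) / scale


def relative_error_2norm(x):
    """Return ||x - fl(x)||_2 / ||x||_2 for a list of floats."""
    fl = [truncate_to_float(v) for v in x]
    err = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, fl)))
    return err / math.sqrt(sum(a * a for a in x))


x = [3.5, 4.5]
print(relative_error_2norm(x))  # prints 0.0
```

For this particular vector the relative error is 0, since both entries are exactly representable. A value such as 3.3 would instead be chopped to 3.25, giving a nonzero error.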
